Updated 3 weeks ago
Stream the Flickr30k image dataset on XetHub in seconds. Flickr30k is the benchmark for sentence-based image description, containing 31,000 images collected from Flickr alongside annotatations. Obtained from Kaggle.
Updated 3 weeks ago