Stream the Flickr30k image dataset on XetHub in seconds. Flickr30k is the benchmark for sentence-based image description, containing 31,000 images collected from Flickr alongside annotatations. Obtained from Kaggle.
Updated 2 days ago
Personalize a ChatGPT for your own documents.
Updated 1 week ago
Preserve generated Stable Diffusion images with comments and metadata. Duplicate this repository to store your generated images in your own account, and customize to use your own code, tokens, and endpoints.
Updated 1 week ago
An app to visually summarize any CSV data files stored in the data folder.
Updated 1 week ago
MyGPT Workshop: Build a ChatGPT For Your Own Data in One Hour
Updated 1 week ago
A clone of the Visual Behavior Neuropixels dataset, collected over 153 sessions with 81 mice.
Updated 1 week ago
Assembled from URLs hosted at https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T
Updated 2 weeks ago
Llama 2 model, weights, and more.
Updated 2 weeks ago
Falcon RefinedWeb is a massive English web dataset built by TII and released under an ODC-By 1.0 license.
Updated 2 weeks ago
Updated 4 weeks ago
Try Meta's Code Llama models on your laptop or cloud VM in seconds.
Updated 1 month ago
Updated 1 month ago
A small langchain demo project of a QA on movies
Updated 1 month ago
URL and caption metadata for the LAION-400M dataset - 400M English (image, text) pairs built for research purposes to enable testing model training on larger scale for broad researcher and other interested communities.
Updated 2 months ago
Blog Authorship Corpus Over 600,000 posts from more than 19 thousand bloggers. Obtained from Kaggle.
Updated 2 months ago