Updated 2 days ago
Stream the Flickr30k image dataset on XetHub in seconds. Flickr30k is the benchmark for sentence-based image description, containing 31,000 images collected from Flickr alongside annotatations. Obtained from Kaggle.
Updated 2 weeks ago
Updated 4 weeks ago
Updated 4 weeks ago
An app to visually summarize any CSV data files stored in the data folder.
Updated 1 month ago
Add custom views to your repository by following the instructions in this template.
Updated 2 months ago
Updated 2 months ago
Repo for running benchmarks against Git LFS, DVC, and LakeFS.
Updated 3 months ago
A database and collection of LLM results across models and questions.
Updated 3 months ago
Updated 3 months ago
Fine tune an LLM with everything tracked together.
Updated 3 months ago
Simplify the LLM finetuning workflow in Google Colab with XetHub!
Updated 3 months ago
Personalize a ChatGPT for your own documents.
Updated 4 months ago
Preserve generated Stable Diffusion images with comments and metadata. Duplicate this repository to store your generated images in your own account, and customize to use your own code, tokens, and endpoints.
Updated 4 months ago
Imported from https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T-Sample
Updated 4 months ago