XetHub

Fine tune an LLM with everything tracked together.

Updated 4 months ago

Add custom views to your repository by following the instructions in this template.

Updated 3 months ago

Updated 7 months ago

19k+ players and 110 attributes extracted from the latest edition of FIFA. Obtained from Kaggle.

Updated 6 months ago

A clone of the Visual Behavior Neuropixels dataset, collected over 153 sessions with 81 mice.

Updated 8 months ago

A small langchain demo project of a QA on movies

Updated 9 months ago

Falcon RefinedWeb is a massive English web dataset built by TII and released under an ODC-By 1.0 license.

Updated 8 months ago

Try Meta's Code Llama models on your laptop or cloud VM in seconds.

Updated 9 months ago

Updated 9 months ago

URL and caption metadata for the LAION-400M dataset - 400M English (image, text) pairs built for research purposes to enable testing model training on larger scale for broad researcher and other interested communities.

Updated 10 months ago

Blog Authorship Corpus Over 600,000 posts from more than 19 thousand bloggers. Obtained from Kaggle.

Updated 10 months ago

Performance Evaluation for working with large binary files with history

Updated 1 year ago

Simplify the LLM finetuning workflow in Google Colab with XetHub!

Updated 4 months ago

Updated 1 week ago

A database and collection of LLM results across models and questions.

Updated 3 months ago

People