XetHub

Updated 1 year ago

Updated 1 year ago

Performance Evaluation for working with large binary files with history

Updated 1 year ago

A langchain demo project for pygrunn talk.

Updated 12 months ago

Blog Authorship Corpus Over 600,000 posts from more than 19 thousand bloggers. Obtained from Kaggle.

Updated 9 months ago

URL and caption metadata for the LAION-400M dataset - 400M English (image, text) pairs built for research purposes to enable testing model training on larger scale for broad researcher and other interested communities.

Updated 9 months ago

A small langchain demo project of a QA on movies

Updated 9 months ago

Updated 8 months ago

Try Meta's Code Llama models on your laptop or cloud VM in seconds.

Updated 8 months ago

Updated 8 months ago

Falcon RefinedWeb is a massive English web dataset built by TII and released under an ODC-By 1.0 license.

Updated 8 months ago

Assembled from URLs hosted at https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T

Updated 8 months ago

A clone of the Visual Behavior Neuropixels dataset, collected over 153 sessions with 81 mice.

Updated 7 months ago

MyGPT Workshop: Build a ChatGPT For Your Own Data in One Hour

Updated 7 months ago

Mount and load Llama 2 model and weights on XetHub in minutes.

Updated 7 months ago

People