Performance evaluation of working with large binary files that have history.
Updated 1 year ago
A LangChain demo project for a PyGrunn talk.
Updated 12 months ago
Blog Authorship Corpus: over 600,000 posts from more than 19,000 bloggers. Obtained from Kaggle.
Updated 9 months ago
URL and caption metadata for the LAION-400M dataset: 400M English (image, text) pairs built for research purposes, to enable testing model training at larger scale for the broad research community and other interested communities.
Updated 9 months ago
A small LangChain demo project for question answering (QA) on movies.
Updated 9 months ago
Try Meta's Code Llama models on your laptop or cloud VM in seconds.
Updated 8 months ago
Falcon RefinedWeb is a massive English web dataset built by TII and released under an ODC-By 1.0 license.
Updated 8 months ago
Assembled from URLs hosted at https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T
Updated 8 months ago
A clone of the Visual Behavior Neuropixels dataset, collected over 153 sessions with 81 mice.
Updated 7 months ago
MyGPT Workshop: Build a ChatGPT For Your Own Data in One Hour
Updated 7 months ago
Mount and load the Llama 2 model and weights on XetHub in minutes.
Updated 7 months ago