Updated 1 year ago
Performance Evaluation for working with large binary files with history
Updated 1 year ago
Imported to XetHub from: https://github.com/onnx/models. A collection of pre-trained, state-of-the-art models in the ONNX format.
Updated 4 months ago
Updated 1 year ago
MyGPT Workshop: Build a ChatGPT For Your Own Data in One Hour
Updated 7 months ago
A langchain demo project for pygrunn talk.
Updated 12 months ago
Falcon RefinedWeb is a massive English web dataset built by TII and released under an ODC-By 1.0 license.
Updated 8 months ago
Assembled from URLs hosted at https://huggingface.co/datasets/togethercomputer/RedPajama-Data-1T
Updated 8 months ago
A clone of the Visual Behavior Neuropixels dataset, collected over 153 sessions with 81 mice.
Updated 7 months ago
Mount and load Llama 2 model and weights on XetHub in minutes.
Updated 7 months ago
URL and caption metadata for the LAION-400M dataset - 400M English (image, text) pairs built for research purposes to enable testing model training on larger scale for broad researcher and other interested communities.
Updated 9 months ago
Blog Authorship Corpus Over 600,000 posts from more than 19 thousand bloggers. Obtained from Kaggle.
Updated 9 months ago
19k+ players and 110 attributes extracted from the latest edition of FIFA. Obtained from Kaggle.
Updated 6 months ago
An app to visually summarize any CSV data files stored in the data folder.
Updated 1 month ago
Preserve generated Stable Diffusion images with comments and metadata. Duplicate this repository to store your generated images in your own account, and customize to use your own code, tokens, and endpoints.
Updated 4 months ago