Updated 3 weeks ago
Updated 3 weeks ago
Updated 3 weeks ago
Updated 4 weeks ago
Having fun with ChatGPT4 to build an archive of IRS PDF documents. Curious about XetHub default deduplication over PDFs. 15+% feels pretty good!
Updated 1 month ago
Updated 1 month ago
Updated 1 month ago
Updated 1 month ago
A clone of the Flickr30k images repository, which has become a standard benchmark for sentence-based image description. See Kaggle for full details and updates.
Updated 2 months ago
A clone of the Laion 400M open dataset, an uncurated dataset to enable testing model training on larger scale for broad researcher and other interested communities.
Updated 2 months ago
Updated 2 months ago
Updated 2 months ago
Updated 2 months ago
Updated 2 months ago
Performance Evaluation for working with large binary files with history
Updated 2 months ago
Follow our Getting Started docs to help you make the most of this Tutorial repository.
Updated 2 months ago
Updated 2 months ago
Updated 2 months ago
Can we get GPT2 to play chess... as me?
Updated 2 months ago
Updated 2 months ago