Colab (#1)
* Created using Colaboratory
* Update README.md
* A few fixes
* Add base model
* Local files
* Remove pytorch model files
* A few tweaks
* Update V100 comment
* Update README.md
* Clears output
* Reorganize cells
* Constant length dataset
* Fix char token ratio sampling
* Change constant length train dataset generation mode
* Fine-tuning checkpoint for xetdata/pyxet fd7a21d
* Fine-tuned model trained on xetdata/pyxet@fd7a21d
* Update comments
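Several of these commits ("Constant length dataset", "Fix char token ratio sampling") refer to packing training text into fixed-length sequences, where an estimated characters-per-token ratio decides how many raw characters to buffer per packed sequence. A minimal sketch of that estimation, assuming a Hugging Face tokenizer and a dataset with a `content` text column (the column name and sample size are assumptions, not taken from this repo):

```python
def chars_per_token(dataset, tokenizer, sample_size=400):
    """Estimate the average number of characters per token over a small
    sample; a sequence packer uses this ratio to decide how many raw
    characters to buffer for each fixed-length training sequence."""
    total_chars, total_tokens = 0, 0
    for i, example in enumerate(dataset):
        if i >= sample_size:
            break
        text = example["content"]  # assumed text-column name
        total_chars += len(text)
        total_tokens += len(tokenizer(text).input_ids)
    return total_chars / total_tokens
```

In TRL-style fine-tuning scripts this ratio is passed to a packing dataset, e.g. `ConstantLengthDataset(tokenizer, train_data, dataset_text_field="content", seq_length=1024, chars_per_token=ratio)`; whether this notebook uses TRL's implementation or its own is not visible from the diff.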
parent: 3b0947ba24
commit: a5077fb282
60 changed files (17 B → 7.8 GiB)
CodeLlama-7b-hf/LICENSE (0 B → 6.9 KiB)
CodeLlama-7b-hf/README.md (0 B → 6.6 KiB)
CodeLlama-7b-hf/USE_POLICY.md (0 B → 4.7 KiB)
CodeLlama-7b-hf/config.json (0 B → 1.1 KiB)
CodeLlama-7b-hf/generation_config.json (0 B → 138 B)
CodeLlama-7b-hf/model-00001-of-00002.safetensors (0 B → 4.7 GiB)
CodeLlama-7b-hf/model-00002-of-00002.safetensors (0 B → 2.4 GiB)
CodeLlama-7b-hf/model.safetensors.index.json (0 B → 40 KiB)
CodeLlama-7b-hf/special_tokens_map.json (0 B → 411 B)
CodeLlama-7b-hf/tokenizer.json (0 B → 1.8 MiB)
CodeLlama-7b-hf/tokenizer.model (0 B → 488 KiB)
CodeLlama-7b-hf/tokenizer_config.json (0 B → 749 B)
README.md (17 B → 245 B)
code-llama/README.md (0 B → 5.0 KiB)
code-llama/adapter_config.json (0 B → 637 B)
code-llama/adapter_model.safetensors (0 B → 64 MiB)
code-llama/checkpoint-100/README.md (0 B → 5.0 KiB)
code-llama/checkpoint-100/adapter_config.json (0 B → 637 B)
code-llama/checkpoint-100/adapter_model.safetensors (0 B → 64 MiB)
code-llama/checkpoint-100/optimizer.pt (0 B → 128 MiB)
code-llama/checkpoint-100/rng_state.pth (0 B → 14 KiB)
code-llama/checkpoint-100/scheduler.pt (0 B → 1.0 KiB)
code-llama/checkpoint-100/trainer_state.json (0 B → 2.6 KiB)
code-llama/checkpoint-100/training_args.bin (0 B → 4.6 KiB)
code-llama/checkpoint-20/README.md (0 B → 5.0 KiB)
code-llama/checkpoint-20/adapter_config.json (0 B → 637 B)
code-llama/checkpoint-20/adapter_model.safetensors (0 B → 64 MiB)
code-llama/checkpoint-20/optimizer.pt (0 B → 128 MiB)
code-llama/checkpoint-20/rng_state.pth (0 B → 14 KiB)
code-llama/checkpoint-20/scheduler.pt (0 B → 1.0 KiB)
code-llama/checkpoint-20/trainer_state.json (0 B → 918 B)
code-llama/checkpoint-20/training_args.bin (0 B → 4.6 KiB)
code-llama/checkpoint-40/README.md (0 B → 5.0 KiB)
code-llama/checkpoint-40/adapter_config.json (0 B → 637 B)
code-llama/checkpoint-40/adapter_model.safetensors (0 B → 64 MiB)
code-llama/checkpoint-40/optimizer.pt (0 B → 128 MiB)
code-llama/checkpoint-40/rng_state.pth (0 B → 14 KiB)
code-llama/checkpoint-40/scheduler.pt (0 B → 1.0 KiB)
code-llama/checkpoint-40/trainer_state.json (0 B → 1.3 KiB)
code-llama/checkpoint-40/training_args.bin (0 B → 4.6 KiB)
code-llama/checkpoint-60/README.md (0 B → 5.0 KiB)
code-llama/checkpoint-60/adapter_config.json (0 B → 637 B)
code-llama/checkpoint-60/adapter_model.safetensors (0 B → 64 MiB)
code-llama/checkpoint-60/optimizer.pt (0 B → 128 MiB)
code-llama/checkpoint-60/rng_state.pth (0 B → 14 KiB)
code-llama/checkpoint-60/scheduler.pt (0 B → 1.0 KiB)
code-llama/checkpoint-60/trainer_state.json (0 B → 1.7 KiB)
code-llama/checkpoint-60/training_args.bin (0 B → 4.6 KiB)
code-llama/checkpoint-80/README.md (0 B → 5.0 KiB)
code-llama/checkpoint-80/adapter_config.json (0 B → 637 B)
Some files were not shown because too many files changed in this diff.
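The listing shows a complete local copy of the CodeLlama-7b-hf base weights plus a PEFT LoRA adapter (adapter_config.json / adapter_model.safetensors) under code-llama/. A minimal sketch of loading the two together, assuming standard transformers + peft usage; only the directory paths come from this commit, everything else is generic:

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the base weights from the local CodeLlama-7b-hf/ directory above.
base = AutoModelForCausalLM.from_pretrained(
    "CodeLlama-7b-hf",
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("CodeLlama-7b-hf")

# Attach the fine-tuned LoRA adapter committed under code-llama/.
model = PeftModel.from_pretrained(base, "code-llama")
model.eval()
```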
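The checkpoint-20 through checkpoint-100 directories follow the Hugging Face Trainer checkpoint layout (optimizer.pt, scheduler.pt, rng_state.pth, trainer_state.json, training_args.bin), so a run can be resumed from any of them. A hedged one-liner, assuming an already-configured Trainer instance named `trainer` (not shown in this diff):

```python
# Resume fine-tuning from the last saved step in this commit; the path
# matches the checkpoint directories listed above.
trainer.train(resume_from_checkpoint="code-llama/checkpoint-100")
```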