A database and collection of LLM results across models and questions.


# LLM Evolution Tracking

To run, first set up a new Python environment:

```sh
python -m venv .venv/
. .venv/bin/activate
pip install -r requirements.txt
```

Then run:

```sh
./runner.py
```

The models used may be changed in `llm_evolution/models.py`.
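The exact contents of `llm_evolution/models.py` are not shown here; a minimal sketch of what a swappable model list might look like (the model identifiers and function name are hypothetical):

```python
# Hypothetical sketch of llm_evolution/models.py: the model identifiers
# the runner iterates over when collecting responses.
MODELS = [
    "gpt-3.5-turbo",
    "gpt-4",
]

def get_models():
    """Return the configured model identifiers as a fresh list."""
    return list(MODELS)
```

Swapping models then only requires editing the `MODELS` list, leaving the runner untouched.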

This will generate a JSON output file containing all the responses.
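The schema of the output file is not documented here; assuming it is a JSON list of response records, a quick sanity check might look like:

```python
import json

def count_responses(path):
    """Count response records in a runner output file.

    Assumes (hypothetically) that the file contains a JSON list,
    one entry per model/prompt response.
    """
    with open(path) as f:
        return len(json.load(f))
```

For structured inspection, `./dump_output.py` (below) is the intended tool.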

To process the new results, run:

```sh
./dump_output.py <output_json>
```

Run `./dump_output.py --help` to see other options.

