grok4-gpqa-eval / benchmarks /mmlu_benchmark.py

Commit History

Upload 38 files
8474f02
Running
verified

TeddyYao commited on