Commit History

Upload from GitHub Actions: updated and cleaned up scripts for new eval runs
963cb78
verified

davidpomerenke commited on

Upload from GitHub Actions: Add Todos for using existing machine-translated datasets rather than our own ones
56adaa2
verified

davidpomerenke commited on

Upload from GitHub Actions: updated translation functions
8f5ce26
verified

davidpomerenke commited on

Upload from GitHub Actions: updated frontend and backend to fix bugs
4e8cb1a
verified

davidpomerenke commited on

Upload from GitHub Actions: Merge pull request #9 from datenlabor-bmz/jn-dev
7c06aef
verified

davidpomerenke commited on

Upload from GitHub Actions: Translate MMLU and evaluate
4c5c136
verified

davidpomerenke commited on

Upload from GitHub Actions: Fix vibecoding
75010c2
verified

davidpomerenke commited on

Fix dataset loading
c990cb9

David Pomerenke commited on

Fix import paths
c567aee

David Pomerenke commited on

Run on 40 languages, additional models
260c1a3

David Pomerenke commited on

Move functions for sharing them
55406ba

David Pomerenke commited on

Implement MMLU task
a683732

David Pomerenke commited on

MMLU data loader for 3 parallel datasets
47170a5

David Pomerenke commited on

Analyze MMLU datasets
031925d

David Pomerenke commited on