Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
TeddyYao
/
grok4-gpqa-eval
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
8474f02
grok4-gpqa-eval
/
benchmarks
/
__pycache__
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
TeddyYao
Upload 38 files
8474f02
verified
about 1 month ago
__init__.cpython-312.pyc
1.12 kB
Upload 38 files
about 1 month ago
base_benchmark.cpython-312.pyc
5.45 kB
Upload 38 files
about 1 month ago
evaluation_utils.cpython-312.pyc
5.64 kB
Upload 38 files
about 1 month ago
gpqa_benchmark.cpython-312.pyc
4.45 kB
Upload 38 files
about 1 month ago
gsm8k_benchmark.cpython-312.pyc
4.91 kB
Upload 38 files
about 1 month ago
humaneval_benchmark.cpython-312.pyc
6.18 kB
Upload 38 files
about 1 month ago
math_benchmark.cpython-312.pyc
5.41 kB
Upload 38 files
about 1 month ago
mmlu_benchmark.cpython-312.pyc
5.7 kB
Upload 38 files
about 1 month ago
prompt_templates.cpython-312.pyc
4.39 kB
Upload 38 files
about 1 month ago