Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
drwlf
's Collections
Robocop
Medra
Claria
Evaluation
MedImaging
Spaces
Psycho
Reasoning
Medical Data
Audiophile
Datasets
Evaluation
updated
Aug 27
Upvote
-
microsoft/MMLU-CF
Viewer
•
Updated
Jan 8
•
20.1k
•
1.14k
•
17
microsoft/Taskbench
Viewer
•
Updated
Aug 21, 2024
•
17.3k
•
420
•
32
AdaptLLM/biomed-VQA-benchmark
Viewer
•
Updated
Aug 21
•
10.2k
•
158
•
6
openai/healthbench
Preview
•
Updated
Aug 27
•
580
•
103
Upvote
-
Share collection
View history
Collection guide
Browse collections