Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Ai2
Enterprise
non-profit
Verified
https://allenai.org/
allen_ai
allenai
Activity Feed
Follow
3,489
AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
oliverwm
new
activity
about 2 hours ago
allenai/ACE2-ERA5:
add-training-validation-files
yilunzhao
authored
a paper
7 days ago
SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification
yilunzhao
authored
a paper
8 days ago
Can LLMs Generate High-Quality Test Cases for Algorithm Problems? TestCase-Eval: A Systematic Evaluation of Fault Coverage and Exposure
View all activity
Articles
Introducing the Open Chain of Thought Leaderboard
Apr 23, 2024
•
35
Team members
194
+160
+147
+126
+116
+96
allenai
's datasets
242
Sort: Recently updated
allenai/omega-compositional
Viewer
•
Updated
1 day ago
•
14.3k
allenai/omega-explorative
Viewer
•
Updated
1 day ago
•
52.2k
•
4
allenai/omega-transformative
Viewer
•
Updated
1 day ago
•
7.2k
•
2
allenai/reward-bench-2-results
Preview
•
Updated
2 days ago
•
349
•
1
allenai/IF_sft_data_verified
Viewer
•
Updated
3 days ago
•
31.8k
•
27
•
3
allenai/IF_multi_constraints_upto5_no_lang
Viewer
•
Updated
3 days ago
•
95.4k
•
38
•
2
allenai/DataDecide-ppl-results
Viewer
•
Updated
8 days ago
•
22.7k
•
133
•
2
allenai/ruler_data
Updated
14 days ago
•
242
allenai/PRISM
Viewer
•
Updated
18 days ago
•
412k
•
387
•
2
allenai/SimpleToM-rich
Viewer
•
Updated
18 days ago
•
4.59k
•
276
•
1
allenai/reward-bench-2
Viewer
•
Updated
22 days ago
•
1.87k
•
1.63k
•
18
allenai/IF_multi_constraints_upto5
Viewer
•
Updated
22 days ago
•
95.4k
•
411
allenai/sciriff-yesno
Viewer
•
Updated
23 days ago
•
2.24k
•
468
allenai/blog-images
Viewer
•
Updated
24 days ago
•
2
•
26.6k
allenai/WildChat-4M-Full
Updated
26 days ago
•
63
allenai/WildChat-4M
Updated
26 days ago
•
63
•
1
allenai/qasper-yesno
Viewer
•
Updated
28 days ago
•
649
•
155
allenai/olmOCR-bench
Preview
•
Updated
May 23
•
715
•
23
allenai/olmOCR-pes2o-0225
Viewer
•
Updated
May 16
•
7.87M
•
313
•
1
allenai/discoverybench
Viewer
•
Updated
May 10
•
264
•
504
•
12
allenai/reward-bench-results
Updated
May 7
•
65.8k
•
2
allenai/DataDecide-data-recipes
Updated
May 6
•
1.74k
•
8
allenai/olmo-2-0425-1b-preference-mix
Viewer
•
Updated
Apr 30
•
378k
•
407
•
3
allenai/DataDecide-eval-results
Viewer
•
Updated
Apr 16
•
1.41M
•
109
•
4
allenai/sqa_reranking_eval
Viewer
•
Updated
Apr 15
•
2.43k
•
72
•
2
allenai/tulu-3-do-anything-now-eval
Viewer
•
Updated
Apr 11
•
300
•
102
•
1
allenai/tulu-3-harmbench-eval
Viewer
•
Updated
Apr 11
•
320
•
83
allenai/tulu-3-trustllm-jailbreaktrigger-eval
Viewer
•
Updated
Apr 11
•
400
•
101
allenai/big-reasoning-traces
Viewer
•
Updated
Apr 1
•
677k
•
219
•
5
allenai/super
Viewer
•
Updated
Mar 21
•
801
•
663
•
3
Previous
1
2
3
...
9
Next