The Big Benchmarks Collection Collection Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 253
Article Building the Open Agent Ecosystem Together: Introducing OpenEnv 15 days ago • 122
Article Hugging Face and VirusTotal collaborate to strengthen AI security 16 days ago • 36
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM Paper • 2510.15870 • Published 21 days ago • 86
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning Paper • 2509.08755 • Published Sep 10 • 56
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 115
neutts-air Collection NeuTTS Air is a speech foundation model that runs on CPU in real-time, with instant voice cloning. • 3 items • Updated 29 days ago • 12
AgentRxiv: Towards Collaborative Autonomous Research Paper • 2503.18102 • Published Mar 23 • 25
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published Oct 2 • 92
SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights Paper • 2509.22944 • Published Sep 26 • 76
Quantile Advantage Estimation for Entropy-Safe Reasoning Paper • 2509.22611 • Published Sep 26 • 117
Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 4 items • Updated about 15 hours ago • 134
OpenReasoning-Nemotron Collection Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 1 day ago • 44
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data Paper • 2509.15221 • Published Sep 18 • 109