Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models Paper • 2511.08577 • Published 11 days ago • 93
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 4 days ago • 176
Jan-v2-VL Collection Jan-v2-VL: an 8B VLM focused on reliable, many-step task execution. • 6 items • Updated 9 days ago • 34
Gaperon Collection Our French-English LLM suite (SFT models are coming soon) • 10 items • Updated 19 days ago • 14
Article We’re open-sourcing our text-to-image model and the process behind it • 10 days ago • 68
Article Building for an Open Future - our new partnership with Google Cloud • 10 days ago • 44
Audio dataset Collection N datasets showcase how to configure and load audio datasets • 11 items • Updated Aug 2, 2024 • 4
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Paper • 2511.02778 • Published 18 days ago • 100
Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training Paper • 2506.01732 • Published Jun 2 • 6
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built upon gpt-oss • 2 items • Updated 24 days ago • 56
Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning • 27 days ago • 67
Article Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models • Oct 20 • 19