Knut Jägersberg's picture

Knut Jägersberg

KnutJaegersberg

·

jagersbergknut

AI & ML interests

NLP, opinion mining, narrative intelligence

Recent Activity

updated a model about 9 hours ago

KnutJaegersberg/gpt-oss-120b

liked a model about 12 hours ago

ggml-org/gpt-oss-120b-GGUF

liked a model about 12 hours ago

ggml-org/gpt-oss-20b-GGUF

View all activity

Organizations

upvoted a paper 5 days ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published 7 days ago • 59

upvoted a collection 5 days ago

Cogito v2 Preview

6 items • Updated 6 days ago • 17

upvoted an article 6 days ago

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

6 days ago

• 60

upvoted a collection 13 days ago

GLiCLass-V3

Models for zero-shot text classification that are up to 50 times faster than Cross-Encoders and show the same or higher accuracy. • 7 items • Updated 15 days ago • 13

upvoted an article 17 days ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

By

and 3 others •

19 days ago

• 47

upvoted a collection 22 days ago

EXAONE-4.0

EXAONE unified model series of 1.2B and 32B, integrating non-reasoning and reasoning modes. • 20 items • Updated 8 days ago • 43

upvoted a collection 23 days ago

MetaStone-S1

The open-source model of MetaStone-S1. • 4 items • Updated 7 days ago • 9

upvoted a collection 24 days ago

MLM vs CLM

65 items • Updated Jul 3 • 1

upvoted 2 collections 27 days ago

💧 LFM2

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 15 items • Updated 8 days ago • 83

ThinkPRM

Process Reward Models that Think -- https://arxiv.org/abs/2504.16828 • 8 items • Updated 7 days ago • 3

upvoted a collection 28 days ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 12 items • Updated about 22 hours ago • 68

upvoted an article 28 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

29 days ago

• 611

upvoted a collection 29 days ago

POLAR

5 items • Updated 28 days ago • 12

upvoted 2 papers about 1 month ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 58

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 51

upvoted 2 collections about 1 month ago

Reward Models

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 16 days ago • 19

Weaver

The models and datasets for Weaver: Shrinking the Generation-Verification Gap with Weak Verifiers • 21 items • Updated Jun 24 • 1

upvoted 2 papers about 2 months ago

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Paper • 2503.05139 • Published Mar 7 • 4

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 253

upvoted a paper 2 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 51