
In a Training Loop 🔄

Quentin Gallouédec PRO

qgallouedec

huggingface



Recent Activity

updated a bucket about 15 hours ago

hf-doc-build/doc-dev

updated a dataset about 15 hours ago

hf-doc-build/doc-build

updated a bucket about 15 hours ago

hf-doc-build/doc



upvoted a paper 2 days ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 8

upvoted a paper 3 days ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 135

upvoted a collection 5 days ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated 26 days ago • 24

upvoted an article 7 days ago

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Article • sergiopaniego, ariG23498 • 9 days ago • 91

upvoted a paper 11 days ago

Composer 2 Technical Report

Paper • 2603.24477 • Published Mar 25 • 18

upvoted an article 25 days ago

EMO: Pretraining mixture of experts for emergent modularity

Article • allenai • 25 days ago • 38

upvoted a paper 29 days ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 68

upvoted a changelog about 1 month ago

Spaces agents.md for your coding agents

Hugging Face Changelog • Apr 17 • 330

upvoted an article about 1 month ago

AI evals are becoming the new compute bottleneck

Article • evaleval • Apr 29 • 28

upvoted a collection about 1 month ago

Tiny Models for CI

A collection of tiny models of common model architectures. Useful for e2e smoke tests across real pretrained models to validate loss behavior. • 10 items • Updated Apr 22 • 1

upvoted an article 2 months ago

TRL v1.0: Post-Training Library Built to Move with the Field

Article • qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 53

upvoted a paper 2 months ago

Fine-Tuning Language Models from Human Preferences

Paper • 1909.08593 • Published Sep 18, 2019 • 4

upvoted a paper 3 months ago

Fewer Truncations Improve Language Modeling

Paper • 2404.10830 • Published Apr 16, 2024 • 5

upvoted 3 articles 3 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 159

Introducing Storage Buckets on the Hugging Face Hub

Article • Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner • Mar 10 • 195

Bringing Autonomous Driving RL to OpenEnv and TRL

Article • sergiopaniego • Feb 26 • 22

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.66k

upvoted 2 articles 3 months ago

Did GPT 5.2 make a breakthrough discovery in theoretical physics?

Article • dlouapre • Feb 19 • 62

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507

upvoted a paper 3 months ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 151