2 33 51

Yassine Boukhari

Yasbok

AI & ML interests

NLP, Generative models, Reinforcement Learning

Recent Activity

upvoted an article 27 days ago

Building the Hugging Face MCP Server

upvoted an article 28 days ago

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

upvoted an article 28 days ago

Transformers backend integration in SGLang

View all activity

Organizations

upvoted an article 27 days ago

Article

Building the Hugging Face MCP Server

and 3 others •

27 days ago

• 57

upvoted 3 articles 28 days ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

and 3 others •

May 23

• 153

Article

Transformers backend integration in SGLang

and 4 others •

Jun 23

• 50

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

29 days ago

• 611

upvoted 2 articles about 2 months ago

Article

StarCoder: A State-of-the-Art LLM for Code

and 1 other •

May 4, 2023

• 62

Article

You could have designed state of the art positional encoding

•

Nov 25, 2024

• 331

upvoted a paper 3 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 37

upvoted an article 4 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

and 6 others •

Apr 5

• 146

upvoted a paper 5 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 154

upvoted 3 articles 6 months ago

Article

Open-source DeepResearch – Freeing our search agents

and 4 others •

Feb 4

• 1.28k

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 876

upvoted 2 articles 7 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

•

May 11, 2023

• 69

Article

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

and 1 other •

Jan 16

• 75

upvoted a paper 7 months ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 63

upvoted an article 8 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

•

Nov 28, 2024

• 140

upvoted 2 articles about 1 year ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

•

May 16, 2024

• 49

Article

🪆 Introduction to Matryoshka Embedding Models

and 2 others •

Feb 23, 2024

• 153

upvoted 2 articles over 1 year ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

•

Jun 3, 2024

• 47

Article

Mixture of Depth is Vibe

•

Apr 22, 2024

• 48

Yassine Boukhari

AI & ML interests

Recent Activity

Organizations

Yasbok's activity

Building the Hugging Face MCP Server

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

Transformers backend integration in SGLang

SmolLM3: smol, multilingual, long-context reasoner

StarCoder: A State-of-the-Art LLM for Code

You could have designed state of the art positional encoding

Welcome Llama 4 Maverick & Scout on Hugging Face!

Open-source DeepResearch – Freeing our search agents

Open-R1: Update #1

Open-R1: a fully open reproduction of DeepSeek-R1

Assisted Generation: a new direction toward low-latency text generation

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference

Use Models from the Hugging Face Hub in LM Studio

Unlocking Longer Generation with Key-Value Cache Quantization

🪆 Introduction to Matryoshka Embedding Models

Mergoo: Efficiently Build Your Own MoE LLM

Mixture of Depth is Vibe