Blog, Articles, and discussions

Transformers backend integration in SGLang

By June 23, 2025 • 34

Community Articles

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

and 1 other •

Code a simple RAG from scratch

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series

and 1 other •

Adaptive Classifier: Dynamic Text Classification with Continuous Learning

Whose Voice Do We Hear When AI Speaks?

The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't)

Uncensor any LLM with abliteration

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Mastering Tensor Dimensions in Transformers

KV Caching Explained: Optimizing Transformer Inference Efficiency

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness

and 2 others •

Why Maybe We're Measuring LLM Compression Wrong

DO THEY SEE WHAT WE SEE?

Nano-vLLM meets Inference Endpoints

about 12 hours ago

The Large Language Model Course

Sensitivity Aware Mixed Precision Quantization V1

and 1 other •

The Common Pile v0.1

and 2 others •

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

nanoVLM: The simplest repository to train your VLM in pure PyTorch

By May 21, 2025 • 169

Microsoft and Hugging Face expand collaboration

By May 19, 2025 • 22

The Transformers Library: standardizing model definitions

By May 15, 2025 • 114

Improving Hugging Face Model Access for Kaggle Users

By May 14, 2025 • 29

Blazingly fast whisper transcriptions with Inference Endpoints

By May 13, 2025 • 69

Vision Language Models (Better, Faster, Stronger)

By May 12, 2025 • 458

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

By May 11, 2025 • 64

How to Build an MCP Server with Gradio

By April 30, 2025 • 174

Welcoming Llama Guard 4 on Hugging Face Hub

By April 29, 2025 • 38

The 4 Things Qwen-3's Chat Template Teaches Us

By April 30, 2025 • 57

Tiny Agents: a MCP-powered agent in 50 lines of code

By April 25, 2025 • 283

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

By April 29, 2025 • 33

17 Reasons Why Gradio Isn't Just Another UI Library

By April 16, 2025 • 40

Cohere on Hugging Face Inference Providers 🔥

By April 16, 2025 • 126

Community Articles

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

and 1 other •

Code a simple RAG from scratch

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series

and 1 other •

Adaptive Classifier: Dynamic Text Classification with Continuous Learning

Whose Voice Do We Hear When AI Speaks?

The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't)

Uncensor any LLM with abliteration

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Mastering Tensor Dimensions in Transformers

KV Caching Explained: Optimizing Transformer Inference Efficiency

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness

and 2 others •

Why Maybe We're Measuring LLM Compression Wrong

DO THEY SEE WHAT WE SEE?

Nano-vLLM meets Inference Endpoints

about 12 hours ago

The Large Language Model Course

Sensitivity Aware Mixed Precision Quantization V1

and 1 other •

The Common Pile v0.1

and 2 others •

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

and 4 others •

View all