Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 75 items • Updated 7 days ago • 175
view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 8 days ago • 138
LLM - GGUF Collection Text Generations Models in GGUF format, hand picked by Nexa Team. • 1 item • Updated 15 days ago • 2
Multimodal - GGUF Collection Language Models that takes vision input and/or audio input, hand picked by Nexa Team. • 2 items • Updated 15 days ago • 2
Multimodal - MLX Collection Language Models that takes vision input and/or audio input, hand picked by Nexa Team. • 6 items • Updated 15 days ago • 2
LLM - MLX Collection Text Generations Models in MLX format, hand picked by Nexa Team. • 4 items • Updated 15 days ago • 2
Seed-X Collection A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 6 items • Updated 8 days ago • 61
🧠 SmolLM3 Collection Smol, multilingual, long-context reasoner • 12 items • Updated about 22 hours ago • 68
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 29 days ago • 611
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation Paper • 2506.20639 • Published Jun 25 • 29
BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing Paper • 2506.17450 • Published Jun 20 • 62
view article Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • Jul 1 • 106
MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning Paper • 2506.22992 • Published Jun 28 • 12
Tower+: Bridging Generality and Translation Specialization in Multilingual LLMs Paper • 2506.17080 • Published Jun 20 • 4