Article: Sensitivity Aware Mixed Precision Quantization V1 • By badaoui and 1 other • Jun 13 • 19
Collection: SmolLM2 • State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 280
Article: Hugging Face on AMD Instinct MI300 GPU • By mfuntowicz and 3 others • May 21, 2024 • 15