🧬 Carbon Collection Carbon 500M, 3B, 8B genomic models and GGUF variants for llama.cpp • 6 items • Updated 2 days ago • 28
view article Article Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law mishig • 12 days ago • 22
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 9 days ago • 29
view article Article Unlocking asynchronicity in continuous batching +1 ror, pcuenq, ariG23498 • 9 days ago • 54
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published Apr 21 • 29
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c • Feb 4 • 89
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 898
view article Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52
ObjectClear: Complete Object Removal via Object-Effect Attention Paper • 2505.22636 • Published May 28, 2025 • 5
MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator Paper • 2512.11782 • Published Dec 12, 2025 • 3
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 506
view article Article How to Use Multiple GPUs in Hugging Face Transformers: Device Map vs Tensor Parallelism ariG23498 • Feb 12 • 20
view article Article Introducing Daggr: Chain apps programmatically, inspect visually +3 merve, ysharma, abidlabs, hysts, pcuenq • Jan 29 • 107