Article • The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) • By fdaudens • 1 day ago • 9
Article • Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness • By jdelavande and 2 others • 14 days ago • 18
Article • Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. • By tiiuae and 9 others • May 15 • 35
Article • Reduce, Reuse, Recycle: Why Open Source is a Win for Sustainability • By sasha and 1 other • May 7 • 15
Article • Accelerating LLM Inference with TGI on Intel Gaudi • By baptistecolle and 4 others • Mar 28 • 13
An Investigation of FP8 Across Accelerators for LLM Inference Paper • 2502.01070 • Published Feb 3 • 3
Article • Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • By mfuntowicz and 1 other • Jan 16 • 75
Article • Organizing a Privacy-preserving Hackathon • By binoua and 1 other • Oct 17, 2024 • 9