SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper • 2506.01844 • Published 23 days ago • 100
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 235
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 98
BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model Paper • 2212.04960 • Published Dec 9, 2022 • 1
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning Paper • 2302.02662 • Published Feb 6, 2023 • 1
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation Paper • 2003.11963 • Published Mar 26, 2020
Learning from others' mistakes: Avoiding dataset biases without modeling them Paper • 2012.01300 • Published Dec 2, 2020
A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks Paper • 1811.06031 • Published Nov 14, 2018