Approximating Language Model Training Data from Weights Paper β’ 2506.15553 β’ Published 8 days ago β’ 1
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Paper β’ 2410.21845 β’ Published Oct 29, 2024 β’ 16
Chain-of-Thought Reasoning is a Policy Improvement Operator Paper β’ 2309.08589 β’ Published Sep 15, 2023 β’ 2
view article Article Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm By nvidia and 4 others β’ 14 days ago β’ 63
Layer by Layer: Uncovering Hidden Representations in Language Models Paper β’ 2502.02013 β’ Published Feb 4 β’ 2
Autonomous Improvement of Instruction Following Skills via Foundation Models Paper β’ 2407.20635 β’ Published Jul 30, 2024 β’ 1
RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning Paper β’ 2412.09858 β’ Published Dec 13, 2024 β’ 2
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning Paper β’ 2401.16013 β’ Published Jan 29, 2024 β’ 26
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents Paper β’ 2505.22954 β’ Published 28 days ago β’ 11
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper β’ 2506.01844 β’ Published 24 days ago β’ 100
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ about 1 month ago β’ 44