-
ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
Paper • 2510.12693 • Published • 26 -
Dr.LLM: Dynamic Layer Routing in LLMs
Paper • 2510.12773 • Published • 31 -
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI
Paper • 2510.05684 • Published • 137 -
BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities
Paper • 2510.08759 • Published • 46
Collections including paper arXiv:2510.12773
-
ExGRPO: Learning to Reason from Experience
Paper • 2510.02245 • Published • 78 -
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning
Paper • 2510.01132 • Published • 5 -
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper • 2510.04618 • Published • 118 -
MixReasoning: Switching Modes to Think
Paper • 2510.06052 • Published • 21
-
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 6 -
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 13 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 31 -
Self-Improving LLM Agents at Test-Time
Paper • 2510.07841 • Published • 9 -
Making Mathematical Reasoning Adaptive
Paper • 2510.04617 • Published • 22 -
DocReward: A Document Reward Model for Structuring and Stylizing
Paper • 2510.11391 • Published • 26
-
Dr.LLM: Dynamic Layer Routing in LLMs
Paper • 2510.12773 • Published • 31 -
Is Multilingual LLM Watermarking Truly Multilingual? A Simple Back-Translation Solution
Paper • 2510.18019 • Published • 17 -
Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers
Paper • 2506.15674 • Published • 2 -
C-SEO Bench: Does Conversational SEO Work?
Paper • 2506.11097 • Published • 2