Kimi-VL-A3B-Thinking-2506: A Quick Navigation • Article • By moonshotai and 1 other
No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL • Article • By toslali-ibm and 5 others
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning • Paper • arXiv:2506.04207
SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis • Paper • arXiv:2506.02096
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL • Paper • arXiv:2505.17952 • Published May 23
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models • Paper • arXiv:2504.10479 • Published Apr 14
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling • Paper • arXiv:2412.05271 • Published Dec 6, 2024
AttentionInfluence: Adopting Attention Head Influence for Weak-to-Strong Pretraining Data Selection • Paper • arXiv:2505.07293 • Published May 12
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly • Paper • arXiv:2505.10610 • Published May 15
PixMo • Collection • A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Apr 30
UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning • Paper • arXiv:2503.21620 • Published Mar 27
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM • Paper • arXiv:2503.14478 • Published Mar 18
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning • Paper • arXiv:2503.10291 • Published Mar 13
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Article • By ariG23498 and 3 others • Published Mar 12
YuE: Scaling Open Foundation Models for Long-Form Music Generation • Paper • arXiv:2503.08638 • Published Mar 11