wongyukim's picture

wongyukim

wongyukim

·

kimwongyuda

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

upvoted a paper 3 days ago

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

upvoted a paper 3 days ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

View all activity

Organizations

None yet

upvoted 3 papers 3 days ago

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

Paper • 2507.21509 • Published 8 days ago • 23

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

Paper • 2507.23682 • Published 6 days ago • 21

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published 5 days ago • 96

upvoted 4 papers 5 days ago

MetaCLIP 2: A Worldwide Scaling Recipe

Paper • 2507.22062 • Published 7 days ago • 22

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published 7 days ago • 59

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published 18 days ago • 22

X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again

Paper • 2507.22058 • Published 7 days ago • 36

upvoted 6 papers 7 days ago

Diversity-Enhanced Reasoning for Subjective Questions

Paper • 2507.20187 • Published 10 days ago • 22

Region-based Cluster Discrimination for Visual Representation Learning

Paper • 2507.20025 • Published 10 days ago • 17

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Paper • 2507.21049 • Published 8 days ago • 38

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Paper • 2507.21033 • Published 8 days ago • 20

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

Paper • 2507.20984 • Published 8 days ago • 51

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 11 days ago • 127

upvoted 4 papers 8 days ago

Specification Self-Correction: Mitigating In-Context Reward Hacking Through Test-Time Refinement

Paper • 2507.18742 • Published 12 days ago • 5

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published 11 days ago • 20

CLEAR: Error Analysis via LLM-as-a-Judge Made Easy

Paper • 2507.18392 • Published 13 days ago • 17

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published 15 days ago • 53

upvoted 2 papers 11 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 13 days ago • 267

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published 19 days ago • 118

upvoted a paper 12 days ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published 15 days ago • 65