

4 5

Lokendra Bairwa

lokendra77


AI & ML interests

None yet

Organizations

Collections 2

Reinforcement Learning

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 232

Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward

Paper • 2508.12800 • Published Aug 18, 2025 • 6

models 1

lokendra77/TinyClick-mlx

0.3B • Updated May 1, 2025

datasets 0

None public yet
