Asankhaya Sharma PRO

codelion

http://asankhaya.github.io/

AI & ML interests

Creator of OptiLLM, OpenEvolve, Adaptive Classifier, and PTS. Pioneering a new category in AI infrastructure: inference-time compute for LLMs.

Recent Activity

liked a model 1 day ago

MemChainAI/adaptive-sentiment-classifier

new activity 1 day ago

google/frames-benchmark:Official evaluation code?

updated a model 2 days ago

codelion/gemma-3-1b-it-icm-sft-mlx-fp16

View all activity

Organizations

upvoted an article 6 days ago

Article

Adaptive Classifier: Dynamic Text Classification with Continuous Learning

•

6 days ago

• 11

upvoted a paper 6 days ago

ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs

Paper • 2506.15211 • Published 8 days ago • 31

upvoted a paper 8 days ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published 9 days ago • 35

upvoted a paper 14 days ago

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 76

upvoted a paper 15 days ago

Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques

Paper • 2506.08060 • Published 17 days ago • 7

upvoted 2 papers 16 days ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published 17 days ago • 228

Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

Paper • 2505.14135 • Published May 20 • 15

upvoted 2 papers 21 days ago

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Paper • 2406.10149 • Published Jun 14, 2024 • 53

ATLAS: Learning to Optimally Memorize the Context at Test Time

Paper • 2505.23735 • Published 28 days ago • 23

upvoted a paper 22 days ago

Thinker: Learning to Think Fast and Slow

Paper • 2505.21097 • Published 30 days ago • 11

upvoted 2 papers 23 days ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 24 days ago • 161

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published 27 days ago • 93

upvoted an article 24 days ago

Article

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

24 days ago

• 13

upvoted a paper 29 days ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 78

upvoted an article 30 days ago

Article

AutoThink: Adaptive Reasoning for Large Language Models

•

30 days ago

• 4

upvoted a paper about 1 month ago

Chain-of-Model Learning for Language Model

Paper • 2505.11820 • Published May 17 • 119

upvoted 2 articles about 1 month ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

•

May 20

• 26

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

•

May 17

• 5

upvoted a paper about 1 month ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 119

upvoted a collection about 1 month ago

Pivotal Token Search

Collection

Pivotal Token Search (PTS) identifies tokens in a language model's generation that significantly impact the probability of success • 9 items • Updated May 14 • 3

Asankhaya Sharma PRO

AI & ML interests

Recent Activity

Organizations

codelion's activity

Adaptive Classifier: Dynamic Text Classification with Continuous Learning

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

AutoThink: Adaptive Reasoning for Large Language Models

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training