Alexey Gorbatovski's picture

3 7

Alexey Gorbatovski

Myashka

·

Myashka

AI & ML interests

NLP Alignment

Recent Activity

commented on a paper 14 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

new activity 26 days ago

agentica-org/DeepScaleR-Preview-Dataset:There are no answers for 6 samples

updated a model 2 months ago

Myashka/Qwen2.5-7B-UltraChat200K_EMA_SFT-Lr_3e_6-Alpha_0.01

View all activity

Organizations

None yet

authored a paper 9 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 113

authored 3 papers over 1 year ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 89

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

Bayesian Networks for Named Entity Prediction in Programming Community Question Answering

Paper • 2302.13253 • Published Feb 26, 2023