
Alexey Gorbatovski

Myashka

AI & ML interests

NLP Alignment

Recent Activity

commented on a paper 14 days ago
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
new activity 26 days ago
agentica-org/DeepScaleR-Preview-Dataset:There are no answers for 6 samples
updated a model 2 months ago
Myashka/Qwen2.5-7B-UltraChat200K_EMA_SFT-Lr_3e_6-Alpha_0.01

Organizations

None yet

authored a paper 9 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 113
authored 3 papers over 1 year ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 89

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

Bayesian Networks for Named Entity Prediction in Programming Community Question Answering

Paper • 2302.13253 • Published Feb 26, 2023