Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Edward Beeching's picture
277 8 15

Edward Beeching

edbeeching
badaoui's profile picture Capricorn35's profile picture samusenps's profile picture
·
https://edbeeching.github.io/
  • edbeeching

AI & ML interests

None yet

Organizations

Hugging Face's profile picture HuggingFaceBR4's profile picture trl internal testing's profile picture Jack of All Trades project's profile picture HuggingFaceM4's profile picture Simulation Environments Tests and Builds's profile picture TRL's profile picture BigCode's profile picture Hugging Face H4's profile picture ShapeNet's profile picture 🤗 H4 Community's profile picture Explorer of Simulate alpha's profile picture BigCode Data's profile picture Hugging Face H4 Community's profile picture Hugging Face Smol Models Research's profile picture Hugging Face Smol Cluster's profile picture Open LLM Leaderboard's profile picture H4-colab's profile picture HuggingFaceH4-colab's profile picture H4 Alignment Handbook's profile picture Godot RL Agents's profile picture Data Agents's profile picture nltpt's profile picture Reliable Agents's profile picture Hugging Face Science's profile picture HF CMU Collab's profile picture Open R1's profile picture

authored a paper 4 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 46
authored 2 papers about 1 year ago

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Paper • 2402.09844 • Published Feb 15, 2024 • 21

Godot Reinforcement Learning Agents

Paper • 2112.03636 • Published Dec 7, 2021 • 1
authored a paper over 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs