Hugging Face
Louis Castricato
LouisCastricato
6 followers · 8 following
https://louiscastricato.com
lcastricato
LouisCastricato
AI & ML interests
Storytelling
Recent Activity
Upvoted a paper, 1 day ago
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
Liked a dataset, 4 months ago
gaia-benchmark/GAIA
Upvoted a paper, 4 months ago
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
Articles
1
Article
286
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Papers
4
arxiv:2501.04682
arxiv:2407.17387
arxiv:2402.07896
arxiv:2311.03736
Models
1
LouisCastricato/StableBeluga2_fp16
Updated Oct 7, 2023
Datasets
0
None public yet