Mohsen Dowlatshah
mones2222
AI & ML interests
None yet
Recent Activity
upvoted an article 5 days ago: Preference Tuning LLMs with Direct Preference Optimization Methods
upvoted an article 5 days ago: Illustrating Reinforcement Learning from Human Feedback (RLHF)
upvoted an article 5 days ago: SmolLM3: smol, multilingual, long-context reasoner
Organizations
None yet