Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ulab-ai
's Collections
PersonalizedRouter
Multi-Agent Evolve
ResearchArcade
ResearchTown
Sotopia-RL
FusionFactory
IRanker
Router-R1
Time-R1
Sotopia-RL
updated
Aug 20
Sotopia-RL: Reward Design for Social Intelligence
Upvote
-
ulab-ai/sotopia-rl-qwen-2.5-7B-grpo
Text Generation
•
Updated
Aug 24
•
7
ulab-ai/sotopia-rl-reward-annotation
Viewer
•
Updated
Aug 7
•
7.57k
•
62
•
1
ulab-ai/sotopia-rl-qwen2.5-7B-rm
Feature Extraction
•
Updated
Aug 7
•
1
ulab-ai/sotopia-rl-qwen2.5-7b-sft
Updated
Aug 20
Upvote
-
Share collection
View history
Collection guide
Browse collections