ulab-ai's Collections
PersonalizedRouter
Multi-Agent Evolve
ResearchArcade
ResearchTown
Sotopia-RL
FusionFactory
IRanker
Router-R1
Time-R1

Sotopia-RL

updated Aug 20

Sotopia-RL: Reward Design for Social Intelligence

  • ulab-ai/sotopia-rl-qwen-2.5-7B-grpo

    Text Generation • Updated Aug 24 • 7

  • ulab-ai/sotopia-rl-reward-annotation

    Viewer • Updated Aug 7 • 7.57k • 62 • 1

  • ulab-ai/sotopia-rl-qwen2.5-7B-rm

    Feature Extraction • Updated Aug 7 • 1

  • ulab-ai/sotopia-rl-qwen2.5-7b-sft

    Updated Aug 20