XueZhang-bjtu's Collections

M-Thinker

updated Oct 14

Models for the paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)


  • XueZhang-bjtu/M-Thinker-7B-Iter2

    Text Generation • 8B • Updated Oct 14 • 4

  • XueZhang-bjtu/M-Thinker-1.5B-Iter2

    Text Generation • 2B • Updated Oct 14 • 8

  • XueZhang-bjtu/M-Thinker-7B-Iter1

    Text Generation • 8B • Updated Oct 14 • 106

  • XueZhang-bjtu/M-Thinker-1.5B-Iter1

    Text Generation • 2B • Updated Oct 14 • 7

  • XueZhang-bjtu/7B-cold-start-SFT

    Text Generation • 8B • Updated Oct 14 • 7

  • XueZhang-bjtu/1.5B-cold-start-SFT

    Text Generation • 2B • Updated Oct 14 • 5