2 12 59

Masoud Hashemi

masoudhashemi

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Deep Researcher with Test-Time Diffusion

liked a dataset 20 days ago

microsoft/rStar-Coder

upvoted an article 28 days ago

SmolLM3: smol, multilingual, long-context reasoner

View all activity

Organizations

upvoted a paper 8 days ago

Deep Researcher with Test-Time Diffusion

Paper • 2507.16075 • Published 15 days ago • 53

liked a dataset 20 days ago

microsoft/rStar-Coder

Viewer • Updated 17 days ago • 1.86M • 14.5k • 171

upvoted an article 28 days ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

29 days ago

• 611

upvoted a collection about 2 months ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Jul 3 • 110

liked a Space about 2 months ago

332

MiniMax M1

💬

Generate code snippets and web applications from text descriptions

liked a model 3 months ago

TIGER-Lab/general-verifier

Question Answering • 2B • Updated Apr 15 • 2.31k • 15

upvoted a collection 3 months ago

General-Reasoner

Collection

Advancing LLMs' general reasoning capabilities • 9 items • Updated Jun 25 • 5

liked 2 models 3 months ago

moonshotai/Kimi-VL-A3B-Thinking

Image-Text-to-Text • 16B • Updated Jun 27 • 97.6k • 433

a-m-team/AM-Thinking-v1

Text Generation • 33B • Updated May 14 • 1.76k • • 191

liked 2 datasets 3 months ago

Salesforce/xlam-function-calling-60k

Viewer • Updated Jan 24 • 60k • 4.4k • 493

joey00072/seeder_pico_thinking_function_calling

Viewer • Updated Apr 23 • 15 • 10 • 1

upvoted an article 3 months ago

Article

Selective fine-tuning of Language Models with Spectrum

•

Sep 3, 2024

• 36

liked a dataset 5 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8 • 3.91M • 8.33k • 550

upvoted an article 5 months ago

Article

Open R1: Update #3

and 9 others •

Mar 11

• 295

liked a Space 6 months ago

2.96k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 6 months ago

Article

The N Implementation Details of RLHF with PPO

and 2 others •

Oct 24, 2023

• 63

liked a Space 7 months ago

Optillm

💬

Chat with different models using various approaches

liked 2 Spaces 8 months ago

576

Scaling test-time compute

📈

Implement test-time compute scaling for math problems

106

Judge Arena

💻

Vote on AI responses to rank models

liked a Space 9 months ago

Open Persian LLM Leaderboard

🏅

Open Persian LLM Leaderboard

Masoud Hashemi

AI & ML interests

Recent Activity

Organizations

masoudhashemi's activity

SmolLM3: smol, multilingual, long-context reasoner

MiniMax M1

Selective fine-tuning of Language Models with Spectrum

Open R1: Update #3

The Ultra-Scale Playbook

The N Implementation Details of RLHF with PPO

Optillm

Scaling test-time compute

Judge Arena

Open Persian LLM Leaderboard