1 6 2

Junkai Zhang

JunkaiZ

AI & ML interests

None yet

Recent Activity

liked a dataset 15 days ago

MatSciBench/MatSciBench

upvoted a paper about 1 month ago

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

updated a dataset about 2 months ago

JunkaiZ/Rubrics

View all activity

Organizations

liked a dataset 15 days ago

MatSciBench/MatSciBench

Viewer • Updated Oct 14 • 1.34k • 61 • 1

upvoted a paper about 1 month ago

Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning

Paper • 2510.05251 • Published Oct 6 • 7

updated a dataset about 2 months ago

JunkaiZ/Rubrics

Viewer • Updated Sep 29 • 11.9k • 197

upvoted a paper about 2 months ago

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

Paper • 2509.21500 • Published Sep 25 • 18

commented a paper about 2 months ago

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

Paper • 2509.21500 • Published Sep 25 • 18 •

published a dataset about 2 months ago

JunkaiZ/Rubrics

Viewer • Updated Sep 29 • 11.9k • 197

updated a model 5 months ago

JunkaiZ/property_refinement_v2

3B • Updated Jun 20 • 3

published a model 5 months ago

JunkaiZ/property_refinement_v2

3B • Updated Jun 20 • 3

upvoted a paper 5 months ago

AR-RAG: Autoregressive Retrieval Augmentation for Image Generation

Paper • 2506.06962 • Published Jun 8 • 28

updated a model 5 months ago

JunkaiZ/property_refinement

3B • Updated Jun 14 • 4

published a model 5 months ago

JunkaiZ/property_refinement

3B • Updated Jun 14 • 4

updated a model 6 months ago

JunkaiZ/refinement_new

3B • Updated Jun 4 • 1

published a model 6 months ago

JunkaiZ/refinement_new

3B • Updated Jun 4 • 1

upvoted 2 papers 8 months ago

Entropy-Based Adaptive Weighting for Self-Training

Paper • 2503.23913 • Published Mar 31 • 3

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

Paper • 2503.17352 • Published Mar 21 • 24

liked a dataset 9 months ago

AbdulrhmanEldeeb/metallurgy-qa

Viewer • Updated Nov 29, 2024 • 5.21k • 55 • 3

authored a paper 9 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 23

upvoted a paper 9 months ago

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Paper • 2502.05163 • Published Feb 7 • 23

updated 2 models 12 months ago

JunkaiZ/refinement

8B • Updated Nov 27, 2024

JunkaiZ/refinement_v2

8B • Updated Nov 21, 2024 • 3

Junkai Zhang

AI & ML interests

Recent Activity

Organizations

JunkaiZ's activity