2 1 12

Arvind Rajasekaran

arvindcr4

AI & ML interests

None yet

Recent Activity

liked a Space 19 days ago

AdithyaSK/rl-environments-guide

updated a Space about 1 month ago

arvindcr4/tinkerrl-bench-demo

published a Space about 1 month ago

arvindcr4/tinkerrl-bench-demo

View all activity

Organizations

None yet

liked a Space 19 days ago

The ultimate guide to RL environments: building and scaling them in the LLM era

📝

174

Building and scaling RL environments for LLM training

updated a Space about 1 month ago

TinkerRL-Bench Demo

🧪

TinkerRL-Bench reproducibility study (Group 6 viva)

published a Space about 1 month ago

TinkerRL-Bench Demo

🧪

TinkerRL-Bench reproducibility study (Group 6 viva)

updated a model about 1 month ago

arvindcr4/tinker-rl-w1_deepseek-v31-base-deepseek-v3.1-base-s42

Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w1_deepseek-v31-base-deepseek-v3.1-base-s42

Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-frontier_gsm8k_nemotron-120b-nemotron-120b

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-frontier_gsm8k_nemotron-120b-nemotron-120b

Reinforcement Learning • Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-frontier_gsm8k_deepseek-v3.1-deepseek-v3.1

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-frontier_gsm8k_deepseek-v3.1-deepseek-v3.1

Reinforcement Learning • Updated Apr 19

updated 2 models about 1 month ago

arvindcr4/tinker-rl-arch_gsm8k_kimi-k2-kimi-k2

Reinforcement Learning • Updated Apr 19

arvindcr4/tinker-rl-w2_qwen3-8b_g4-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g4-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g32-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g32-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g2-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g2-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g16-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w2_qwen3-8b_g16-qwen3-8b-s42

Reinforcement Learning • Updated Apr 19

updated a model about 1 month ago

arvindcr4/tinker-rl-w1_qwen3-8b-base-qwen3-8b-base-s42-run1

Reinforcement Learning • Updated Apr 19

published a model about 1 month ago

arvindcr4/tinker-rl-w1_qwen3-8b-base-qwen3-8b-base-s42-run1

Reinforcement Learning • Updated Apr 19

Arvind Rajasekaran

AI & ML interests

Recent Activity

Organizations

arvindcr4's activity

The ultimate guide to RL environments: building and scaling them in the LLM era

TinkerRL-Bench Demo

TinkerRL-Bench Demo