trl-sandbox / docs /source /rewards.md
ivangabriele's picture
feat: initialize project
2f5127c verified

Reward Functions

This module contains some useful reward functions, primarily intended for use with the [GRPOTrainer].

Format rewards

think_format_reward

[[autodoc]] rewards.think_format_reward