ivangabriele's picture
feat: initialize project
2f5127c verified

Notebooks

This directory contains a collection of Jupyter notebooks that demonstrate how to use the TRL library in different applications.

  • best_of_n.ipynb: This notebook demonstrates how to use the "Best of N" sampling strategy using TRL when fine-tuning your model with PPO.
  • gpt2-sentiment.ipynb: This notebook demonstrates how to reproduce the GPT2 imdb sentiment tuning example on a jupyter notebook.
  • gpt2-sentiment-control.ipynb: This notebook demonstrates how to reproduce the GPT2 sentiment control example on a jupyter notebook.