Analysing the RLHF pipeline
Russel
rshwndsz
·
AI & ML interests
Data Efficient Learning, Open-endedness, Alignment, AI Safety, Mechanical Interpretability
Recent Activity
updated
a collection
3 days ago
Janus
updated
a model
3 days ago
rshwndsz/gemma-3-4b-pt-SFT-DPO-si-v2
published
a model
3 days ago
rshwndsz/gemma-3-4b-pt-SFT-DPO-si-v2
Organizations
None yet