UnstableLlama
UnstableLlama
AI & ML interests
Local AI
Recent Activity
liked
a model
2 days ago
ArtusDev/aquif-ai_aquif-3.5-Max-42B-A3B-EXL3
liked
a model
3 days ago
zerofata/GLM-4.5-Iceblink-v2-106B-A12B
reacted
to
sergiopaniego's
post
with ๐ฅ
3 days ago
fine-tuning a 14B model with TRL + SFT on a free Colab (T4 GPU)?
thanks to the latest TRL optimizations, you actually can!
sharing a new notebook showing how to do it ๐
colab: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_trl_lora_qlora.ipynb
notebooks in TRL: https://github.com/huggingface/trl/tree/main/examples/notebooks
Organizations
None yet