Skywork/Skywork-Reward-V2-Llama-3.1-8B Text Classification • 8B • Updated about 1 month ago • 8.21k • 15
deepcogito/cogito-v2-preview-deepseek-671B-MoE Text Generation • 671B • Updated 6 days ago • 192 • 26
view article Article StackLLaMA: A hands-on guide to train LLaMA with RLHF By edbeeching and 6 others • Apr 5, 2023 • 42