Running • 2.96k • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters
MaziyarPanahi/MixTAO-7Bx2-MoE-Instruct-v7.0-GGUF Text Generation • 13B • Updated Feb 4, 2024 • 61 • 9