michaelbenayoun/granite-tiny-4kv-heads-4layers-random Text Generation • Updated 7 days ago • 1.96k
michaelbenayoun/granite-tiny-4kv-heads-4layers-random Text Generation • Updated 7 days ago • 1.96k
michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random Text Generation • Updated 24 days ago • 29.3k
michaelbenayoun/llama-2-tiny-4kv-heads-16layers-random Text Generation • Updated 30 days ago • 5.5k
Running 2.72k 2.72k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters