RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16 Text Generation • 2B • Updated May 30 • 25.5k • 29
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8 Text Generation • 8B • Updated May 30 • 23.1k • 17
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • 71B • Updated May 30 • 971 • 14