Models: 582

Active filters: fp8

raja-nectar/Lumimaid-70B-FP8-OAS

Text Generation • Updated Jul 12, 2024 • 13

Ksgk-fy/maria-v2-fp8-dynamic

Text Generation • Updated Jul 12, 2024

Ksgk-fy/maria-v2-fp8-static

Text Generation • Updated Jul 12, 2024 • 38

Ksgk-fy/maria_v113-fp8-dynamic

Text Generation • Updated Jul 13, 2024

Ksgk-fy/maria_v114-fp8-dynamic

Text Generation • Updated Jul 13, 2024 • 13

Ksgk-fy/maria_v115-fp8-dynamic

Text Generation • Updated Jul 14, 2024 • 14

RedHatAI/Qwen2-57B-A14B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 2.31k • 1

nm-testing/Qwen2-1.5B-Instruct-FP8-K-V

Text Generation • Updated Jul 16, 2024 • 2.48k

nm-testing/Meta-Llama-3-8B-Instruct-FP8-K-V

Text Generation • Updated Oct 9, 2024 • 20

RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 19.7k • 7

RedHatAI/DeepSeek-Coder-V2-Lite-Base-FP8

Text Generation • Updated Jul 18, 2024 • 108

Rallio67/llama3-70b-exab-fp8

Text Generation • Updated Jul 18, 2024 • 15

mgoin/Mistral-Nemo-Instruct-2407-FP8-Dynamic

Text Generation • Updated Jul 18, 2024 • 160

mgoin/Mistral-Nemo-Instruct-2407-FP8-KV

Text Generation • Updated Jul 18, 2024 • 13

RedHatAI/Mistral-Nemo-Instruct-2407-FP8

Text Generation • Updated Jul 19, 2024 • 1.68k • 18

obamaTeo/llama-finetune-8bit-wiki-252-ver2

Text Generation • Updated Jul 18, 2024 • 17

FlorianJc/Mistral-Nemo-Instruct-2407-vllm-fp8

Text Generation • Updated Jul 31, 2024 • 65 • 8

darthhexx/Meta-Llama-3-8B-Instruct-FP8

Text Generation • Updated Jul 22, 2024 • 11

mgoin/Nemotron-4-340B-Instruct-FP8-Dynamic

Text Generation • Updated Jul 23, 2024 • 34

RedHatAI/DeepSeek-Coder-V2-Base-FP8

Text Generation • Updated Jul 22, 2024 • 42

RedHatAI/DeepSeek-Coder-V2-Instruct-FP8

Text Generation • Updated Jul 22, 2024 • 1.79k • 7

mgoin/Minitron-4B-Base-FP8

Text Generation • Updated Aug 16, 2024 • 20 • 3

mgoin/Minitron-8B-Base-FP8

Text Generation • Updated Jul 26, 2024 • 41 • 3

nm-testing/Qwen2-0.5B-Instruct-FP8-SkipQKV

Text Generation • Updated Jul 23, 2024 • 3.33k

RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 131k • 43

RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Text Generation • Updated 26 days ago • 43.1k • 4

RedHatAI/Meta-Llama-3.1-70B-Instruct-FP8-dynamic

Text Generation • Updated Oct 19, 2024 • 2.1k • 6

PrimeIntellect/Meta-Llama-3.1-8B-Instruct-FP8

Text Generation • Updated Jul 23, 2024 • 19

RedHatAI/Meta-Llama-3.1-70B-Instruct-FP8

Text Generation • Updated Mar 25 • 225k • 48

RedHatAI/Meta-Llama-3.1-405B-Instruct-FP8

Text Generation • Updated Oct 9, 2024 • 3.12k • 31