Edit Models filters

Apps

Apps with no match

Inference Providers

Inference Providers with no match

HF Inference API

Misc

Inference Endpoints

text-generation-inference

8-bit precision

Mixture of Experts

Misc with no match

4-bit precision

text-embeddings-inference

Carbon Emissions

Models

582

Full-text search

Active filters: fp8

nm-testing/opt-125m-fp8-dynamic

Text Generation • Updated Apr 27, 2024 • 16

anyisalin/Meta-Llama-3-8B-Instruct-FP8

Text Generation • Updated May 6, 2024 • 13

anyisalin/Meta-Llama-3-8B-Instruct-FP8-D

Text Generation • Updated Apr 28, 2024 • 15

anyisalin/lzlv_70b_fp16_hf-FP8-D

Text Generation • Updated Apr 28, 2024 • 12

anyisalin/Meta-Llama-3-70B-Instruct-FP8-D

Text Generation • Updated Apr 28, 2024 • 17

anyisalin/Mixtral-8x7B-Instruct-v0.1-FP8-D

Text Generation • Updated Apr 28, 2024 • 19

nm-testing/llama-3-instruct-fp8-static-shared-scales

Text Generation • Updated Apr 28, 2024 • 49

nm-testing/llama-3-instruct-fp8-dynamic-shared-scales

Text Generation • Updated Apr 28, 2024 • 14

pcmoritz/Mixtral-8x7B-v0.1-fp8-act-scale

Text Generation • Updated May 2, 2024 • 82

anyisalin/Meta-Llama-3-70B-Instruct-FP8

Text Generation • Updated May 8, 2024 • 27

RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV

Text Generation • Updated Jun 19, 2024 • 5.32k • 8

comaniac/Meta-Llama-3-8B-Instruct-FP8-v1

Text Generation • Updated May 24, 2024 • 13

comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v1

Text Generation • Updated May 28, 2024 • 21

RedHatAI/Meta-Llama-3-70B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 55.7k • 13

comaniac/Meta-Llama-3-70B-Instruct-FP8-v1

Text Generation • Updated May 26, 2024 • 11

comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v1

Text Generation • Updated May 26, 2024 • 17

comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v2

Text Generation • Updated Jun 10, 2024 • 17

Skywork/Skywork-MoE-Base-FP8

Text Generation • Updated Jul 31, 2024 • 18 • 6

RedHatAI/Qwen2-72B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 2.41k • 15

comaniac/Meta-Llama-3-70B-Instruct-FP8-v2

Text Generation • Updated Jun 10, 2024 • 17

comaniac/Mixtral-8x7B-Instruct-v0.1-FP8-v3

Text Generation • Updated Jun 10, 2024 • 30

comaniac/Mixtral-8x22B-Instruct-v0.1-FP8-v2

Text Generation • Updated Jun 10, 2024 • 14

RedHatAI/Mixtral-8x22B-Instruct-v0.1-AutoFP8

Text Generation • Updated Aug 12, 2024 • 20 • 3

nm-testing/granite-20b-code-base-FP8

Text Generation • Updated Jun 12, 2024 • 17

nm-testing/granite-3b-code-base-FP8

Text Generation • Updated Jun 12, 2024 • 17

fr00000/dolp-fp8

Text Generation • Updated Jun 13, 2024 • 12

RedHatAI/Qwen2-0.5B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 2.18k • 3

nm-testing/opt-125m-fp8-static-kv

Text Generation • Updated Jun 14, 2024 • 14

RedHatAI/Qwen2-1.5B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 4.7k •

RedHatAI/Qwen2-7B-Instruct-FP8

Text Generation • Updated Jul 18, 2024 • 17.1k • 2