RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8
Tags: Text Generation, Transformers, Safetensors, 8 languages, llama, int8, vllm, conversational, text-generation-inference, 8-bit precision, compressed-tensors
arXiv: 2210.17323
License: llama3.1
Files and versions
Branch: main
4 contributors
History: 23 commits
Latest commit: 0fe06b1 (verified) by robgreenberg3, "Update README.md", 26 days ago
| File | Scan | Size | Storage | Last commit message | Updated |
| --- | --- | --- | --- | --- | --- |
| .gitattributes | Safe | 1.52 kB | - | initial commit | 11 months ago |
| README.md | Safe | 20.9 kB | - | Update README.md | 26 days ago |
| config.json | Safe | 2.15 kB | - | Updated compression_config to quantization_config | 9 months ago |
| generation_config.json | Safe | 184 Bytes | - | Upload folder using huggingface_hub | 11 months ago |
| model-00001-of-00002.safetensors | Safe | 5 GB | xet | Upload folder using huggingface_hub | 11 months ago |
| model-00002-of-00002.safetensors | Safe | 4.08 GB | xet | Upload folder using huggingface_hub | 11 months ago |
| model.safetensors.index.json | Safe | 43.5 kB | - | Upload folder using huggingface_hub | 11 months ago |
| recipe.yaml | Safe | 173 Bytes | - | Upload folder using huggingface_hub | 11 months ago |
| special_tokens_map.json | Safe | 325 Bytes | - | Upload folder using huggingface_hub | 11 months ago |
| tokenizer.json | Safe | 9.09 MB | - | Upload tokenizer.json with huggingface_hub | 9 months ago |
| tokenizer_config.json | Safe | 55.4 kB | - | Upload tokenizer_config.json with huggingface_hub | 9 months ago |
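As a rough check on download size, the two weight shards in the listing above can be totalled. This is a minimal sketch, not part of the repository. It assumes the displayed sizes use decimal (SI) units and are rounded as shown, so the result is approximate. The helper `to_bytes` and the `UNITS` table are hypothetical names introduced here for illustration.

```python
from decimal import Decimal

# Decimal (SI) multipliers. Assumption: the listing reports sizes in
# powers of 1000 and rounds them, so byte counts here are approximate.
UNITS = {"Bytes": 1, "kB": 10**3, "MB": 10**6, "GB": 10**9}


def to_bytes(size: str) -> int:
    """Convert a size string such as '4.08 GB' to an approximate byte count."""
    value, unit = size.split()
    # Decimal avoids binary floating-point error when scaling '4.08'.
    return int(Decimal(value) * UNITS[unit])


# Sizes copied from the file listing.
shards = {
    "model-00001-of-00002.safetensors": "5 GB",
    "model-00002-of-00002.safetensors": "4.08 GB",
}

total_weight_bytes = sum(to_bytes(s) for s in shards.values())
print(f"Approximate weight download: {total_weight_bytes / 10**9:.2f} GB")
```

The remaining files (configuration, tokenizer, recipe) add only about 9 MB on top of the shards, most of it from tokenizer.json.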