RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w4a16
Tags: Text Generation, Transformers, Safetensors, 8 languages, llama, int4, vllm, conversational, text-generation-inference, 4-bit precision, gptq
License: llama3.1
Branch: main
5 contributors
History: 22 commits
Latest commit: 3ba651e (verified), "Update README.md" by robgreenberg3, 26 days ago
File                     Scan  Size           Last commit message                                Updated
.gitattributes           Safe  1.52 kB        initial commit                                     11 months ago
README.md                Safe  20.7 kB        Update README.md                                   26 days ago
config.json              Safe  1.26 kB        Upload folder using huggingface_hub                11 months ago
model.safetensors        Safe  5.74 GB (xet)  Upload folder using huggingface_hub                11 months ago
quantize_config.json     Safe  267 Bytes      Upload folder using huggingface_hub                11 months ago
special_tokens_map.json  Safe  296 Bytes      Upload folder using huggingface_hub                11 months ago
tokenizer.json           Safe  9.09 MB        Upload tokenizer.json with huggingface_hub         9 months ago
tokenizer_config.json    Safe  55.4 kB        Upload tokenizer_config.json with huggingface_hub  9 months ago