Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

RedHatAI
/

quantization

Model card Files Files and versions Community

Ctrl+K

Ctrl+K

2 contributors

History: 50 commits

danieldk's picture

danieldk HF Staff

Build (x86_64)

b65f8ab 25 days ago

attention
Sync to vLLM 20250627 about 1 month ago
build
Build (x86_64) 25 days ago
compressed_tensors
Sync to vLLM 20250627 about 1 month ago
core
Sync to vLLM 20250627 about 1 month ago
cutlass_extensions
Sync to vLLM 20250627 about 1 month ago
cutlass_w8a8
Sync to vLLM 20250627 about 1 month ago
fp8
Sync to vLLM 20250627 about 1 month ago
gptq_marlin
Sync to vLLM 20250627 about 1 month ago
marlin
Sync to vLLM 20250627 about 1 month ago
tests
Sync to vLLM 20250627 about 1 month ago
torch-ext
Fix absolute imports 26 days ago
.gitattributes

1.56 kB

Build 8 months ago
LICENSE

11.4 kB

Add cutlass_w8a8 8 months ago
README.md

195 Bytes

Update README.md (#1) 5 months ago
build.toml

5.96 kB

Fix undefined symbol on CUDA 11.8 25 days ago
cuda_utils.h

1.41 kB

Sync on vLLM 20240402 4 months ago
dispatch_utils.h

3.9 kB

Sync to vLLM 20250627 about 1 month ago
flake.lock

4.5 kB

Fix absolute imports 26 days ago
flake.nix

352 Bytes

Fix absolute imports 26 days ago
utils.cuh

1.84 kB

Sync on vLLM 20240402 4 months ago
vectorization.cuh

878 Bytes

Sync to vLLM 20250627 about 1 month ago
vectorization_utils.cuh

2.61 kB

Sync to vLLM 20250627 about 1 month ago