Ex0bit/Qwen3-VLTO-32B-Instruct-NVFP4-256K at main

20.7 GB

1 contributor

History: 4 commits

Latest commit by Ex0bit: Update README.md (a87dab3, verified), 10 days ago

File                              Size           Last commit                                                                  Updated
.gitattributes                    1.57 kB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
README.md                         5.43 kB        Update README.md                                                             10 days ago
added_tokens.json                 707 Bytes      Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
chat_template.jinja               4.17 kB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
config.json                       3.49 kB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
generation_config.json            214 Bytes      Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
hf_quant_config.json              267 Bytes      Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
merges.txt                        1.67 MB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model-00001-of-00005.safetensors  4.97 GB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model-00002-of-00005.safetensors  4.94 GB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model-00003-of-00005.safetensors  4.94 GB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model-00004-of-00005.safetensors  4.26 GB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model-00005-of-00005.safetensors  1.56 GB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
model.safetensors.index.json      176 kB         Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
quantization_metadata.json        1.06 kB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
special_tokens_map.json           613 Bytes      Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
tokenizer.json                    11.4 MB (xet)  Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
tokenizer_config.json             5.4 kB         Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago
vocab.json                        2.78 MB        Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)  11 days ago