Qwen3-VLTO-32B-Instruct-NVFP4-256K / chat_template.jinja

Commit History

Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)
f64de72
verified

Ex0bit committed on