Qwen3-VLTO-32B-Instruct-NVFP4-256K / chat_template.jinja

Commit History

Upload NVFP4-quantized Qwen3-VLTO-32B with 256K context (YaRN RoPE scaling)

f64de72
verified

Ex0bit commited on 16 days ago