Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Likes: 32 · Organization: Qwen (56.5k followers)
Tags: Text Generation · Transformers · Safetensors · qwen3_next · conversational · fp8
arXiv: 2309.00071, 2505.09388, 2501.15383
License: apache-2.0
Discussion #1 (opened about 2 months ago by kq · 3 upvotes · 1 comment):
ValueError: Detected some but not all shards of model.layers.0.linear_attn.in_proj are quantized. All shards of fused layers to have the same precision.