Is there a plan for an FP8 version or a GGUF version that can be used in ComfyUI?
#1 opened by pymo
Thank you to the author for the open-source spirit; much respect. May I ask: are there plans for an FP8 version, or a GGUF version that can be used in ComfyUI?
I asked several AIs this and got different answers... In your experience, is f32 of a 4B model better than q8_0 of an 8B? Similarly, q4_K_M of the 32B vs. the 8B in f32? It may be a stupid question, but I can't get a straight answer out of anyone...
If long context and quality are the focus, choose the larger model's Q4 quant; if speed is the priority, choose the smaller model's f32.
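One way to ground this comparison is to estimate on-disk/in-memory size from parameter count and bits per weight. The sketch below uses commonly cited approximate bits-per-weight figures for llama.cpp-style quants (q8_0 ≈ 8.5 bpw, q4_K_M ≈ 4.85 bpw; exact values vary by model architecture), so treat the numbers as rough estimates, not exact file sizes.

```python
# Rough size estimate: params (in billions) * bits per weight / 8 -> GB.
# Bits-per-weight values below are approximate ggml/llama.cpp conventions,
# not exact figures for any specific model.
BITS_PER_WEIGHT = {
    "f32": 32.0,
    "f16": 16.0,
    "q8_0": 8.5,      # 8-bit weights + per-block f16 scale
    "q4_K_M": 4.85,   # commonly cited average for this mixed quant
}

def approx_size_gb(params_billions: float, quant: str) -> float:
    """Approximate model weight size in GB for a given quant type."""
    return params_billions * BITS_PER_WEIGHT[quant] / 8

for params, quant in [(4, "f32"), (8, "q8_0"), (8, "f32"), (32, "q4_K_M")]:
    print(f"{params}B {quant}: ~{approx_size_gb(params, quant):.1f} GB")
```

By this estimate, the 8B at q8_0 (~8.5 GB) is actually smaller than the 4B at f32 (~16 GB), and the 32B at q4_K_M (~19.4 GB) is well under the 8B at f32 (~32 GB), which is why the quantized larger model is usually the better trade when memory allows.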
Thanks!! That is actually helpful.