
GGUF support

#17
by geboh67859 - opened

GGUF format will make your great work accessible to more users!

The mainline llama.cpp PR is here: https://github.com/ggml-org/llama.cpp/pull/16831

I got @DevQuasar's Q8_0 quant working with the above PR; the command to run it is posted in the GGUF repo's discussions: https://huggingface.co/DevQuasar/MiniMaxAI.MiniMax-M2-GGUF/discussions/1

It seems to be working okay, though be mindful of the model's unique interleaved thinking tags in chat threads.
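For anyone who wants a starting point before checking the linked discussion, a minimal sketch of running a local GGUF with llama.cpp's CLI looks like this (the model path and prompt here are placeholders, not the exact command from the linked thread):

```shell
# Build llama.cpp from the PR branch first, then run interactively.
# --jinja applies the chat template embedded in the GGUF, which matters
# for MiniMax-M2's interleaved thinking-tag format.
./llama-cli \
  -m MiniMax-M2-Q8_0.gguf \   # placeholder path to the downloaded quant
  --jinja \
  -p "Hello" \
  -n 256
```

See the DevQuasar discussion linked above for the command actually verified to work with this quant.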
