This is an MXFP4_MOE quantization of LLaDA-MoE-7B-A1B-Instruct-TD, an instruction-tuned model further optimized for accelerated inference via Trajectory Distillation.

A quant built with an importance matrix (imatrix) from mradermacher is also available.

Original model: https://huggingface.co/inclusionAI/LLaDA-MoE-7B-A1B-Instruct-TD
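
To try the quant locally, here is a minimal sketch using llama-cpp-python, assuming your llama.cpp / llama-cpp-python build supports the llada-moe architecture. The GGUF filename below is hypothetical; replace it with the actual file in this repository.

```python
# Minimal usage sketch (assumptions: the .gguf filename, and llada-moe
# support in your llama.cpp / llama-cpp-python build).
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Download the quantized weights from this repository.
model_path = hf_hub_download(
    repo_id="noctrex/LLaDA-MoE-7B-A1B-Instruct-TD-MXFP4_MOE-GGUF",
    filename="LLaDA-MoE-7B-A1B-Instruct-TD-MXFP4_MOE.gguf",  # hypothetical filename
)

# Load the model; n_gpu_layers=-1 offloads all layers to GPU if available.
llm = Llama(model_path=model_path, n_ctx=4096, n_gpu_layers=-1)

# Simple chat-style completion.
out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain trajectory distillation in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```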
