qingy2024
/

GRMR-V3-L1B-GGUF

Model card Files Files and versions

Quantized GGUF models for GRMR-V3-L1B

This repository contains GGUF quantized versions of qingy2024/GRMR-V3-L1B.

Available quantizations:

FP16 (full precision)
Q2_K
Q3_K_L
Q3_K_M
Q3_K_S
Q4_K_M
Q4_K_S
Q5_K_M
Q5_K_S
Q6_K
Q8_0

Original model

This is a quantized version of qingy2024/GRMR-V3-L1B.

Generated on

Wed Jun 4 17:35:11 UTC 2025

Downloads last month: 33

GGUF

Model size

1.24B params

Architecture

llama

Hardware compatibility

Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

View +1 variant

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for qingy2024/GRMR-V3-L1B-GGUF

Base model

meta-llama/Llama-3.2-1B

Finetuned

unsloth/Llama-3.2-1B

Finetuned

qingy2024/GRMR-V3-L1B

Quantized

(1)

this model

Collection including qingy2024/GRMR-V3-L1B-GGUF

GRMR V3 GGUFs

GGUF Quantized versions of the GRMR V3 Models • 6 items • Updated Jun 4 • 7