Add files using upload-large-folder tool
README.md CHANGED
@@ -21,6 +21,8 @@ This model was quantized to 3-bit using DWQ with mlx-lm version **0.28.4**.
 | Relative KL reduction | ≈40 % |
 | Tokens processed | ≈1.09 M |
 
+<img src="minimax_3e-7.png" width="600" alt="Training loss curve">
+
 ## Use with mlx
 
 ```bash