Update README.md
README.md
CHANGED
@@ -27,7 +27,7 @@ Qwen3 is the latest generation of large language models in Qwen series, offering
 **Qwen3-8B** has the following features:
 - Type: Causal Language Models
 - Training Stage: Pretraining & Post-training
-- Number of Parameters: 8.
+- Number of Parameters: 8.3B
 - Number of Paramaters (Non-Embedding): 6.95B
 - Number of Layers: 36
 - Number of Attention Heads (GQA): 32 for Q and 8 for KV
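
For readers unfamiliar with the GQA line in the feature list above, here is a minimal sketch of what "32 for Q and 8 for KV" implies. It assumes the common grouped-query attention convention in which consecutive query heads share one key/value head; the helper name `kv_head_for_query` is hypothetical and is not taken from the Qwen3 code.

```python
# Head counts as stated in the README feature list.
NUM_Q_HEADS = 32   # attention heads for Q
NUM_KV_HEADS = 8   # attention heads for KV

# Number of query heads that share a single KV head.
GROUP_SIZE = NUM_Q_HEADS // NUM_KV_HEADS


def kv_head_for_query(q_head: int) -> int:
    """Return the KV head index used by a query head.

    Hypothetical helper; assumes consecutive query heads are grouped
    onto the same KV head, which is an assumption, not a confirmed
    detail of the model's implementation.
    """
    if not 0 <= q_head < NUM_Q_HEADS:
        raise ValueError(f"q_head must be in [0, {NUM_Q_HEADS}), got {q_head}")
    return q_head // GROUP_SIZE


if __name__ == "__main__":
    print("query heads per KV head:", GROUP_SIZE)
    for q in (0, 3, 4, 31):
        print(f"query head {q} -> KV head {kv_head_for_query(q)}")
```

Under that assumption, each KV head serves a group of four query heads, so the KV cache holds a quarter as many heads as it would with one KV head per query head.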