Commit 2725246 by AmirItachi (verified) · 1 Parent(s): b968826

Update README.md

Files changed (1): README.md +1 -1
README.md CHANGED
@@ -27,7 +27,7 @@ Qwen3 is the latest generation of large language models in Qwen series, offering
  **Qwen3-8B** has the following features:
  - Type: Causal Language Models
  - Training Stage: Pretraining & Post-training
- - Number of Parameters: 8.2B
+ - Number of Parameters: 8.3B
  - Number of Paramaters (Non-Embedding): 6.95B
  - Number of Layers: 36
  - Number of Attention Heads (GQA): 32 for Q and 8 for KV
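
For reviewers, here is a minimal sketch of the arithmetic implied by the README lines in this diff. It uses only the figures quoted there and assumes the total splits into exactly two buckets, embedding and non-embedding parameters; that split is an assumption of this sketch, not something the commit states. The function names are hypothetical.

```python
# Figures copied from the README lines in the diff, in millions of
# parameters so the arithmetic stays exact (no floating point).
NON_EMBEDDING_M = 6950   # "Non-Embedding: 6.95B"
TOTAL_BEFORE_M = 8200    # removed line: 8.2B
TOTAL_AFTER_M = 8300     # added line: 8.3B

# Attention head counts from the GQA line.
Q_HEADS = 32
KV_HEADS = 8


def implied_embedding_millions(total_m: int, non_embedding_m: int) -> int:
    """Embedding parameters implied by a total, ASSUMING
    total = embedding + non-embedding (an assumption of this sketch)."""
    return total_m - non_embedding_m


def queries_per_kv_head(q_heads: int, kv_heads: int) -> int:
    """Number of query heads that share one key/value head under GQA."""
    if q_heads % kv_heads != 0:
        raise ValueError("query heads must be a multiple of KV heads")
    return q_heads // kv_heads


embedding_before = implied_embedding_millions(TOTAL_BEFORE_M, NON_EMBEDDING_M)
embedding_after = implied_embedding_millions(TOTAL_AFTER_M, NON_EMBEDDING_M)
group_size = queries_per_kv_head(Q_HEADS, KV_HEADS)

print(f"implied embedding params before: {embedding_before}M")
print(f"implied embedding params after:  {embedding_after}M")
print(f"query heads per KV head: {group_size}")
```

Because the non-embedding figure is unchanged, the edit implicitly raises the embedding share by 100M parameters; checking that against the published model configuration would be one way to validate the new total.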