chat_template issue
#40 opened 15 days ago
by
Alex-Chan
Add assistant mask support to Qwen3-235B-A22B
🚀
1
#38 opened 21 days ago
by
waleko

system prompt suggestion
1
#37 opened 29 days ago
by
Shuaiqi
Publish to GitHub Marketplace
#36 opened about 1 month ago
by
typed-sigterm

The Qwen3-235B-A22B model is not as effective as the Qwen3-32B model.
🔥
1
#35 opened about 1 month ago
by
czqqq
Video?
2
#34 opened about 1 month ago
by
jujutechnology
关于Qwen3-235B-A22B在文学创作中“思维链”模式对输出风格影响的观察与建议
🧠
👍
8
#33 opened about 2 months ago
by
yxcl6874
In complex reasoning tasks Qwen3 is far behind QwQ
12
#32 opened about 2 months ago
by
AdamF92

I know not related but saw This Star Wars picture on Facebook and i thought it is code so trying to say Hi to Someone named John!
#31 opened about 2 months ago
by
ebearden

Not sure Clocks keep coming up and keep getting the run around might need time stamping confirmations?
#30 opened about 2 months ago
by
ebearden

Not Sure Orginal to Persian to Binary?
#29 opened about 2 months ago
by
ebearden

Qwen3 is simply amazing.
#28 opened about 2 months ago
by
Trilogix1

Upload b891b3c3bb6a146a8e809bb72a06d101.png
#27 opened about 2 months ago
by
Jalil16

Add image visual recognition output just like qwen 2.5 vl-32b instruct
6
#26 opened about 2 months ago
by
devopsML

English to French - French to English based on Meta & HuggingFace Chat Bot
#25 opened about 2 months ago
by
ebearden

Qwen3 幻觉太高了,比 Qwen 2.5 差太多了
➕
1
9
#24 opened about 2 months ago
by
hehua2008
Upload 3 files
#23 opened about 2 months ago
by
neuroQuantu

Upload 3 files
1
#22 opened about 2 months ago
by
neuroQuantu

Model keeps talking about Cumhurbaşkanlığı Sarayı when speaking Turkish
#21 opened about 2 months ago
by
aeminkocal
Qwen3 not Using Tools in Complex Prompts Unlike QwQ-32B
8
#20 opened about 2 months ago
by
Anaudia
Thanks a lot for this release
🔥
2
#19 opened about 2 months ago
by
Volko76

Does anyone feel Qwen3 often fails to follow instructions accurately?
🚀
7
7
#18 opened about 2 months ago
by
DOFOFFICIAL

Two of the base models are missing
➕
1
1
#17 opened about 2 months ago
by
ZhangRC

Qwen is loosing broad knowledge since Qwen2.
🔥
👍
11
14
#16 opened about 2 months ago
by
phil111
GPQA perf for DSV3-Base seems wrong
➕
3
1
#15 opened about 2 months ago
by
AChen-qaq
72B-MoE
👍
4
#13 opened about 2 months ago
by
avalonsec
235B会放出来Base模型吗?
➕
7
#12 opened about 2 months ago
by
Yantao2009
看模型介绍和模型结构里面没有关于vision encoder的部分,但是在qwen的在线模型服务界面可以用这个模型去看图片,想问下视觉部分是复用了哪个vision encoder呢?
5
#11 opened about 2 months ago
by
Chloez

有用4张H20实践过的大佬吗
2
#10 opened about 2 months ago
by
Edison0902
8张80G显存的8卡A100能部署不?
10
#9 opened about 2 months ago
by
Yuxin362
User rating and reviews of Qwen3 App and Qwen3 Model
#8 opened about 2 months ago
by
DeepNLP
是不是奖励函数没有ngram重复度惩罚
2
#7 opened about 2 months ago
by
wzx111
🚀[Fine-tuning] Qwen3-MoE Megatron Training Implementation and Best Practices👋
🚀
6
1
#6 opened about 2 months ago
by
study-hjt

【Evaluation】Best practice for evaluating Qwen3 !!
🚀
👍
4
#5 opened about 2 months ago
by
wangxingjun778

Please upload the base model for this one
👍
5
#4 opened about 2 months ago
by
mesh-ops

GPTQ/AWQ
👀
12
4
#3 opened about 2 months ago
by
ndurkee
Add languages tag
#2 opened about 2 months ago
by
de-francophones
