Cool way to fine tune that I wanted to share.
π
1
#9 opened 5 months ago
by
SuperbEmphasis
Model decent when running with 6 active experts
#8 opened 6 months ago
by
userzyzz
Another question: How did you train this model?
π
1
#7 opened 6 months ago
by
marcuscedricridia
This is the first Qwen3 A3B model that doesnt immediately start repeating itself
3
#2 opened 7 months ago
by
SuperbEmphasis
Feedback after some use
π
β€οΈ
3
6
#1 opened 7 months ago
by
AlecFoster