Drop-in implementation of https://github.com/shawntan/scattermoe for efficient training of Qwen 3 MoE.