Attention Is (not) All You Need for Commonsense Reasoning Paper • 1905.13497 • Published May 31, 2019 • 1
Mixture-of-experts VAEs can disregard variation in surjective multimodal data Paper • 2204.05229 • Published Apr 11, 2022 • 1
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky Paper • 2507.03336 • Published Jul 4 • 5