AI & ML interests

The AI community building the future.

Recent Activity

Articles

angt 
posted an update about 9 hours ago
jeffboudier 
posted an update about 24 hours ago
view post
Post
182
AMD summer hackathons are here!
A chance to get hands-on with MI300X GPUs and accelerate models.
🇫🇷 Paris - Station F - July 5-6
🇮🇳 Mumbai - July 12-13
🇮🇳 Bengaluru - July 19-20

Hugging Face and GPU Mode will be on site and on July 6 in Paris @ror will share lessons learned while building new kernels to accelerate Llama 3.1 405B on ROCm

Register to Paris event: https://lu.ma/fmvdjmur?tk=KeAbiP
All dates: https://lu.ma/calendar/cal-3sxhD5FdxWsMDIz
AdinaY 
posted an update about 24 hours ago
AdinaY 
posted an update about 24 hours ago
view post
Post
151
MOSS-TTSD 🔊 Bilingual text-to-spoken dialogue model by Fudan University - Open MOSS team.

Model:
fnlp/MOSS-TTSD-v0
Demo:
fnlp/MOSS-TTSD

✨ Supports Chinese & English
✨ Zero-shot 2-speaker voice cloning
✨ Long-form generation (up to 960s)
✨ Built on Qwen 3
pagezyhf 
posted an update 1 day ago
view post
Post
1609
Hackathons in Paris on July 5th and 6th!

Hugging Face just wrapped 4 months of deep work with AMD to push kernel-level optimization on their MI300X GPUs. Now, it's time to share everything we learned.

Join us in Paris at STATION F for a hands-on weekend of workshops and a hackathon focused on making open-source LLMs faster and more efficient on AMD.

Prizes, amazing host speakers, ... if you want more details, navigate to https://lu.ma/fmvdjmur!
  • 2 replies
·
sergiopaniego 
posted an update 2 days ago
AdinaY 
posted an update 2 days ago
view post
Post
196
Skywork-SWE 🔥 New code agent model by Skywork 天工

Skywork/Skywork-SWE-32B

✨ 32B - Apache 2.0
✨ 38.0% pass@1 on SWE-bench Verified
✨ Up to 47.0% with test-time scaling
✨ Shows clear data scaling law (8K+ demos)
✨ Built on Qwen2.5-Coder-32B + OpenHands
giadap 
posted an update 5 days ago
view post
Post
1809
🗣️ Whose voice do we hear when AI speaks?

Every language carries its own cultural values and worldviews. So, when we build AI systems, we're not just deciding how they speak but also whose perspectives they represent.

Even choosing which dialect to train on in Norway becomes a question of inclusion and power. In Kenya, will AI speak Swahili from Nairobi or coastal regions? What about indigenous languages with rich oral traditions but limited written text, like Quechua in Peru or Cherokee in North America?

The path forward? Building WITH communities, not just FOR them. Working with local partners (libraries, universities, civil society), testing for cultural alignment, and asking hard questions about representation.

Just published some thoughts on this after my keynote in Norway a few weeks ago: https://huggingface.co/blog/giadap/when-ai-speaks
  • 1 reply
·
clem 
posted an update 6 days ago
multimodalart 
posted an update 6 days ago
view post
Post
3290
Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it 🐐

I've built a live real time demo on Spaces 📹💨

multimodalart/self-forcing
  • 2 replies
·
AdinaY 
posted an update 8 days ago
pagezyhf 
posted an update 8 days ago
AdinaY 
posted an update 9 days ago
view post
Post
4003
Kimi-Dev 💻 New coding model by Moonshot AI

moonshotai/Kimi-Dev-72B

✨ 72B - MIT license
✨ 60.4% on SWE-bench Verified
✨ RL-trained to patch real repos in Docker
✨ Only rewarded if full test suite passes
AdinaY 
posted an update 9 days ago
view post
Post
633
MiniMax-M1 🔥 The First reasoning model by MiniMax.

MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

✨ 40k/80k thinking budget
✨ Powered by Hybrid MoE + Lightning Attention 👀
✨ 1M context length 🤯
✨ Apache 2.0
✨ RL-trained for math, coding & real-world software
AdinaY 
posted an update 9 days ago
view post
Post
425
Hunyuan 3D 2.1 🔥 Industrial-grade 3D model just released by Tencent Hunyuan

tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1

✨ PBR materials: leather, bronze & more, breathtaking realism under any light
✨ Consumer GPU-ready: good for developers and small teams

a-r-r-o-w 
posted an update 12 days ago