Post: 🚨 Implement KV Cache from scratch in pure PyTorch. 🚨 We have documented everything we learned while adding a KV cache to nanoVLM. Joint work with @kashif, @lusxvr, @andito, and @pcuenq. Blog: hf.co/blog/kv-cache
BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation Paper • 2504.02812 • Published Apr 3 • 5
ShieldGemma 2: Robust and Tractable Image Content Moderation Paper • 2504.01081 • Published Apr 1 • 3
The Hidden Life of Tokens: Reducing Hallucination of Large Vision-Language Models via Visual Information Steering Paper • 2502.03628 • Published Feb 5 • 12
Post: Tried my hand at simplifying the derivations of Direct Preference Optimization. I cover how one can reformulate RLHF into DPO. The idea of implicit reward modeling is chef's kiss. Blog: https://huggingface.co/blog/ariG23498/rlhf-to-dpo
Post: Timm ❤️ Transformers. With the latest version of transformers, you can now use any timm model with the familiar transformers API. Blog post: https://huggingface.co/blog/timm-transformers Repository with examples: https://github.com/ariG23498/timm-wrapper-examples Collection: ariG23498/timmwrapper-6777b85f1e8d085d3f1374a1
Post: We are blessed with another iteration of PaliGemma. Google launches PaliGemma 2. google/paligemma-2-release-67500e1e1dbfdd4dee27ba48 merve/paligemma2-vqav2
Post: Qwen/qwen25-66e81a666513e518adb90d9e Qwen/Qwen2.5-Coder-Artifacts Qwen/Qwen2.5-Coder-demo
Post: Cohere drops two new multilingual models! https://huggingface.co/CohereForAI/aya-expanse-8b https://huggingface.co/CohereForAI/aya-expanse-32b Try them out here: https://huggingface.co/spaces/CohereForAI/aya_expanse
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding Paper • 2303.16341 • Published Mar 28, 2023