merve
Β·
AI & ML interests
I love this website
VLMs, vision & co
Recent Activity
Organizations
-
-
-
-
-
-
-
-
-
-
-
view article
(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware
view article
Enhance Your Models in 5 Minutes with the Hugging Face Kernel Hub
By
and 6 others
β’
β’
100
view article
SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data
published
an
article
about 1 month ago
view article
nanoVLM: The simplest repository to train your VLM in pure PyTorch
published
an
article
about 1 month ago
view article
Vision Language Models (Better, Faster, Stronger)
By
and 4 others
β’
β’
458
published
an
article
about 2 months ago
view article
Welcoming Llama Guard 4 on Hugging Face Hub
By
and 3 others
β’
β’
38
view article
Cohere on Hugging Face Inference Providers π₯
view article
Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM
view article
SigLIP 2: A better multilingual vision language encoder
view article
SmolVLM2: Bringing Video Understanding to Every Device
view article
Open-source DeepResearch β Freeing our search agents
By
and 4 others
β’
β’
1.26k
view article
We now support VLMs in smolagents!
By
and 2 others
β’
β’
104
view article
SmolVLM Grows Smaller β Introducing the 250M & 500M Models!
view article
Introducing smolagents: simple agents that write actions in code.
By
and 2 others
β’
β’
1.07k
view article
Welcome PaliGemma 2 β New vision language models by Google
By
and 3 others
β’
β’
155
view article
SmolVLM - small yet mighty Vision Language Model
view article
Llama can now see and run on your device - welcome Llama 3.2
By
and 6 others
β’
β’
189
view article
Preference Optimization for Vision Language Models
published
an
article
about 1 year ago
view article
Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
published
an
article
about 1 year ago
view article
PaliGemma β Google's Cutting-Edge Open Vision Language Model
By
and 2 others
β’
β’
253