merve's Collections
OCR Models & Datasets
Updated 3 days ago
opendatalab/OmniDocBench • Viewer • Updated Feb 11 • 984 • 2.26k • 26
nanonets/Nanonets-OCR-s • Image-Text-to-Text • Updated 5 days ago • 177k • 1.16k
echo840/MonkeyOCR • Image-Text-to-Text • Updated 1 day ago • 270 • 456
OCR2 • Running on Zero • MCP • 58 • nanonets ocr / typhoon ocr / smoldocling / monkey ocr
OCR • Running on Zero • MCP • 280 • nanonets / qwen2vl ocr / rolmocr / aya vision