- PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
  Paper • 2510.14528 • Published • 110
- deepseek-ai/DeepSeek-OCR
  Image-Text-to-Text • 3B • Updated • 2.99M • 3.1k
- PaddlePaddle/PaddleOCR-VL
  Image-Text-to-Text • 1.0B • Updated • 13.3k • 1.5k
- nanonets/Nanonets-OCR2-3B
  Image-Text-to-Text • 4B • Updated • 85k • 482
www.minds.com/jelyazko/
21world
AI & ML interests
Who does not work shall not eat
Recent Activity
Updated a collection 1 day ago: 35\ Speech -> TEXT /STT/
Updated a collection 1 day ago: 57\ Picture Editors
Updated a collection 1 day ago: 36\ in text -> out speech /TTS/