Prithiv Sakthi

prithivMLmods

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality @strangerzonehf @strangerguardhf

Recent Activity

updated a dataset about 1 hour ago

prithivMLmods/OpenDoc-Pdf-Preview

updated a Space about 2 hours ago

prithivMLmods/Multimodal-OCR2

published a dataset about 16 hours ago

prithivMLmods/OpenDoc-Pdf-Preview

upvoted an article about 16 hours ago

📄 PDF Support in the Hugging Face Dataset Viewer

Article • about 17 hours ago • 2

upvoted a paper 1 day ago

AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models

Paper • 2506.19851 • Published 1 day ago • 47

upvoted a collection 2 days ago

Doc VL

drex [doc ], virex [ video (image ++) exp ] • 3 items • Updated about 18 hours ago • 2

upvoted an article 3 days ago

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Article • 5 days ago • 50

upvoted 2 collections 3 days ago

June 20 Releases

22 items • Updated 3 days ago • 6

VisionScope OCR Experimentals

Based on Qwen2.5 VL, Qwen2 VL • 5 items • Updated 3 days ago • 1

upvoted 2 papers 6 days ago

Improved Iterative Refinement for Chart-to-Code Generation via Structured Instruction

Paper • 2506.14837 • Published 11 days ago • 10

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published 8 days ago • 41

upvoted a paper 7 days ago

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published 11 days ago • 57

upvoted 3 papers 8 days ago

EfficientVLA: Training-Free Acceleration and Compression for Vision-Language-Action Models

Paper • 2506.10100 • Published 15 days ago • 10

Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team

Paper • 2506.14234 • Published 9 days ago • 38

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published 9 days ago • 43

upvoted an article 9 days ago

Testing VisionOCR-3B-061125 and Qwen2-VL-OCR-2B-Instruct for precise recognition of [messy] handwriting.

Article • 9 days ago • 3

upvoted a changelog 9 days ago

New Model Filtering Options on the Hub

Changelog • 10 days ago • 52

upvoted a collection 14 days ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated 13 days ago • 128

upvoted 3 changelogs 16 days ago

Connect Your MCP Client to the Hugging Face Hub

Changelog • 20 days ago • 90

Static Spaces can now have a build step

Changelog • May 23 • 105

New Inference Providers Dashboard

Changelog • 21 days ago • 52

upvoted an article 19 days ago

What if Your AI Conversations Become Public?

Article • 20 days ago • 11

upvoted a collection 21 days ago

GCIRS Reasoning Qwen

RL~Reward Signal • 2 items • Updated 22 days ago • 1