Jaykumaran R

Jaykumaran17

·

Jaykumaran

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Unified Vision-Language-Action Model

upvoted a paper about 1 month ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

liked a model about 2 months ago

patrickjohncyh/fashion-clip

upvoted 2 papers about 1 month ago

Unified Vision-Language-Action Model

Paper • 2506.19850 • Published Jun 24 • 27

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 39

upvoted a collection about 2 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 15 days ago • 522

upvoted 2 articles about 2 months ago

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Article • 5 authors • Published Jun 11 • 74

Introducing Training Cluster as a Service - a new collaboration with NVIDIA

Article • 3 authors • Published Jun 11 • 24

upvoted a collection about 2 months ago

Vision Language Models Papers 🖼️💬📝

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 38

upvoted a paper 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 122

upvoted a collection 2 months ago

SmolVLA

Small, efficient and light-weight VLAs pretrained on community datasets • 1 item • Updated Jun 1 • 27

upvoted a paper 2 months ago

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Paper • 2401.02117 • Published Jan 4, 2024 • 34

upvoted 2 articles 2 months ago

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Article • 9 authors • Published Jun 3 • 216

Vision Language Models (Better, Faster, Stronger)

Article • 5 authors • Published May 12 • 495

upvoted a collection 2 months ago

NVILA

10 items • Updated May 20 • 16

upvoted an article 3 months ago

SmolLM - blazingly fast and remarkably powerful

Article • 3 authors • Published Jul 16, 2024 • 403

upvoted an article 4 months ago

Visually Multilingual: Introducing mcdse-2b

Article • 1 author • Published Oct 27, 2024 • 41

upvoted a collection 4 months ago

Multimodal DSE Retrievers

A collection of DSE models for multimodal retrieval • 5 items • Updated Apr 15 • 14

upvoted 3 articles 4 months ago

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

Article • 5 authors • Published Mar 18 • 41

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Article • 4 authors • Published Feb 4 • 168

SmolVLM - small yet mighty Vision Language Model

Article • 5 authors • Published Nov 26, 2024 • 345

upvoted a paper 5 months ago

VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published Mar 14 • 26

upvoted an article 5 months ago

Open-Source Handwritten Signature Detection Model

Article • 1 author • Published Mar 14 • 116