krishna praveen

krishnapraveen

AI & ML interests

None yet

Recent Activity

liked a model 10 days ago

Qwen/Qwen3-235B-A22B-Instruct-2507

liked a model 2 months ago

Tongyi-Zhiwen/QwenLong-L1-32B

liked a model 2 months ago

deepseek-ai/DeepSeek-R1-0528

Organizations

None yet

upvoted a paper 2 months ago

VACE: All-in-One Video Creation and Editing

Paper • 2503.07598 • Published Mar 10 • 54

upvoted a collection 2 months ago

VACE

VACE: All-in-One Video Creation and Editing • 7 items • Updated May 15 • 31

upvoted 2 papers 3 months ago

Agent S: An Open Agentic Framework that Uses Computers Like a Human

Paper • 2410.08164 • Published Oct 10, 2024 • 25

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 107

upvoted a collection 4 months ago

Llama 4

Llama 4 release • 13 items • Updated Apr 29 • 597

upvoted a paper 6 months ago

MatAnyone: Stable Video Matting with Consistent Memory Propagation

Paper • 2501.14677 • Published Jan 24 • 35

upvoted 2 collections 6 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 15 days ago • 522

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated 15 days ago • 120

upvoted a paper 7 months ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 62

upvoted 2 collections 7 months ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 7 days ago • 254

DeepSeek-R1

10 items • Updated May 29 • 772

upvoted a paper 7 months ago

Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions

Paper • 2501.10020 • Published Jan 17 • 24

upvoted a collection 7 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 15 days ago • 293

upvoted a paper 7 months ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 105

upvoted a collection 9 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 81

upvoted a paper 9 months ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30, 2024 • 51

upvoted a collection 10 months ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 15 days ago • 51

upvoted a paper 10 months ago

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 88

upvoted a collection 11 months ago

CogVideo

10 items • Updated Jun 30 • 56

upvoted a paper 12 months ago

MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model

Paper • 2408.10198 • Published Aug 19, 2024 • 36