46 35 21

vansin PRO

vansin

AI & ML interests

None yet

Recent Activity

updated a Space about 14 hours ago

vansin/PaperScope

published a Space about 14 hours ago

vansin/PaperScope

updated a model 9 days ago

internlm/Intern-S1-GGUF

View all activity

Organizations

upvoted 2 changelogs 15 days ago

Changelog

New Inference Providers Dashboard

Jun 5

• 61

Changelog

Inference Providers now fully support OpenAI-compatible API

19 days ago

• 73

upvoted a paper 28 days ago

Coding Triangle: How Does Large Language Model Understand Code?

Paper • 2507.06138 • Published 29 days ago • 20

upvoted a paper 29 days ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published 29 days ago • 38

upvoted an article about 1 month ago

Article

The AI Paradigm Shift Is Here: 4 Disruptive Trends from the Top 50 Hugging Face Papers of Q2 2025

•

Jul 2

• 2

upvoted 3 papers about 1 month ago

AnnaAgent: Dynamic Evolution Agent System with Multi-Session Memory for Realistic Seeker Simulation

Paper • 2506.00551 • Published May 31 • 3

Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination

Paper • 2503.04149 • Published Mar 6 • 6

CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming

Paper • 2505.12925 • Published May 19 • 2

upvoted a paper 2 months ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 267

upvoted a paper 3 months ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 68

upvoted a paper 4 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

upvoted a collection 4 months ago

InternVL3

Collection

34 items • Updated Apr 20 • 79

upvoted 6 papers 5 months ago

ProJudge: A Multi-Modal Multi-Discipline Benchmark and Instruction-Tuning Dataset for MLLM-based Process Judges

Paper • 2503.06553 • Published Mar 9 • 8

VisualSimpleQA: A Benchmark for Decoupled Evaluation of Large Vision-Language Models in Fact-Seeking Question Answering

Paper • 2503.06492 • Published Mar 9 • 11

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 68

upvoted 2 papers 6 months ago

SARChat-Bench-2M: A Multi-Task Vision-Language Benchmark for SAR Image Interpretation

Paper • 2502.08168 • Published Feb 12 • 12

Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning

Paper • 2502.06781 • Published Feb 10 • 61