Beijing Academy of Artificial Intelligence

non-profit

https://www.baai.ac.cn/english.html

Recent Activity

Lucywang720 updated a dataset 4 days ago

BAAI/MOVE

yimingju2 updated a dataset 4 days ago

BAAI/Infinity-Instruct

Lucywang720 published a dataset 5 days ago

BAAI/MOVE

Papers

General Agentic Memory Via Deep Research

Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

Lucywang720

updated a dataset 4 days ago

BAAI/MOVE

Updated 4 days ago • 10

yimingju2

updated a dataset 4 days ago

BAAI/Infinity-Instruct

Viewer • Updated 4 days ago • 21.9M • 3.85k • 687

Lucywang720

published a dataset 5 days ago

BAAI/MOVE

Updated 4 days ago • 10

lz1001

authored a paper 13 days ago

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 15 days ago • 155

zhizhou57

authored a paper 13 days ago

Beyond Multiple Choice: Verifiable OpenQA for Robust Vision-Language RFT

Paper • 2511.17405 • Published 16 days ago • 10

Zhoues

authored a paper 14 days ago

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics

Paper • 2510.07181 • Published Oct 8 • 1

Kiri233

updated a dataset 19 days ago

BAAI/Chinese-LiPS

Viewer • Updated 19 days ago • 36.2k • 1.03k • 7

JUNJIE99

authored a paper 24 days ago

MR$^2$-Bench: Going Beyond Matching to Reasoning in Multimodal Retrieval

Paper • 2509.26378 • Published Sep 30

xinlongwang

updated a collection 25 days ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 25 days ago • 71

wolfwjs

in BAAI/Emu3.5-Image about 1 month ago

Add pipeline tag, library name, and sync model card with GitHub README

#2 opened about 1 month ago by

wolfwjs

in BAAI/Emu3.5 about 1 month ago

Add pipeline tag and library name

#1 opened about 1 month ago by

aikx

authored a paper about 1 month ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 106

ryanzhangfan

authored 2 papers about 1 month ago

Uniform Discrete Diffusion with Metric Path for Video Generation

Paper • 2510.24717 • Published Oct 28 • 39

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 106

Bitterdhg

authored a paper about 1 month ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30 • 106

philokey

authored a paper about 1 month ago

Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

Paper • 2510.26865 • Published Oct 30 • 11

PhyscalX

published 2 models about 1 month ago

BAAI/URSA-0.6B-IBQ1024

Text-to-Image • Updated Nov 2 • 15 • 3

BAAI/URSA-0.6B-FSQ320

Text-to-Video • Updated Nov 2 • 2 • 3

PhyscalX

updated a collection about 1 month ago

🐻 URSA

URSA: Uniform Discrete Diffusion with Metric Path for Video Generation • 6 items • Updated Nov 2 • 6

PhyscalX

updated a model about 1 month ago

BAAI/URSA-0.6B-IBQ1024

Text-to-Image • Updated Nov 2 • 15 • 3