✨ Any-to-Any & World-Model: one step closer to the real world - BAAI Emu 3.5 - Ant Group Ming-flash-omni - HunyuanWorld-Mirror for 3D, aligning with the global "world model" trend
✨ Audio & Speech + Video & Visual: releases moving from entertainment labs to delivery platforms - SoulX-Podcast TTS - LongCat-Audio-Codec & LongCat-Video from delivery platform Meituan - xiabs DreamOmni 2
✨ 48B total / 3B active params - MIT license ✨ Up to 1M context ✨ 84.3 on RULER (128k) with a 3.98× speedup ✨ Hybrid KDA + MLA architecture for peak throughput & quality
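These specs describe a hybrid KDA + MLA mixture-of-experts checkpoint. Below is a minimal sketch of loading such a model with Hugging Face transformers; the repo id is an assumption for illustration, and the custom attention modules are expected to arrive via trust_remote_code.

```python
# Minimal sketch: loading a long-context hybrid-attention MoE checkpoint with
# Hugging Face transformers. The repo id below is an assumption for
# illustration; the hybrid KDA + MLA layers ship via trust_remote_code.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "moonshotai/Kimi-Linear-48B-A3B-Instruct"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,   # 48B total params, only ~3B active per token (MoE)
    device_map="auto",
    trust_remote_code=True,       # pulls in the custom hybrid attention modules
)

prompt = "Summarize the key idea of hybrid linear attention in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```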
✨ Compresses long sequences visually to bypass token limits ✨ Reduces computational and memory costs ✨ Preserves meaning through multimodal encoding ✨ Built on GLM-4.1V-9B-Base
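The core trick is rendering long text into image pages so the vision-language model reads pixels instead of raw tokens. Here is a minimal sketch of that rendering step with Pillow; the page size, font, and wrapping are illustrative assumptions, and the downstream VLM call is omitted.

```python
# Minimal sketch of the "compress text as pixels" idea: render a long passage
# onto an image page so a vision-language model (e.g. one built on
# GLM-4.1V-9B-Base) can consume many text tokens as a single image input.
# Page size, font, and margins here are illustrative assumptions.
import textwrap
from PIL import Image, ImageDraw, ImageFont

def render_text_page(text: str, width: int = 1024, height: int = 1024,
                     font_size: int = 18, margin: int = 24) -> Image.Image:
    page = Image.new("RGB", (width, height), "white")
    draw = ImageDraw.Draw(page)
    font = ImageFont.load_default()                 # swap in a real TTF for dense layouts
    chars_per_line = (width - 2 * margin) // (font_size // 2)
    y = margin
    for line in textwrap.wrap(text, width=chars_per_line):
        if y > height - margin:
            break                                    # this sketch renders a single page
        draw.text((margin, y), line, fill="black", font=font)
        y += font_size + 4
    return page

long_doc = "Replace this with a document longer than the raw token limit."
page_image = render_text_page(long_doc)
page_image.save("page_000.png")                     # feed pages to the VLM instead of raw tokens
```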
✨ Any prior in → 3D world out ✨ Mix camera poses, intrinsics, and depth as priors ✨ Predict point clouds, normals, Gaussians & more in one pass ✨ Unified architecture for all 3D tasks
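One way to picture the "any prior in, many 3D outputs out" design is a backbone that accepts optional prior tokens and decodes several heads in a single pass. The PyTorch sketch below illustrates only that pattern; the layer sizes, prior encodings, and head outputs are assumptions, not the HunyuanWorld-Mirror implementation.

```python
# Illustrative PyTorch sketch (not the HunyuanWorld-Mirror code) of the pattern:
# optional priors (camera pose, intrinsics, depth) are embedded as extra tokens,
# fused with image tokens in one backbone, and decoded by parallel heads.
import torch
import torch.nn as nn

class UnifiedPriorModel(nn.Module):
    def __init__(self, dim: int = 256):
        super().__init__()
        self.image_proj = nn.Linear(3 * 16 * 16, dim)          # naive 16x16 patch embed
        self.prior_proj = nn.ModuleDict({
            "camera": nn.Linear(12, dim),                       # flattened 3x4 extrinsics
            "intrinsics": nn.Linear(4, dim),                    # fx, fy, cx, cy
            "depth": nn.Linear(16 * 16, dim),                   # one depth patch
        })
        self.backbone = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(dim, nhead=8, batch_first=True), num_layers=2)
        self.heads = nn.ModuleDict({
            "points": nn.Linear(dim, 3),                        # per-token 3D point
            "normals": nn.Linear(dim, 3),                       # per-token surface normal
            "gaussians": nn.Linear(dim, 11),                    # mean + scale + rotation + opacity
        })

    def forward(self, patches, priors=None):
        tokens = [self.image_proj(patches)]
        for name, value in (priors or {}).items():              # any subset of priors works
            tokens.append(self.prior_proj[name](value))
        h = self.backbone(torch.cat(tokens, dim=1))
        img_h = h[:, :patches.shape[1]]                         # decode only image tokens
        return {name: head(img_h) for name, head in self.heads.items()}

model = UnifiedPriorModel()
patches = torch.randn(1, 64, 3 * 16 * 16)                       # 64 image patches
priors = {"intrinsics": torch.randn(1, 1, 4)}                   # only one prior available
outputs = model(patches, priors)
print({k: tuple(v.shape) for k, v in outputs.items()})
```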
✨ Trained on Honey-Data-15M, a 15M-sample SFT corpus with dual-level CoT reasoning ✨ Backed by HoneyPipe, a transparent & reproducible open data curation suite
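For a sense of what dual-level CoT supervision can look like, here is a mock SFT record with both a short and a long reasoning trace, plus a helper that picks one as the training target. The field names and the <think> formatting are assumptions for illustration, not the actual Honey-Data-15M schema.

```python
# Illustrative sketch of a dual-level CoT SFT record and how one might route it
# when building fine-tuning targets. Field names are assumptions, not the
# actual Honey-Data-15M schema.
def build_target(sample: dict, use_long_cot: bool) -> str:
    """Pick the short or long chain-of-thought trace as the SFT target."""
    reasoning = sample["long_cot"] if use_long_cot else sample["short_cot"]
    return f"<think>{reasoning}</think>\n{sample['answer']}"

sample = {  # mock record standing in for one of the ~15M SFT examples
    "question": "How many legs do three spiders have in total?",
    "short_cot": "3 spiders x 8 legs = 24.",
    "long_cot": "A spider has 8 legs. Three spiders therefore have 3 x 8 = 24 legs.",
    "answer": "24",
}
print(build_target(sample, use_long_cot=True))
```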