Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shaolei Zhang's picture
5 6 10

Shaolei Zhang

zhangshaolei
21world's profile picture
·
https://zhangshaolei1998.github.io/
  • zhangshaolei1998

AI & ML interests

None yet

Recent Activity

liked a model 27 days ago
ICTNLP/StreamUni-Phi4
authored a paper about 2 months ago
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
upvoted a paper about 2 months ago
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
View all activity

Organizations

Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Science's profile picture

commented a paper about 2 months ago

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 27 •
2
New activity in ICTNLP/llava-mini-llama-3.1-8b 7 months ago

Add pipeline tag

#1 opened 7 months ago by
nielsr
commented a paper 7 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 53 •
4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs