Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Cheng-Yu Hsieh's picture
1 5

Cheng-Yu Hsieh

cydhsieh01
21world's profile picture
·
https://chengyuhsieh.github.io/
  • cydhsieh

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago
Meta CLIP 1/2
authored a paper 11 months ago
Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
updated a model 12 months ago
vila-molmo/molmo-dense-captioner-v22-qwen2
View all activity

Organizations

Efficient-Large-Model's profile picture VILA / Molmo's profile picture

upvoted a collection about 1 month ago

Meta CLIP 1/2

Collection
Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 11 items • Updated Aug 25 • 20
upvoted 4 papers over 1 year ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 24

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Paper • 2407.06723 • Published Jul 9, 2024 • 11

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps

Paper • 2407.07071 • Published Jul 9, 2024 • 12

Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

Paper • 2406.16008 • Published Jun 23, 2024 • 6
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs