Charles Cai

charlescai2016

AI & ML interests

None yet

Recent Activity

liked a model about 16 hours ago

stabilityai/stable-diffusion-3.5-large

upvoted a paper 3 days ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

liked a Space 5 days ago

Kwai-Kolors/Kolors-Portrait-with-Flux

View all activity

Organizations

upvoted a paper 3 days ago

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 63

upvoted 2 papers 6 days ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published 7 days ago • 81

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published 9 days ago • 30

upvoted a paper 8 days ago

The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm

Paper • 2507.18553 • Published 13 days ago • 37

upvoted a paper 12 days ago

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published 18 days ago • 122

upvoted a paper 13 days ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 122

upvoted a paper 22 days ago

Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data

Paper • 2507.07095 • Published 27 days ago • 53

upvoted a paper about 1 month ago

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 86

upvoted a paper about 2 months ago

Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression

Paper • 2506.09482 • Published Jun 11 • 46

upvoted an article about 2 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

and 4 others •

Jun 19

• 83

upvoted a collection 2 months ago

Qwen3-Reranker

Collection

3 items • Updated 16 days ago • 62

upvoted an article 3 months ago

Article

The N Implementation Details of RLHF with PPO

and 2 others •

Oct 24, 2023

• 63

upvoted 2 papers 3 months ago

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

upvoted an article 3 months ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

•

Apr 25

• 293

upvoted a paper 3 months ago

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Paper • 2504.12395 • Published Apr 16 • 17

upvoted an article 5 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

and 2 others •

Jan 23

• 182

upvoted a collection 5 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 244

upvoted 2 articles 5 months ago

Article

Merge Large Language Models with mergekit

•

Jan 9, 2024

• 130

Article

Multivariate Probabilistic Time Series Forecasting with Informer

and 2 others •

Mar 10, 2023

• 21

Charles Cai

AI & ML interests

Recent Activity

Organizations

charlescai2016's activity

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

The N Implementation Details of RLHF with PPO

Tiny Agents: a MCP-powered agent in 50 lines of code

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Merge Large Language Models with mergekit

Multivariate Probabilistic Time Series Forecasting with Informer