Select elements in an image using text instructions
Describe objects in a webcam feed
Upscale an image to higher resolution
Show detailed model outputs for specific benchmarks
Compare the latest VAEs
Convert images and text into structured documents
Relight images quickly using Latent Bridge Matching