AI & ML interests
None defined yet.
Recent Activity
Papers
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding
PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion
Articles
NV-Generate Synthetic Medical Imaging
Synthetic 3D CT and MR generation with NVIDIA NV-Generate.
Music Flamingo
Analyze music and answer questions from audio or YouTube links
VoMP
Volumetric physics materials for interactive worlds
LLM RTL Coding Errors Explainer
NVR - How LLMs Fail and Generalize in RTL Coding
Kimodo
Generate high-quality motions from text prompts
KVPress Leaderboard
KVPress leaderboard: benchmark KV Cache compression methods
Audio Flamingo 3 Demo
Audio Flamingo 3 Demo
Judge's Verdict Leaderboard
Judge's Verdict: Benchmarking LLM as a Judge
Llm Robustness Leaderboard
LLM Robustness leaderboard
Simready Validator
Validate a HuggingFace dataset with a SimReady profile
Debuggingapp
Generate a greeting with your number on GPU
LocateAnything
Locate objects in images and videos with visual tags
RE USE
A universal speech enhancement model for diverse degradation
ProfBench
Human-annotated rubrics in Professional Tasks
NV-Reason-CXR-3B Demo
Analyze chest X‑ray images and get detailed medical findings
Magpietts Demo
Generate multilingual speech from text
NVIDIA Hugging Face Organization
Asset Harvester
Image-to-3D for autonomous-vehicle simulation assets
Audio Flamingo Next
Answer questions about uploaded audio or YouTube videos
Audio Flamingo Next Captioner
Generate detailed captions and summaries for audio or YouTube videos
Audio Flamingo Next Think
Generate timestamped answers from audio or YouTube videos
Parakeet TDT 0.6b V3
Transcribe Speech with Multilingual parakeet-tdt-0.6b-v3
Nemotron OCR v2
Extract text and bounding boxes from images
MMOU Eval
Evaluate prediction files against MMOU benchmark data