Shah Nirmesh's picture

4 18 1

Shah Nirmesh

Nirmesh

·

nirmesh-shah

AI & ML interests

Speech Processing

Recent Activity

liked a model about 2 months ago

DiscreteSpeech/DSTK

upvoted a paper 2 months ago

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

upvoted a paper 2 months ago

In-Domain African Languages Translation Using LLMs and Multi-armed Bandits

View all activity

Organizations

None yet

commented 4 papers 2 months ago

EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion

Paper • 2412.20359 • Published Dec 29, 2024 • 8 •

VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech

Paper • 2406.08076 • Published Jun 12, 2024 • 6 •

DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing

Paper • 2406.08802 • Published Jun 13, 2024 • 8 •

REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion

Paper • 2505.20756 • Published May 27 • 8 •