Speech - a kaufmane Collection

kaufmane 's Collections

Speech

Motion

MLLM

LLMs

Speech

updated Mar 19

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Paper • 2403.03100 • Published Mar 5 • 34
VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis

Paper • 2403.08764 • Published Mar 13 • 36