HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
tta video-to-audio text-to-audio text-to-video foley-sound-synthesis foley-art aigc-audio text-video-to-audio
-
Updated
Sep 28, 2025 - Python