[Paper Analysis] Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Wav2vec + animatediff - Talking Face generation
Audio Conditioned Diffusion Models - Talking Face generation
Sound-guided video generation with a GAN, using CLIP's latent space
Adds Korean to a previously proposed paper on unit-based multilingual audio translation