[프로젝트] Audio2Video : audio unit based speech video generation with diffusion model - Proposal
Hubert, [Speech audio Unit encoding] conditioning, diffusion video generation
Hubert, [Speech audio Unit encoding] conditioning, diffusion video generation
Scene Graph를 Condition으로 받는 image generation diffusion model finetuning
Scene Graph를 Condition으로 받는 image generation diffusion model finetuning
GAN을 활용해 압축된 표현에서 오디오로 변환, one generator and two discriminators
Scene Graph를 Condition으로 받는 image generation diffusion model finetuning