[논문분석] HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
GAN을 활용해 압축된 표현에서 오디오로 변환, one generator and two discriminators
GAN을 활용해 압축된 표현에서 오디오로 변환, one generator and two discriminators
diffusion model에 Transformer 구조 사용, video generation model
transformer based Diffusion model
Multi-model image generation diffusion model
transformer based Diffusion model