隐空间分片式非对抗生成自编码（张世华与合作者）中国科学院数学与系统科学研究院应用数学研究所

隐空间分片式非对抗生成自编码（张世华与合作者）

2024-03-18 | 撰稿： | 浏览:

Non-adversarial generative models are relatively easy to train and have less mode collapse than adversarial models. However, they are not very accurate in approximating the target distribution in latent space because they don't have a discriminator. To this end, we develop a novel divide-and-conquer model called Tessellated Wasserstein Auto-Encoders (TWAE) which has less statistical error in approximating the target distribution. TWAE tessellates the support of the target distribution into a given number of regions using the centroidal Voronoi tessellation (CVT) technique and designs data batches according to the tessellation instead of random shuffling for accurate computation of discrepancy. Theoretically, we demonstrate that the error in estimating the discrepancy decreases as the number of samples n and the regions m of the tessellation increase at rates of \mathcal {O}(\frac{1}{\sqrt{n}}) and \mathcal {O}(\frac{1}{\sqrt{m}}) , respectively. TWAE is very flexible to different non-adversarial metrics and can significantly enhance their generative performance in terms of Fréchet inception distance (FID) compared to existing ones. Furthermore, numerical results demonstrate that TWAE is competitive to the adversarial model and shows powerful generative ability.

Publication: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, no. 2, pp. 780-792, Feb. 2024 http://dx.doi.org/10.1109/TPAMI.2023.3325282

　　Author: Kuo Gai,NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China, School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China;Shihua Zhang,NCMIS, CEMS, RCSDS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China,School of Mathematics Sciences, University of Chinese Academy of Sciences, Beijing, China,Email: zsh@amss.ac.cn