TY - JOUR
T1 - SWAGAN
AU - Gal, Rinon
AU - Hochberg, Dana Cohen
AU - Bermano, Amit
AU - Cohen-Or, Daniel
N1 - Publisher Copyright:
© 2021 ACM.
PY - 2021/7/1
Y1 - 2021/7/1
N2 - In recent years, considerable progress has been made in the visual quality of Generative Adversarial Networks (GANs). Even so, these networks still suffer from degradation in quality for high-frequency content, stemming from a spectrally biased architecture, and similarly unfavorable loss functions. To address this issue, we present a novel general-purpose Style and WAvelet based GAN (SWAGAN) that implements progressive generation in the frequency domain. SWAGAN incorporates wavelets throughout its generator and discriminator architectures, enforcing a frequency-aware latent representation at every step of the way. This approach, designed to directly tackle the spectral bias of neural networks, yields an improvement in the ability to generate medium and high frequency content, including structures which other networks fail to learn. We demonstrate the advantage of our method by integrating it into the SyleGAN2 framework, and verifying that content generation in the wavelet domain leads to more realistic high-frequency content, even when trained for fewer iterations. Furthermore, we verify that our model's latent space retains the qualities that allow StyleGAN to serve as a basis for a multitude of editing tasks, and show that our frequency-aware approach also induces improved high-frequency performance in downstream tasks.
AB - In recent years, considerable progress has been made in the visual quality of Generative Adversarial Networks (GANs). Even so, these networks still suffer from degradation in quality for high-frequency content, stemming from a spectrally biased architecture, and similarly unfavorable loss functions. To address this issue, we present a novel general-purpose Style and WAvelet based GAN (SWAGAN) that implements progressive generation in the frequency domain. SWAGAN incorporates wavelets throughout its generator and discriminator architectures, enforcing a frequency-aware latent representation at every step of the way. This approach, designed to directly tackle the spectral bias of neural networks, yields an improvement in the ability to generate medium and high frequency content, including structures which other networks fail to learn. We demonstrate the advantage of our method by integrating it into the SyleGAN2 framework, and verifying that content generation in the wavelet domain leads to more realistic high-frequency content, even when trained for fewer iterations. Furthermore, we verify that our model's latent space retains the qualities that allow StyleGAN to serve as a basis for a multitude of editing tasks, and show that our frequency-aware approach also induces improved high-frequency performance in downstream tasks.
KW - StyleGAN
KW - generative adversarial networks
KW - wavelet decomposition
UR - http://www.scopus.com/inward/record.url?scp=85111254126&partnerID=8YFLogxK
U2 - 10.1145/3450626.3459836
DO - 10.1145/3450626.3459836
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85111254126
SN - 0730-0301
VL - 40
JO - ACM Transactions on Graphics
JF - ACM Transactions on Graphics
IS - 4
M1 - 134
ER -