SWAGAN

Rinon Gal, Dana Cohen Hochberg, Amit Bermano, Daniel Cohen-Or

Research output: Contribution to journalArticlepeer-review

51 Scopus citations

Abstract

In recent years, considerable progress has been made in the visual quality of Generative Adversarial Networks (GANs). Even so, these networks still suffer from degradation in quality for high-frequency content, stemming from a spectrally biased architecture, and similarly unfavorable loss functions. To address this issue, we present a novel general-purpose Style and WAvelet based GAN (SWAGAN) that implements progressive generation in the frequency domain. SWAGAN incorporates wavelets throughout its generator and discriminator architectures, enforcing a frequency-aware latent representation at every step of the way. This approach, designed to directly tackle the spectral bias of neural networks, yields an improvement in the ability to generate medium and high frequency content, including structures which other networks fail to learn. We demonstrate the advantage of our method by integrating it into the SyleGAN2 framework, and verifying that content generation in the wavelet domain leads to more realistic high-frequency content, even when trained for fewer iterations. Furthermore, we verify that our model's latent space retains the qualities that allow StyleGAN to serve as a basis for a multitude of editing tasks, and show that our frequency-aware approach also induces improved high-frequency performance in downstream tasks.

Original languageEnglish
Article number134
JournalACM Transactions on Graphics
Volume40
Issue number4
DOIs
StatePublished - 1 Jul 2021

Funding

FundersFunder number
Israel Science Foundation2366/16, 2492/20

    Keywords

    • StyleGAN
    • generative adversarial networks
    • wavelet decomposition

    Fingerprint

    Dive into the research topics of 'SWAGAN'. Together they form a unique fingerprint.

    Cite this