TY - JOUR
T1 - LOGAN
T2 - Unpaired shape transform in latent overcomplete space
AU - Yin, Kangxue
AU - Chen, Zhiqin
AU - Huang, Hui
AU - Cohen-Or, Daniel
AU - Zhang, Hao
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery.
PY - 2019/11
Y1 - 2019/11
N2 - We introduce LOGAN, a deep neural network aimed at learning generalpurpose shape transforms from unpaired domains. The network is trained on two sets of shapes, e.g., tables and chairs, while there is neither a pairing between shapes from the domains as supervision nor any point-wise correspondence between any shapes. Once trained, LOGAN takes a shape from one domain and transforms it into the other. Our network consists of an autoencoder to encode shapes from the two input domains into a common latent space, where the latent codes concatenate multi-scale shape features, resulting in an overcomplete representation. The translator is based on a generative adversarial network (GAN), operating in the latent space, where an adversarial loss enforces cross-domain translation while a feature preservation loss ensures that the right shape features are preserved for a natural shape transform. We conduct ablation studies to validate each of our key network designs and demonstrate superior capabilities in unpaired shape transforms on a variety of examples over baselines and state-of-the-art approaches. We show that LOGAN is able to learn what shape features to preserve during shape translation, either local or non-local, whether content or style, depending solely on the input domains for training.
AB - We introduce LOGAN, a deep neural network aimed at learning generalpurpose shape transforms from unpaired domains. The network is trained on two sets of shapes, e.g., tables and chairs, while there is neither a pairing between shapes from the domains as supervision nor any point-wise correspondence between any shapes. Once trained, LOGAN takes a shape from one domain and transforms it into the other. Our network consists of an autoencoder to encode shapes from the two input domains into a common latent space, where the latent codes concatenate multi-scale shape features, resulting in an overcomplete representation. The translator is based on a generative adversarial network (GAN), operating in the latent space, where an adversarial loss enforces cross-domain translation while a feature preservation loss ensures that the right shape features are preserved for a natural shape transform. We conduct ablation studies to validate each of our key network designs and demonstrate superior capabilities in unpaired shape transforms on a variety of examples over baselines and state-of-the-art approaches. We show that LOGAN is able to learn what shape features to preserve during shape translation, either local or non-local, whether content or style, depending solely on the input domains for training.
KW - Generative adversarial network
KW - Multi-scale point cloud encoding
KW - Shape transform
KW - Unpaired domain translation
KW - Unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85079050664&partnerID=8YFLogxK
U2 - 10.1145/3355089.3356494
DO - 10.1145/3355089.3356494
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85079050664
SN - 0730-0301
VL - 38
JO - ACM Transactions on Graphics
JF - ACM Transactions on Graphics
IS - 6
M1 - 3356494
ER -