EXIM: A Hybrid Explicit-Implicit Representation for Text-Guided 3D Shape Generation

Zhengzhe Liu, Jingyu Hu, Ka Hei Hui, Xiaojuan Qi*, Daniel Cohen-Or, Chi Wing Fu*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

This paper presents a new text-guided technique for generating 3D shapes. The technique leverages a hybrid 3D shape representation, namely EXIM, combining the strengths of explicit and implicit representations. Specifically, the explicit stage controls the topology of the generated 3D shapes and enables local modifications, whereas the implicit stage refines the shape and paints it with plausible colors. Also, the hybrid approach separates the shape and color and generates color conditioned on shape to ensure shape-color consistency. Unlike the existing state-of-the-art methods, we achieve high-fidelity shape generation from natural-language descriptions without the need for time-consuming per-shape optimization or reliance on human-annotated texts during training or test-time optimization. Further, we demonstrate the applicability of our approach to generate indoor scenes with consistent styles using text-induced 3D shapes. Through extensive experiments, we demonstrate the compelling quality of our results and the high coherency of our generated shapes with the input texts, surpassing the performance of existing methods by a significant margin. Code and models are released at https://github.com/liuzhengzhe/EXIM.

Original language: English
Article number: 228
Journal: ACM Transactions on Graphics
Volume: 42
Issue number: 6
State: Published - 4 Dec 2023

Funding

Funder: InnoHK of the Government of the Hong Kong Special Administrative Region
Funder: Research Grants Council, University Grants Committee (funder numbers: CUHK 14206320, 14201921)

    Keywords

    • 3D shape generation
    • text-guided
