4.8 Article

Deformable Generator Networks: Unsupervised Disentanglement of Appearance and Geometry

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TPAMI.2020.3013905

Keywords

Generators; Deformable models; Data models; Shape; Interpolation; Analytical models; Image color analysis; Unsupervised learning; deep generative model; deformable model

Funding

  1. Natural Science Foundation of China [61703119]
  2. Natural Science Fund of Heilongjiang Province of China [QC2017070]
  3. Fundamental Research Funds for the Central Universities [3072020CF0403]
  4. NSF [DMS-2015577]
  5. DARPA [XAI N66001-17-2-4029]
  6. ARO [W911NF1810296]
  7. ONR MURI [N00014-16-1-2007]

Ask authors/readers for more resources

We present a deformable generator model that can disentangle the appearance and geometric information of image and video data in an unsupervised manner. Our model consists of two generators, one for appearance generation and the other for geometric warping. The appearance generator models color, illumination, identity, and category information, while the geometric generator generates deformation fields for geometric warping. We also introduce a nonlinear transition model for capturing dynamics over time in video data. Extensive experiments show that our approach can successfully disentangle appearance and geometric information, and the learned geometric generator can be transferred to other image datasets with similar structure regularity.
We present a deformable generator model to disentangle the appearance and geometric information for both image and video data in a purely unsupervised manner. The appearance generator network models the information related to appearance, including color, illumination, identity or category, while the geometric generator performs geometric warping, such as rotation and stretching, through generating deformation field which is used to warp the generated appearance to obtain the final image or video sequences. Two generators take independent latent vectors as input to disentangle the appearance and geometric information from image or video sequences. For video data, a nonlinear transition model is introduced to both the appearance and geometric generators to capture the dynamics over time. The proposed scheme is general and can be easily integrated into different generative models. An extensive set of qualitative and quantitative experiments shows that the appearance and geometric information can be well disentangled, and the learned geometric generator can be conveniently transferred to other image datasets that share similar structure regularity to facilitate knowledge transfer tasks.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available