Disentangling Shape and Orientation with Affine Variational Autoencoders

Rene Bidart; Alexander Wong

doi:10.15353/jcvis.v7i1.4898

Vol. 7 No. 1 (2021)
Special Issue: Proceedings of CVIS 2021

https://doi.org/10.15353/jcvis.v7i1.4898

Published 2022-04-08

How to Cite

Bidart, R., & Wong, A. (2022). Disentangling Shape and Orientation with Affine Variational Autoencoders. Journal of Computational Vision and Imaging Systems, 7(1), 1–3. https://doi.org/10.15353/jcvis.v7i1.4898

Abstract

Is it possible to disentangle an object's orientation from its shape? In this work we create compressed representations of an object by disentangling its orientation and shape with a variational autoencoder augmented with affine transform layers. Even when the model is trained on randomly oriented data, shape and orientation are disentangled during training as the model learns to encode objects at a fixed orientation. We show that this process results in a more compressed latent representation for 2D digits on the MNIST dataset and for 3D objects on the ModelNet dataset.
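As an illustration only, the idea of encoding objects at a fixed orientation can be sketched with a plain affine warp and a search over candidate angles. This is not the authors' implementation: the abstract does not say how the affine parameters are chosen, so the grid search below is an assumption, and the names `affine_matrix`, `affine_warp`, `canonicalise`, and the `loss_fn` argument (a stand-in for a trained VAE's loss) are hypothetical.

```python
import numpy as np


def affine_matrix(theta, scale=1.0, tx=0.0, ty=0.0):
    """2x3 affine matrix: rotation by theta (radians), isotropic scale, translation."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[scale * c, -scale * s, tx],
                     [scale * s,  scale * c, ty]])


def affine_warp(img, A):
    """Warp a 2D array by A about the image centre (nearest neighbour, zero padding)."""
    h, w = img.shape
    centre = np.array([(h - 1) / 2.0, (w - 1) / 2.0])
    # Output pixel coordinates, expressed relative to the image centre.
    rows, cols = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    out_rc = np.stack([rows.ravel(), cols.ravel()]) - centre[:, None]
    # Inverse mapping: for every output pixel, find the source pixel it came from.
    M_inv = np.linalg.inv(np.vstack([A, [0.0, 0.0, 1.0]]))
    src = M_inv[:2, :2] @ out_rc + M_inv[:2, 2:3] + centre[:, None]
    src = np.rint(src).astype(int)
    valid = (src[0] >= 0) & (src[0] < h) & (src[1] >= 0) & (src[1] < w)
    out = np.zeros(h * w, dtype=img.dtype)
    out[valid] = img[src[0, valid], src[1, valid]]
    return out.reshape(h, w)


def canonicalise(img, loss_fn, angles):
    """Return the candidate angle (and warped image) that minimises loss_fn.

    loss_fn is a hypothetical stand-in for the loss of a trained VAE; the
    search over a fixed list of angles is an assumption made for this sketch.
    """
    warped = [affine_warp(img, affine_matrix(a)) for a in angles]
    losses = [loss_fn(x) for x in warped]
    best = int(np.argmin(losses))
    return angles[best], warped[best]


# Toy usage: an L-shaped "object", rotated by 90 degrees, then brought back
# to the orientation at which a (stand-in) model reconstructs it best.
template = np.zeros((7, 7))
template[1:6, 2] = 1.0
template[5, 2:5] = 1.0
rotated = affine_warp(template, affine_matrix(np.pi / 2))
angle, canonical = canonicalise(
    rotated,
    loss_fn=lambda x: float(np.sum((x - template) ** 2)),
    angles=[0.0, np.pi / 2, np.pi, 3 * np.pi / 2],
)
print(angle, np.array_equal(canonical, template))
```

In this toy setup the orientation lives entirely in the recovered angle, while the canonical image carries only the shape, which is the kind of separation the abstract describes. A differentiable warp and gradient-based optimisation would be a natural replacement for the grid search, but that too is an assumption rather than something stated here.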
