We propose a novel approach to disentangle the generative factors of variation underlying a given set of observations. Our method builds upon the idea that the (unknown) low-dimensional manifold underlying the data space can be explicitly modeled as a product of submanifolds. This definition of disentanglement gives rise to a novel weakly-supervised algorithm for recovering the unknown explanatory factors behind the data. At training time, our algorithm only requires pairs of non i.i.d. data samples whose elements share at least one, possibly multidimensional, generative factor of variation. We require no knowledge on the nature of these transformations, and do not make any limiting assumption on the properties of each subspace. Our approach is easy to implement, and can be successfully applied to different kinds of data (from images to 3D surfaces) undergoing arbitrary transformations. In addition to standard synthetic benchmarks, we showcase our method in challenging real-world applications, where we compare favorably with the state of the art.

Fumero, M., Cosmo, L., Melzi, S., Rodola, E. (2021). Learning disentangled representations via product manifold projection. In Proceedings of the 38th International Conference on Machine Learning (pp.3530-3540). ML Research Press.

Learning disentangled representations via product manifold projection

Melzi, S;
2021

Abstract

We propose a novel approach to disentangle the generative factors of variation underlying a given set of observations. Our method builds upon the idea that the (unknown) low-dimensional manifold underlying the data space can be explicitly modeled as a product of submanifolds. This definition of disentanglement gives rise to a novel weakly-supervised algorithm for recovering the unknown explanatory factors behind the data. At training time, our algorithm only requires pairs of non i.i.d. data samples whose elements share at least one, possibly multidimensional, generative factor of variation. We require no knowledge on the nature of these transformations, and do not make any limiting assumption on the properties of each subspace. Our approach is easy to implement, and can be successfully applied to different kinds of data (from images to 3D surfaces) undergoing arbitrary transformations. In addition to standard synthetic benchmarks, we showcase our method in challenging real-world applications, where we compare favorably with the state of the art.
paper
disentangled representation; product manifold; latent space
English
38th International Conference on Machine Learning, ICML 2021 - 18 July 2021 through 24 July 2021
2021
Proceedings of the 38th International Conference on Machine Learning
9781713845065
2021
139
3530
3540
reserved
Fumero, M., Cosmo, L., Melzi, S., Rodola, E. (2021). Learning disentangled representations via product manifold projection. In Proceedings of the 38th International Conference on Machine Learning (pp.3530-3540). ML Research Press.
File in questo prodotto:
File Dimensione Formato  
fumero21a.pdf

Solo gestori archivio

Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Dimensione 3.7 MB
Formato Adobe PDF
3.7 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/350558
Citazioni
  • Scopus 9
  • ???jsp.display-item.citation.isi??? 1
Social impact