dc.contributor.author | Tacchetti, Andrea | |
dc.contributor.author | Voinea, Stephen | |
dc.contributor.author | Evangelopoulos, Georgios | |
dc.date.accessioned | 2017-03-16T19:50:32Z | |
dc.date.available | 2017-03-16T19:50:32Z | |
dc.date.issued | 2017-03-13 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/107446 | |
dc.description.abstract | The complexity of a learning task is increased by transformations in the input space that preserve class identity. Visual object recognition, for example, is affected by changes in viewpoint, scale, illumination, or planar transformations. While drastically altering the visual appearance, these changes are orthogonal to recognition and should not be reflected in the representation or feature encoding used for learning. We introduce a framework for weakly supervised learning of image embeddings that are robust to transformations and selective to the class distribution, using sets of transforming examples (orbit sets), deep parametrizations, and a novel orbit-based loss. The proposed loss combines a discriminative, contrastive part for orbits with a reconstruction error that learns to rectify orbit transformations. The learned embeddings are evaluated in distance metric-based tasks, such as one-shot classification under geometric transformations, as well as face verification and retrieval under more realistic visual variability. Our results suggest that orbit sets, suitably computed or observed, can be used for efficient, weakly supervised learning of semantically relevant image embeddings. | en_US
dc.description.sponsorship | This material is based upon work supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF-1231216. | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | Center for Brains, Minds and Machines (CBMM), arXiv | en_US |
dc.relation.ispartofseries | CBMM Memo Series;062 | |
dc.rights | Attribution-NonCommercial-ShareAlike 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/us/ | * |
dc.subject | supervised learning | en_US |
dc.subject | object recognition | en_US |
dc.subject | machine learning | en_US |
dc.title | Discriminate-and-Rectify Encoders: Learning from Image Transformation Sets | en_US |
dc.type | Technical Report | en_US |
dc.type | Working Paper | en_US |
dc.type | Other | en_US |
dc.identifier.citation | arXiv:1703.04775v1 | en_US |