Show simple item record

dc.contributor.authorZhang, Chiyuan
dc.contributor.authorEvangelopoulos, Georgios
dc.contributor.authorVoinea, Stephen
dc.contributor.authorRosasco, Lorenzo
dc.contributor.authorPoggio, Tomaso
dc.date.accessioned2015-12-08T20:45:08Z
dc.date.available2015-12-08T20:45:08Z
dc.date.issued2014-17-03
dc.identifier.urihttp://hdl.handle.net/1721.1/100163
dc.description.abstractRepresentations in the auditory cortex might be based on mechanisms similar to the visual ventral stream; modules for building invariance to transformations and multiple layers for compositionality and selectivity. In this paper we propose the use of such computational modules for extracting invariant and discriminative audio representations. Building on a theory of invariance in hierarchical architectures, we propose a novel, mid-level representation for acoustical signals, using the empirical distributions of projections on a set of templates and their transformations. Under the assumption that, by construction, this dictionary of templates is composed from similar classes, and samples the orbit of variance-inducing signal transformations (such as shift and scale), the resulting signature is theoretically guaranteed to be unique, invariant to transformations and stable to deformations. Modules of projection and pooling can then constitute layers of deep networks, for learning composite representations. We present the main theoretical and computational aspects of a framework for unsupervised learning of invariant audio representations, empirically evaluated on music genre classification.en_US
dc.description.sponsorshipThis work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF - 1231216.en_US
dc.language.isoen_USen_US
dc.publisherCenter for Brains, Minds and Machines (CBMM), arXiven_US
dc.relation.ispartofseriesCBMM Memo Series;002
dc.rightsAttribution-NonCommercial 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/us/*
dc.subjectAudio Representationen_US
dc.subjectHierarchyen_US
dc.subjectInvarianceen_US
dc.subjectMachine Learningen_US
dc.subjectTheories for Intelligenceen_US
dc.titleA Deep Representation for Invariance And Music Classificationen_US
dc.typeTechnical Reporten_US
dc.typeWorking Paperen_US
dc.typeOtheren_US
dc.identifier.citationarXiv:1404.0400v1en_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record