Show simple item record

dc.contributor.authorLei, Tao
dc.contributor.authorJin, Wengong
dc.contributor.authorBarzilay, Regina
dc.contributor.authorJaakkola, Tommi S
dc.date.accessioned2021-04-14T20:56:48Z
dc.date.available2021-04-14T20:56:48Z
dc.date.issued2017
dc.identifier.urihttps://hdl.handle.net/1721.1/130480
dc.description.abstractThe design of neural architectures for structured objects is typically guided by experimental insights rather than a formal process. In this work, we appeal to kernels over combinatorial structures, such as sequences and graphs, to derive appropriate neural operations. We introduce a class of deep recurrent neural operations and formally characterize their associated kernel spaces. Our recurrent modules compare the input to virtual reference objects (cf. filters in CNN) via the kernels. Similar to traditional neural operations, these reference objects are parameterized and directly optimized in end-to-end training. We empirically evaluate the proposed class of neural architectures on standard applications such as language modeling and molecular graph regression, achieving state-of-the-art results across these applications.en_US
dc.language.isoen
dc.publisherMLResearch Pressen_US
dc.relation.isversionofhttp://proceedings.mlr.press/v70/lei17a.htmlen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleDeriving neural architectures from sequence and graph kernelsen_US
dc.typeArticleen_US
dc.identifier.citationLei, Tao et al. "Deriving neural architectures from sequence and graph kernels." Proceedings of the 34th International Conference on Machine Learning, August 2017, Sydney, Australia, MLResearch Press, 2017. © 2017 The author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalProceedings of the 34th International Conference on Machine Learningen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-05-07T15:51:58Z
dspace.date.submission2019-05-07T15:51:59Z
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record