Show simple item record

dc.contributor.authorGaddy, David M.
dc.contributor.authorZhang, Yuan
dc.contributor.authorBarzilay, Regina
dc.contributor.authorJaakkola, Tommi S.
dc.date.accessioned2017-07-17T18:13:57Z
dc.date.available2017-07-17T18:13:57Z
dc.date.issued2016-06
dc.identifier.isbn978-1-941643-91-4
dc.identifier.urihttp://hdl.handle.net/1721.1/110739
dc.description.abstractIn the absence of annotations in the target language, multilingual models typically draw on extensive parallel resources. In this paper, we demonstrate that accurate multilingual partof-speech (POS) tagging can be done with just a few (e.g., ten) word translation pairs. We use the translation pairs to establish a coarse linear isometric (orthonormal) mapping between monolingual embeddings. This enables the supervised source model expressed in terms of embeddings to be used directly on the target language. We further refine the model in an unsupervised manner by initializing and regularizing it to be close to the direct transfer model. Averaged across six languages, our model yields a 37.5% absolute improvement over the monolingual prototypedriven method (Haghighi and Klein, 2006) when using a comparable amount of supervision. Moreover, to highlight key linguistic characteristics of the generated tags, we use them to predict typological properties of languages, obtaining a 50% error reduction relative to the prototype modelen_US
dc.language.isoen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.relation.isversionofhttp://dblp.dagstuhl.de/db/conf/naacl/naacl2016.htmlen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT Web Domainen_US
dc.titleTen pairs to tag - Multilingual POS tagging via coarse mapping between embeddingsen_US
dc.typeArticleen_US
dc.identifier.citationZhang, Yuan et al. "Ten Pairs to Tag - Multilingual POS Tagging via Course Mapping between Embeddings." 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, San Diego, California, USA, 12-17 June, 2016. Association for Computational Linguistics, 2016.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorGaddy, David M.
dc.contributor.mitauthorZhang, Yuan
dc.contributor.mitauthorBarzilay, Regina
dc.contributor.mitauthorJaakkola, Tommi S.
dc.relation.journal15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologiesen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsZhang, Yuan; Gaddy, David; Barzilay, Regina; Jaakkola, Tommien_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-3121-0185
dc.identifier.orcidhttps://orcid.org/0000-0002-2921-8201
dc.identifier.orcidhttps://orcid.org/0000-0002-2199-0379
dspace.mitauthor.errortrue
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record