Show simple item record

dc.contributor.authorMao, Junhua
dc.contributor.authorXu, Wei
dc.contributor.authorYang, Yi
dc.contributor.authorWang, Jiang
dc.contributor.authorHuang, Zhiheng
dc.contributor.authorYuille, Alan L.
dc.date.accessioned2015-12-11T22:15:05Z
dc.date.available2015-12-11T22:15:05Z
dc.date.issued2015-05-07
dc.identifier.urihttp://hdl.handle.net/1721.1/100198
dc.description.abstractIn this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. Image captions are generated according to this distribution. The model consists of two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. These two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of our model is validated on four benchmark datasets (IAPR TC-12, Flickr 8K, Flickr 30K and MS COCO). Our model outperforms the state-of-the-art methods. In addition, the m-RNN model can be applied to retrieval tasks for retrieving images or sentences, and achieves significant performance improvement over the state-of-the-art methods which directly optimize the ranking objective function for retrieval.en_US
dc.description.sponsorshipThis work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF - 1231216.en_US
dc.language.isoen_USen_US
dc.publisherCenter for Brains, Minds and Machines (CBMM), arXiven_US
dc.relation.ispartofseriesCBMM Memo Series;033
dc.rightsAttribution-NonCommercial 3.0 United States*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/3.0/us/*
dc.subjectmultimodal Recurrent Neural Network (m-RNN)en_US
dc.subjectArtificial Intelligenceen_US
dc.subjectComputer Languageen_US
dc.titleDeep Captioning with Multimodal Recurrent Neural Networks (m-RNN)en_US
dc.typeTechnical Reporten_US
dc.typeWorking Paperen_US
dc.typeOtheren_US
dc.identifier.citationarXiv:1412.6632en_US


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record