Publications: Recent submissions
Now showing items 127-129 of 159
-
Deep Captioning with Multimodal Recurrent Neural Networks (m-RNN)
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-05-07)In this paper, we present a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. It directly models the probability distribution of generating a word given previous words and an image. ... -
Semantic Part Segmentation using Compositional Model combining Shape and Appearance
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-06-08)In this paper, we study the problem of semantic part segmentation for animals. This is more challenging than standard object detection, object segmentation and pose estimation tasks because semantic parts of animals often ... -
Complexity of Representation and Inference in Compositional Models with Part Sharing
(Center for Brains, Minds and Machines (CBMM), arXiv, 2015-05-05)This paper performs a complexity analysis of a class of serial and parallel compositional models of multiple objects and shows that they enable efficient representation and rapid inference. Compositional models are generative ...


