Search
Now showing items 1-6 of 6
Computational role of eccentricity dependent cortical magnification
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-06)
We develop a sampling extension of M-theory focused on invariance to scale and translation. Quite surprisingly, the theory predicts an architecture of early vision with increasing receptive field sizes and a high resolution ...
Seeing What You’re Told: Sentence-Guided Activity Recognition In Video
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-05-29)
We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, ...
Detect What You Can: Detecting and Representing Objects using Holistic Models and Body Parts
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Detecting objects becomes difficult when we need to deal with large shape deformation, occlusion and low resolution. We propose a novel approach to i) handle large deformations and partial occlusions in animals (as examples ...
Robust Estimation of 3D Human Poses from a Single Image
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-10)
Human pose estimation is a key step to action recognition. We propose a method of estimating 3D human poses from a single image, which works in conjunction with an existing 2D pose/joint detector. 3D pose estimation is ...
The Secrets of Salient Object Segmentation
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-13)
In this paper we provide an extensive evaluation of fixation prediction and salient object segmentation algorithms as well as statistics of major datasets. Our analysis identifies serious design flaws of existing salient ...
Learning An Invariant Speech Representation
(Center for Brains, Minds and Machines (CBMM), arXiv, 2014-06-15)
Recognition of speech, and in particular the ability to generalize and learn from small sets of labelled examples like humans do, depends on an appropriate representation of the acoustic input. We formulate the problem of ...