
    • Theoretical Issues in Deep Networks 

      Poggio, Tomaso; Banburski, Andrzej; Liao, Qianli (Center for Brains, Minds and Machines (CBMM), 2019-08-17)
While deep learning is successful in a number of applications, it is not yet well understood theoretically. A theoretical characterization of deep networks should answer questions about their approximation power, the ...
    • Theory I: Why and When Can Deep Networks Avoid the Curse of Dimensionality? 

      Poggio, Tomaso; Mhaskar, Hrushikesh; Rosasco, Lorenzo; Miranda, Brando; Liao, Qianli (Center for Brains, Minds and Machines (CBMM), arXiv, 2016-11-23)
[formerly titled "Why and When Can Deep – but Not Shallow – Networks Avoid the Curse of Dimensionality: a Review"] The paper reviews and extends an emerging body of theoretical results on deep learning, including the ...
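      The flavor of the memo's central comparison, paraphrased here with constants and smoothness assumptions elided (notation ours): for a target function of n variables with a binary-tree compositional structure whose constituent functions have smoothness m, the number of parameters needed to reach accuracy \varepsilon scales roughly as

          \[ N_{\text{shallow}} = O\!\left(\varepsilon^{-n/m}\right), \qquad N_{\text{deep}} = O\!\left((n-1)\,\varepsilon^{-2/m}\right). \]

      The exponent of the deep rate no longer grows with n, which is the precise sense in which hierarchical networks matching the compositional structure can avoid the curse of dimensionality.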
    • Theory II: Landscape of the Empirical Risk in Deep Learning 

      Poggio, Tomaso; Liao, Qianli (Center for Brains, Minds and Machines (CBMM), arXiv, 2017-03-30)
Previous theoretical work on deep learning and neural network optimization tends to focus on avoiding saddle points and local minima. However, the practical observation is that, at least for the most successful Deep ...
    • Theory IIIb: Generalization in Deep Networks 

Poggio, Tomaso; Liao, Qianli; Miranda, Brando; Banburski, Andrzej; Hidary, Jack (Center for Brains, Minds and Machines (CBMM), arXiv.org, 2018-06-29)
The general features of the optimization problem for the case of overparametrized nonlinear networks have been clear for a while: SGD selects global minima over local minima with high probability. In the overparametrized ...
    • Theory of Deep Learning IIb: Optimization Properties of SGD 

      Zhang, Chiyuan; Liao, Qianli; Rakhlin, Alexander; Miranda, Brando; Golowich, Noah; e.a. (Center for Brains, Minds and Machines (CBMM), 2017-12-27)
In Theory IIb we characterize, with a mix of theory and experiments, the optimization of deep convolutional networks by Stochastic Gradient Descent. The main new result in this paper is theoretical and experimental evidence ...
    • Theory of Deep Learning III: explaining the non-overfitting puzzle 

      Poggio, Tomaso; Kawaguchi, Kenji; Liao, Qianli; Miranda, Brando; Rosasco, Lorenzo; e.a. (arXiv, 2017-12-30)
[This memo is replaced by CBMM Memo 90.] A main puzzle of deep networks revolves around the absence of overfitting despite overparametrization and despite the large capacity demonstrated by zero training error on randomly ...
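      One concrete face of the puzzle is visible already in the linear case; here is a hedged NumPy sketch (ours, not the memo's code): gradient descent on an overparametrized least-squares problem, initialized at zero, interpolates the training data yet converges to the minimum-norm solution, so complexity is controlled despite zero training error.

          import numpy as np

          rng = np.random.default_rng(0)
          n, d = 20, 100                          # overparametrized: more features than samples
          X = rng.standard_normal((n, d))
          w_true = np.zeros(d); w_true[:5] = 1.0
          y = X @ w_true                          # noiseless labels for clarity

          w = np.zeros(d)                         # zero init keeps w in the row space of X
          for _ in range(20000):
              w -= 0.01 * X.T @ (X @ w - y) / n   # plain gradient descent on the squared loss

          w_min_norm = np.linalg.pinv(X) @ y      # minimum-norm interpolant
          print(np.linalg.norm(X @ w - y))        # ~0: zero training error (interpolation)
          print(np.linalg.norm(w - w_min_norm))   # ~0: GD found the minimum-norm solution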
    • Theory of Intelligence with Forgetting: Mathematical Theorems Explaining Human Universal Forgetting using “Forgetting Neural Networks” 

      Cano-Córdoba, Felipe; Sarma, Sanjay; Subirana, Brian (Center for Brains, Minds and Machines (CBMM), 2017-12-05)
In [42] we suggested that any memory stored in the human/animal brain is forgotten following the Ebbinghaus curve; in this follow-on paper, we define a novel algebraic structure, a Forgetting Neural Network, as a simple ...
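      For reference, the Ebbinghaus forgetting curve mentioned here is classically modeled as an exponential decay of retention (a textbook form, not the memo's own formalism):

          \[ R(t) = e^{-t/S}, \]

      where R(t) is retention after elapsed time t and S is the relative strength of the memory; larger S means slower forgetting.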
    • Three approaches to facilitate DNN generalization to objects in out-of-distribution orientations and illuminations 

      Sakai, Akira; Sunagawa, Taro; Madan, Spandan; Suzuki, Kanata; Katoh, Takashi; e.a. (Center for Brains, Minds and Machines (CBMM), 2022-01-26)
      The training data distribution is often biased towards objects in certain orientations and illumination conditions. While humans have a remarkable capability of recognizing objects in out-of-distribution (OoD) orientations ...
    • Towards a Programmer’s Apprentice (Again) 

      Shrobe, Howard; Katz, Boris; Davis, Randall (Center for Brains, Minds and Machines (CBMM), 2015-04-03)
Programmers are loath to interrupt their workflow to document their design rationale, leading to frequent errors when software is modified, often much later and by different programmers. A Programmer's Assistant could ...
    • Trajectory Prediction with Linguistic Representations 

      Kuo, Yen-Ling; Huang, Xin; Barbu, Andrei; McGill, Stephen G.; Katz, Boris; e.a. (Center for Brains, Minds and Machines (CBMM), International Conference on Robotics and Automation (ICRA), 2022-03-09)
Language allows humans to build mental models that interpret what is happening around them, resulting in more accurate long-term predictions. We present a novel trajectory prediction model that uses linguistic intermediate ...
    • Transformer Module Networks for Systematic Generalization in Visual Question Answering 

      Yamada, Moyuru; D'Amario, Vanessa; Takemoto, Kentaro; Boix, Xavier; Sasaki, Tomotake (Center for Brains, Minds and Machines (CBMM), 2022-02-03)
Transformer-based models achieve great performance on Visual Question Answering (VQA). However, when we evaluate them on systematic generalization, i.e., handling novel combinations of known concepts, their performance ...
    • Understanding the Role of Recurrent Connections in Assembly Calculus 

      Rangamani, Akshay; Xie, Yi (Center for Brains, Minds and Machines (CBMM), 2022-07-06)
In this note, we explore the role of recurrent connections in Assembly Calculus through a number of experiments conducted on models with and without recurrent connections. We observe that assemblies can be formed even ...
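      To fix ideas, here is a toy NumPy rendering of the projection primitive that Assembly Calculus builds on (our sketch; all parameters and names are assumptions, not the note's code): random connectivity, a k-winners-take-all cap, and Hebbian scaling of active synapses suffice for a stable assembly to form.

          import numpy as np

          rng = np.random.default_rng(1)
          n, k, p, beta = 1000, 50, 0.05, 0.1          # area size, cap, edge probability, plasticity

          W = (rng.random((n, n)) < p).astype(float)   # recurrent weights within the target area
          F = (rng.random((n, n)) < p).astype(float)   # feedforward weights from the stimulus area

          stimulus = np.zeros(n)
          stimulus[rng.choice(n, k, replace=False)] = 1.0

          winners = np.zeros(n)
          for _ in range(20):                          # rounds of projection
              drive = F @ stimulus + W @ winners       # feedforward plus recurrent input
              new = np.zeros(n)
              new[np.argsort(drive)[-k:]] = 1.0        # k-winners-take-all cap
              W *= 1 + beta * np.outer(new, winners)   # Hebbian: strengthen active -> winner synapses
              F *= 1 + beta * np.outer(new, stimulus)
              winners = new

          print(int(winners.sum()))                    # k active neurons: the formed assembly

      Dropping the W @ winners term simulates removing the recurrent connections, which is the manipulation the note studies.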
    • Universal Dependencies for Learner English 

      Berzak, Yevgeni; Kenney, Jessica; Spadine, Carolyn; Wang, Jing Xian; Lam, Lucia; e.a. (Center for Brains, Minds and Machines (CBMM), arXiv, 2016-08-01)
      We introduce the Treebank of Learner English (TLE), the first publicly available syntactic treebank for English as a Second Language (ESL). The TLE provides manually annotated POS tags and Universal Dependency (UD) trees ...
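      To make the annotation style concrete, here is an illustrative CoNLL-U fragment for a toy sentence (our example, not drawn from the TLE itself), with a minimal Python read-out of the HEAD/DEPREL columns:

          # CoNLL-U columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
          conllu = (
              "1\tShe\tshe\tPRON\tPRP\t_\t2\tnsubj\t_\t_\n"
              "2\treads\tread\tVERB\tVBZ\t_\t0\troot\t_\t_\n"
              "3\tbooks\tbook\tNOUN\tNNS\t_\t2\tobj\t_\t_"
          )

          for line in conllu.splitlines():
              cols = line.split("\t")
              print(f"{cols[1]:>6} --{cols[7]}--> head {cols[6]}")  # e.g. "She --nsubj--> head 2"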
    • Unsupervised learning of clutter-resistant visual representations from natural videos 

      Liao, Qianli; Leibo, Joel Z; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2015-04-27)
Populations of neurons in inferotemporal cortex (IT) maintain an explicit code for object identity that also tolerates transformations of object appearance, e.g., position, scale, and viewing angle [1, 2, 3]. Though the learning ...
    • Unsupervised learning of invariant representations with low sample complexity: the magic of sensory cortex or a new framework for machine learning? 

      Anselmi, Fabio; Leibo, Joel Z.; Rosasco, Lorenzo; Mutch, Jim; Tacchetti, Andrea; e.a. (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-03-12)
      The present phase of Machine Learning is characterized by supervised learning algorithms relying on large sets of labeled examples (n → ∞). The next phase is likely to focus on algorithms capable of learning from very few ...
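      The framework's central object is an invariant signature obtained by pooling over a transformation group G; schematically (notation ours, following this line of work):

          \[ \mu_k(x) = \frac{1}{|G|} \sum_{g \in G} \eta\big(\langle x,\; g\,t^k \rangle\big), \]

      where t^k is a stored template and \eta a nonlinearity. Because the sum runs over the whole group, \mu_k(gx) = \mu_k(x) for transformations that act unitarily, so the signature is invariant while remaining selective for identity.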
    • Unsupervised Learning of Visual Structure Using Predictive Generative Networks 

      Lotter, William; Kreiman, Gabriel; Cox, David (Center for Brains, Minds and Machines (CBMM), arXiv, 2015-12-15)
      The ability to predict future states of the environment is a central pillar of intelligence. At its core, effective prediction requires an internal model of the world and an understanding of the rules by which the world ...
    • View-tolerant face recognition and Hebbian learning imply mirror-symmetric neural tuning to head orientation 

      Leibo, Joel Z.; Liao, Qianli; Freiwald, Winrich; Anselmi, Fabio; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM), arXiv, 2016-06-03)
      The primate brain contains a hierarchy of visual areas, dubbed the ventral stream, which rapidly computes object representations that are both specific for object identity and relatively robust against identity-preserving ...
    • Visual concepts and compositional voting 

      Wang, Jianyu; Zhang, Zhishuai; Xie, Cihang; Zhou, Yuyin; Premachandran, Vittal; e.a. (Center for Brains, Minds and Machines (CBMM), 2018-03-27)
      It is very attractive to formulate vision in terms of pattern theory [26], where patterns are defined hierarchically by compositions of elementary building blocks. But applying pattern theory to real world images is very ...
    • What am I searching for? 

      Zhang, Mengmi; Feng, Jiashi; Lim, Joo Hwee; Zhao, Qi; Kreiman, Gabriel (Center for Brains, Minds and Machines (CBMM), arXiv.org, 2018-07-31)
      Can we infer intentions and goals from a person's actions? As an example of this family of problems, we consider here whether it is possible to decipher what a person is searching for by decoding their eye movement behavior. ...
    • When Computer Vision Gazes at Cognition 

      Gao, Tao; Harari, Daniel; Tenenbaum, Joshua; Ullman, Shimon (Center for Brains, Minds and Machines (CBMM), arXiv, 2014-12-12)
      Joint attention is a core, early-developing form of social interaction. It is based on our ability to discriminate the third party objects that other people are looking at. While it has been shown that people can accurately ...