Now showing items 22-24 of 156

    • Compositional Networks Enable Systematic Generalization for Grounded Language Understanding 

      Kuo, Yen-Ling; Katz, Boris; Barbu, Andrei (Center for Brains, Minds and Machines (CBMM), Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021-11-07)
      Humans are remarkably flexible when under- standing new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language ...
    • Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset 

      Palmer, Ian; Rouditchenko, Andrew; Barbu, Andrei; Katz, Boris; Glass, James (Center for Brains, Minds and Machines (CBMM), The 22nd Annual Conference of the International Speech Communication Association (Interspeech), 2021-08-30)
      Visually-grounded spoken language datasets can enable models to learn cross-modal correspon- dences with very weak supervision. However, modern audio-visual datasets contain biases that un- dermine the real-world performance ...
    • Compositional RL Agents That Follow Language Commands in Temporal Logic 

      Kuo, Yen-Ling; Barbu, Andrei; Katz, Boris (Center for Brains, Minds and Machines (CBMM), Frontiers in Robotics and AI, 2021-07-19)
      We demonstrate how a reinforcement learning agent can use compositional recurrent neural net- works to learn to carry out commands specified in linear temporal logic (LTL). Our approach takes as input an LTL formula, ...