Context and Compositionality in Biological and Artificial Neural Systems
NeurIPS 2019 Workshop, Vancouver, Canada
December 14th, 2019
Google Group
https://groups.google.com/d/forum/context-comp
Videos
- Gary Marcus - Deep Understanding: The Next Challenge for AI (starts at 9:55)
- Gina Kuperberg - How probabilistic is language prediction in the brain? Insights from multimodal neuroimaging studies (starts at 54:36)
- Spotlight talks, incl. Paul Soulos - Uncovering the Compositional Structure of Vector Representations with Role Learning Networks (starts at 0:00), Robert Kim - Spiking Recurrent Networks as a Model to Probe Neuronal Timescales Specific to Working Memory (starts at 11:06), & Maxwell Nye - Learning Compositional Rules via Neural Program Synthesis (starts at 20:32)
- Tom Mitchell - Understanding Neural Processes: Beyond Where, and When, to How (starts at 30:14)
- Yoshua Bengio - Towards compositional understanding of the world by agent-based deep learning
- Ev Fedorenko - Composition as the Core Driver of the Human Language System
- Panel discussion incl. Paul Smolensky, Ev Fedorenko, Jacob Andreas, Kenton Lee, Gary Marcus, Yoshua Bengio, & Gina Kuperberg (chaired by Ted Willke)
About
The ability to integrate semantic information across narratives is fundamental to language understanding in both biological and artificial cognitive systems. In recent years, enormous strides have been made in NLP and machine learning to develop architectures and techniques that effectively capture these contextual effects. The field has moved away from traditional bag-of-words approaches that ignore temporal ordering and has instead embraced RNNs [1][2][3][4], Temporal CNNs [5], and Transformers [6], which incorporate contextual information at varying timescales. While these architectures have led to state-of-the-art performance on many difficult language understanding tasks [7][8], it is unclear what representations these networks learn and how exactly they incorporate context. Interpreting these networks, systematically analyzing the advantages and disadvantages of different elements, such as gating or attention, and reflecting on the capacity of the networks across various timescales are open and important questions.
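To make the contrast concrete, here is a minimal, illustrative PyTorch sketch (not drawn from any of the works cited above; the vocabulary size, dimensions, token ids, and class names are arbitrary assumptions): a bag-of-words encoder averages word embeddings and so cannot distinguish reorderings of the same tokens, whereas a recurrent encoder's final state depends on word order and hence on context.

```python
# Illustrative sketch only: contrasts an order-insensitive bag-of-words encoder
# with an order-sensitive LSTM encoder. All sizes and token ids are made up.
import torch
import torch.nn as nn

vocab_size, embed_dim, hidden_dim = 1000, 32, 64

class BagOfWordsEncoder(nn.Module):
    """Averages word embeddings; permuting the input tokens does not change the output."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)

    def forward(self, tokens):                 # tokens: (batch, seq_len)
        return self.embed(tokens).mean(dim=1)  # (batch, embed_dim)

class RecurrentEncoder(nn.Module):
    """Reads tokens left to right; the final hidden state depends on word order."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)

    def forward(self, tokens):
        _, (h_n, _) = self.rnn(self.embed(tokens))
        return h_n[-1]                         # (batch, hidden_dim)

# "dog bites man" vs. "man bites dog": same bag of words, different sequence.
a = torch.tensor([[7, 3, 5]])
b = torch.tensor([[5, 3, 7]])

bow, rnn = BagOfWordsEncoder(), RecurrentEncoder()
print(torch.allclose(bow(a), bow(b)))  # True  -> order is ignored
print(torch.allclose(rnn(a), rnn(b)))  # False -> order (context) matters
```

Gating (as in the LSTM above) and attention (as in Transformers) are two different mechanisms for deciding how much of this contextual history to retain, which is precisely the kind of design choice the questions above concern.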
On the biological side, recent work in neuroscience suggests that the brain is organized into a temporal hierarchy in which different areas are sensitive not only to specific semantic information [9] but also to the composition of information at different timescales [10][11]. Computational neuroscience has increasingly leveraged deep learning to gain insights about the brain [12][13]. By answering questions about the underlying mechanisms and representational interpretability of these artificial networks, we can also expand our understanding of temporal hierarchies, memory, and capacity effects in the brain.
In this workshop we aim to bring together researchers from machine learning, NLP, and neuroscience to explore and discuss how computational models should capture the multi-timescale, context-dependent effects that seem essential for processes such as language understanding. We believe that this will lead both to a deeper understanding of biological language systems and to improved artificial systems that leverage these insights to better understand language.
Important Dates
Milestone | Date |
---|---|
Paper Submission Deadline | |
Final Decisions | |
Camera Ready | |
Workshop Date | December 14th, 2019 |
Schedule
Date: Saturday December 14th, 2019
Room: West 217 - 219
Time | Event |
---|---|
08:00 AM | Opening Remarks |
08:15 AM | Gary Marcus - Deep Understanding: The Next Challenge for AI [Slides] |
09:00 AM | Gina Kuperberg - How Probabilistic is Language Comprehension in the Brain? Insights from Multimodal Neuroimaging Studies [Slides] |
09:45 AM | Poster Session + Break |
10:30 AM | Paul Soulos - Uncovering the Compositional Structure of Vector Representations with Role Learning Networks [Slides] |
10:40 AM | Robert Kim - Spiking Recurrent Networks as a Model to Probe Neuronal Timescales Specific to Working Memory [Slides] |
10:50 AM | Maxwell Nye - Learning Compositional Rules via Neural Program Synthesis [Slides] |
11:00 AM | Tom Mitchell - Understanding Neural Processes: Getting Beyond Where and When, to How [Slides] |
12:00 PM | Poster Session + Lunch |
02:00 PM | Yoshua Bengio - Towards Compositional Understanding of the World by Agent-Based Deep Learning [Slides] |
03:00 PM | Ev Fedorenko - Composition as the Core Driver of the Human Language System |
03:30 PM | Break |
04:00 PM | Panel Discussion: Ev Fedorenko, Kenton Lee, Paul Smolensky [Slides], Jacob Andreas |
05:30 PM | Closing Remarks |
Invited Speakers
Organizers
Accepted Papers
- Self-Organization of Action Hierarchy and Compositionality by Reinforcement Learning with Recurrent Networks. Dongqi Han, Kenji Doya, Jun Tani
- Multi-Context Term Embeddings: the Use Case of Corpus-based Term Set Expansion. Jonathan Mamou, Oren Pereg, Moshe Wasserblat, Ido Dagan
- Learning Compositional Rules via Neural Program Synthesis. Maxwell Nye, Armando Solar-Lezama, Joshua Tenenbaum, Brenden Lake
- Spiking Recurrent Networks as a Model to Probe Neuronal Timescales Specific to Working Memory. Robert Kim, Terry Sejnowski
- Localizing Occluders with Compositional Convolutional Networks. Adam Kortylewski, Qing Liu, Huiyu Wang, Zhishuai Zhang, Alan Yuille
- Why Attention? Analyzing and Remedying BiLSTM Deficiency in Modeling Cross-Context for NER. Peng-Hsuan Li, Tsu-Jui Fu, Wei-yun Ma
- A crossover code for high-dimensional composition. Rich Pang
- Radically Compositional Cognitive Concepts. Toby St Clere Smithe
- A memory enhanced LSTM for modeling complex temporal dependencies. Sneha Aenugu
- Towards Generation of Visual Attention Map for Source Code. Takeshi D Itoh, Takatomi Kubo, Kiyoka Ikeda, Yuki Maruno, Yoshiharu Ikutani, Hideaki Hata, Kenichi Matsumoto, Kazushi Ikeda
- Long-Distance Dependencies don’t have to be Long: Simplifying through Provably (Approximately) Optimal Permutations. Rishi Bommasani
- A representational asymmetry for composition in the human left-middle temporal gyrus. Steven Frankland
- Uncovering the compositional structure of vector representations with Role Learning Networks. Paul Soulos, R. Thomas Mccoy, Tal Linzen, Paul Smolensky
- Modelling the N400 brain potential as change in a probabilistic representation of meaning. Milena Rabovsky, Steven Hansen, James McClelland
- Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain). Mariya Toneva, Leila Wehbe
- Enhancing the Transformer with Explicit Relational Encoding for Math Problem Solving. Imanol Schlag, Paul Smolensky, Roland Fernandez, Nebojsa Jojic, Jianfeng Gao
- Inducing brain-relevant bias in natural language processing models. Dan Schwartz, Mariya Toneva, Leila Wehbe
- Sparse associative memory based on contextual code learning for disambiguating word senses. Max Raphael Sobroza Marques, Tales Marra, Deok-Hee Dufor, Claude Berrou
- Structured Sparsification of Gated Recurrent Neural Networks. Ekaterina Lobacheva, Nadezhda Chirkova, Aleksandr Markovich, Dmitry Vetrov
- Compositionality as Directional Consistency in Sequential Neural Networks. Najoung Kim, Tal Linzen
- Modelling Working Memory using Deep Recurrent Reinforcement Learning. Pravish Sainath, Pierre Bellec, Guillaume Lajoie
- Aging Memories Generate More Fluent Dialogue Responses with Memory Augmented Neural Networks. Omar U Florez, Erik Muller
- On Compositionality in Neural Machine Translation. Vikas Raunak, Vaibhav Kumar, Florian Metze, Jamie Callan
- Learning to Control Latent Representations on an External Memory for Few-Shot Learning. Omar U Florez, Erik Muller
- Decoding Affirmative and Negated Action-Related Sentences in the Brain with Distributional Semantic Models. Vesna Djokic
Call for Papers
Submit at: https://cmt3.research.microsoft.com/CNTXTCOMP2019/
We will consider the following (non-exhaustive) list of topics for contribution:
- Contextual sequence processing in the human brain
- Compositional representations
- Systematic generalization in deep learning
- Compositionality in human intelligence
- Compositionality in natural language
- Understanding composition and temporal processing in neural network models
- New approaches to compositionality and temporal processing in language
- Hierarchical representations of temporal information
- Datasets for contextual sequence processing
- Applications of compositional neural networks to real-world problems
Formatting Instructions: All submissions must be in PDF format. Submissions are limited to four content pages, including all figures and tables; additional pages containing only references are allowed. You must format your submission using the NeurIPS 2019 LaTeX style file. Submissions that violate the NeurIPS style (e.g., by decreasing margins or font sizes) or page limits may be rejected without further review. All submissions should be anonymous.
Accepted papers will be presented during a poster session, with spotlight oral presentations for exceptional submissions. The accepted papers will be made publicly available as non-archival reports, allowing future submissions to archival conferences or journals.
The review process is double-blind. We also welcome papers that have already been published elsewhere, provided they are within the scope of the workshop (no re-formatting required). Already-published papers do not have to be anonymized; they are eligible for the poster session and will receive only a light review.
Poster Instructions
Posters should be no larger than 36W x 48H inches (90 x 122 cm), portrait orientation. Posters should be printed on lightweight paper and not laminated; they will be taped to the wall with special tabs that we will supply.
Please redirect questions and all future correspondence to [email protected].
Program Committee
- Abhijit Mahabal, Pinterest
- Cassandra Jacobs, University of Wisconsin
- Chris Baldassano, Columbia University
- Cory Shain, The Ohio State University
- Emily Mugler, Facebook
- Evelina Fedorenko, Massachusetts Institute of Technology
- Guangyu Robert Yang, Columbia University
- Jascha Sohl-Dickstein, Google Brain
- Jun Tani, Okinawa Institute of Science and Technology Graduate University
- Katrin Erk, University of Texas
- Kenji Doya, Okinawa Institute of Science and Technology
- Liberty Hamilton, The University of Texas at Austin
- Samuel Bowman, NYU and Google Research
- Shimon Edelman, Cornell University
- Srini Narayanan, Google AI Language
- Vy Vo, Intel Labs
Sponsors
References
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Comput., 9(8):1735–1780, Nov. 1997.
- J. Chung, C. Gulcehre, K. Cho, and Y. Bengio. Gated feedback recurrent neural networks. Proceedings of the 32nd International Conference on Machine Learning, volume 37 of Proceedings of Machine Learning Research, pages 2067–2075, Lille, France, 07–09 Jul 2015.
- S. Chandar, C. Sankar, E. Vorontsov, S. E. Kahou, and Y. Bengio. Towards non-saturating recurrent units for modelling long-term dependencies. arXiv preprint arXiv:1902.06704, 2019.
- J. Chung, S. Ahn, and Y. Bengio. Hierarchical multiscale recurrent neural networks. In ICLR, 2017.
- S. Bai, J. Z. Kolter, and V. Koltun. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. CoRR, abs/1803.01271, 2018.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin. Attention is all you need. In I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, editors, Advances in Neural Information Processing Systems 30, pages 5998–6008. Curran Associates, Inc., 2017.
- M. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, and L. Zettlemoyer. Deep contextualized word representations. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 2227–2237, New Orleans, Louisiana, June 2018. Association for Computational Linguistics.
- J. Devlin, M. Chang, K. Lee, and K. Toutanova. BERT: Pre-training of deep bidirectional transformers for language understanding. CoRR, abs/1810.04805, 2018.
- A. G. Huth, W. A. de Heer, T. L. Griffiths, F. E. Theunissen, and J. L. Gallant. Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600):453–458, 2016.
- C. Baldassano, J. Chen, A. Zadbood, J. W. Pillow, U. Hasson, and K. A. Norman. Discovering event structure in continuous narrative perception and memory. Neuron, 95(3):709–721.e5, 2017.
- K. D. Himberger, H.-Y. Chien, and C. J. Honey. Principles of temporal processing across the cortical hierarchy. Neuroscience, 389:161–174, 2018.
- L. Wehbe, A. Vaswani, K. Knight, and T. Mitchell. Aligning context-based statistical models of language with brain activity during reading. In EMNLP, 2014.
- S. Jain and A. Huth. Incorporating context into language encoding models for fMRI. In Advances in Neural Information Processing Systems 31, pages 6628–6637, 2018.
- Y. Lerner, C. J. Honey, L. J. Silbert, and U. Hasson. Topographic mapping of a hierarchy of temporal receptive windows using a narrated story. Journal of Neuroscience, 31(8):2906–2915, 2011.