Grounding Language into the External Visual World
Everyday interactions require a common understanding of language, i.e. for people to communicate effectively, words (for example ‘cat’) should invoke similar beliefs over physical concepts (what cats look like, the sounds they make, how they behave, what their skin feels like etc.). However, how this ‘common understanding’ emerges is still unclear. One appealing hypothesis is that language is tied to how we interact with the environment. As a result, meaning emerges by ‘grounding’ language in modalities in our environment (images, sounds, actions, etc.).
This course will review recent works in machine learning which bridges visual and natural language understanding through visually-grounded language learning tasks, e.g. through natural images (Visual Question Answering, Visual Dialog), or through interactions with virtual physical environments.
As the grounding problem requires an interdisciplinary attitude, this course aims to gather students with broad expertise in various fields -- machine learning, computer vision, natural language, neuroscience, and psychology -- and who are excited about this space of grounding and interactions.
Ort: nicht angegeben
Zeiten: Do. 09:00 - 12:00 (wöchentlich)
Erster Termin: Donnerstag, 07.04.2022 09:00 - 12:00
Veranstaltungsart: Seminar (Offizielle Lehrveranstaltungen)
- Interdisziplinäre Studiengänge > Sprache in Europa > Sprache und Kognition
- Cognitive Science > Bachelor-Programm
- Cognitive Science > Master-Programm