Ohio State nav bar

Seminar: Multimodal Embodied Agents: Situated Vision, Language, and Action Learning

Headshot: Dr. Xin Wang, UC Santa Cruz
February 5, 2024
10:00AM - 11:00AM
301 Pomerene Hall

Date Range
Add to Calendar 2024-02-05 10:00:00 2024-02-05 11:00:00 Seminar: Multimodal Embodied Agents: Situated Vision, Language, and Action Learning TDAI's Foundations of Data Science & AI community of practice is pleased to present Dr. Xin (Eric) Wang (Computer Science & Engineering, University of California, Santa Cruz) on Monday, Feb. 5, at 10:00 a.m. in 301 Pomerene Hall, who will give a talk entitled "Multimodal Embodied Agents: Situated Vision, Language, and Action Learning." Register for in-person     Register for ZoomAbstract: A key ambition in AI research is to develop multimodal embodied agents that can interact with humans, understand their environment, and perform tasks in the real world. These agents have the potential to transform a range of applications, from household chores to critical missions.In this talk, I will present our latest breakthroughs in this domain, addressing key challenges of situated decision-making. We begin by exploring the extension of large language models (LLMs) for commonsense reasoning in the fundamental object navigation task, by calibrating the probabilistic outputs of LLMs in a “soft” manner.Then I will discuss the significance of vision and language grounding in embodied agents, and introduce our pioneering method that employs a causal lens to improve this grounding capability, which has shown remarkable impacts in various downstream tasks. In addition, I will briefly overview our comprehensive efforts in advancing and evaluating these agents across different scenarios, and conclude with a discussion of future research plans.Bio:Xin (Eric) Wang is an Assistant Professor of Computer Science and Engineering at UC Santa Cruz. His research interests include Natural Language Processing, Computer Vision, and Machine Learning, with an emphasis on Multimodal and Embodied AI. He worked at Google AI, Facebook AI Research, Microsoft Research, and Adobe Research.Xin has served as Area Chair for conferences such as ACL, NAACL, EMNLP, ICLR, and NeurIPS, as well as a Senior Program Committee for AAAI and IJCAI. He organized workshops and tutorials at conferences such as ACL, NAACL, CVPR, and ICCV. He has received several awards and recognitions for his work, including CVPR Best Student Paper Award, Google Research Faculty Award, Amazon Alexa Prize Awards, and various gift awards from Adobe, Snap, eBay, etc. 301 Pomerene Hall Translational Data Analytics Institute tdai@osu.edu America/New_York public

TDAI's Foundations of Data Science & AI community of practice is pleased to present Dr. Xin (Eric) Wang (Computer Science & Engineering, University of California, Santa Cruz) on Monday, Feb. 5, at 10:00 a.m. in 301 Pomerene Hall, who will give a talk entitled "Multimodal Embodied Agents: Situated Vision, Language, and Action Learning." 

Register for in-person     Register for Zoom

Abstract: 

A key ambition in AI research is to develop multimodal embodied agents that can interact with humans, understand their environment, and perform tasks in the real world. These agents have the potential to transform a range of applications, from household chores to critical missions.

In this talk, I will present our latest breakthroughs in this domain, addressing key challenges of situated decision-making. We begin by exploring the extension of large language models (LLMs) for commonsense reasoning in the fundamental object navigation task, by calibrating the probabilistic outputs of LLMs in a “soft” manner.

Then I will discuss the significance of vision and language grounding in embodied agents, and introduce our pioneering method that employs a causal lens to improve this grounding capability, which has shown remarkable impacts in various downstream tasks. In addition, I will briefly overview our comprehensive efforts in advancing and evaluating these agents across different scenarios, and conclude with a discussion of future research plans.

Bio:

Xin (Eric) Wang is an Assistant Professor of Computer Science and Engineering at UC Santa Cruz. His research interests include Natural Language Processing, Computer Vision, and Machine Learning, with an emphasis on Multimodal and Embodied AI. He worked at Google AI, Facebook AI Research, Microsoft Research, and Adobe Research.

Xin has served as Area Chair for conferences such as ACL, NAACL, EMNLP, ICLR, and NeurIPS, as well as a Senior Program Committee for AAAI and IJCAI. He organized workshops and tutorials at conferences such as ACL, NAACL, CVPR, and ICCV. He has received several awards and recognitions for his work, including CVPR Best Student Paper Award, Google Research Faculty Award, Amazon Alexa Prize Awards, and various gift awards from Adobe, Snap, eBay, etc.

Events Filters: