CVPR2024 Tutorial on
Generalist Agent AI

Time and Venue

Date:
June 18, 2024

Time:
13:30pm - 18:00pm

Room:
TBD

Venue:
Seattle Convention Center, Seattle, USA

Zoom:
Zoom Meeting Link


Overview

Generalist Agent AI (GAA) is a family of systems that generate effective actions in a given environment based on the understanding of multimodal sensory input. With the advent of large foundation models, numerous GAA systems have been proposed in fields ranging from basic research to applications. While these research areas are growing rapidly by integrating with the traditional technologies of each domain, they share common interests such as data collection, benchmarking, and ethical perspectives. In this tutorial, we focus on the some representative research areas of Embodied GAA, namely embodied-multimodality, robotics, gaming (VR/AR/MR), and healthcare, etc., and we aim to provide comprehensive knowledge on the common concerns discussed in these fields. As a result we expect the participants to learn the fundamentals of GAA and gain insights to further advance their research. Specific learning outcomes include:

  • GAA Overview: A deep dive into its principles and roles in contemporary applications, providing attendees with a thorough grasp of its importance and uses.
  • Methodologies: Detailed examples of how large foundation model enhance GAAs, illustrated through case studies in embodied virtual and real world, e.g., robotics, gaming, and healthcare.
  • Performance Evaluation: Guidance on the assessment of GAAs with relevant datasets, focusing on their effectiveness and generalization.
  • Ethical Considerations: A discussion on the societal impacts and ethical challenges of deploying Agent AI, highlighting responsible development practices.
  • Emerging Trends and Future Challenges: Categorize the latest developments in each domain and discuss the future directions.

Led by esteemed experts from academia and industry, we expect that the tutorial will be an interactive and enriching experience, complete with lectures, case studies, Q&A sessions, and panel discussion ensuring a comprehensive and engaging learning experience for all participants.



Tutorial Materials

Download links will be available



Timetable Schedule

Time Slot Talk Scheduling Areas
13:30 - 13:40 Qiuyuan Huang Opening Remarks
13:40 - 14:20 Talk1: Juan Carlos Niebles LLM tool-based agents
14:20 - 15:00 Talk2: Katsushi Ikeuchi Agent Robotics
15:00 - 16:00 Coffee Break
16:00 - 16:40 Talk3: TDB TBD
16:40 - 17:50

Juan Carlos Niebles, Katsushi Ikeuchi, Hoi Vo, Jianfeng Gao

Panels and Q&A
17:50 - 18:00 Naoki Wake Ending Remarks