Robot learning is currently being reshaped by foundation models: large-scale pretraining, generalist policies, and the integration of vision-language models with robot control. This seminar provides a structured introduction to modern Vision-Language-Action (VLA) systems and generalist robot policies, and focuses on the practical design choices that determine whether such models work reliably on real robots. We will start from the basics of modern imitation learning (behavioral cloning) and cover key policy families for robot manipulation, including action chunking transformers and diffusion/flow-based policies. Building on this, we will discuss what matters in learning from demonstrations, with an emphasis on data quality, action representations and tokenizers, and the trade-offs between autoregressive and generative (diffusion/flow) action generation. We will also introduce recent ideas for robust adaptation, such as modular design and knowledge insulation, and give a light overview of how experience-based fine-tuning can further improve performance.
| Frequency | Weekday | Time | Format / Place | Period |
|---|
| Module | Course | Requirements | |
|---|---|---|---|
| 39-M-Inf-AI-app-foc_a Applied Artificial Intelligence (focus) Applied Artificial Intelligence (focus) | Applied Artificial Intelligence (focus): Seminar | Student information | |
| Applied Artificial Intelligence (focus): application-oriented seminar 1 | Study requirement
|
Student information | |
| Applied Artificial Intelligence (focus): application-oriented seminar 2 | Student information | ||
| - | Graded examination | Student information | |
| 39-M-Inf-ASE-app-foc_a Applied Autonomous Systems Engineering (focus) Applied Autonomous Systems Engineering (focus) | Applied Autonomous Systems Engineering (focus): Seminar | Student information | |
| Applied Autonomous Systems Engineering (focus): Application-oriented seminar 1 | Student information | ||
| Applied Autonomous Systems Engineering (focus): Application-oriented seminar 2 | Student information | ||
| - | Graded examination | Student information | |
| 39-M-Inf-INT-app-foc_a Applied Interaction Technology (focus) Applied Interaction Technology (focus) | Applied Interaction Technology (focus) - Seminar | Student information | |
| Applied Interaction Technology (focus): Application-oriented seminar 1 | Study requirement
|
Student information | |
| Applied Interaction Technology (focus): Application-oriented seminar 2 | Student information | ||
| - | Graded examination | Student information |
The binding module descriptions contain further information, including specifications on the "types of assignments" students need to complete. In cases where a module description mentions more than one kind of assignment, the respective member of the teaching staff will decide which task(s) they assign the students.
The seminar will start with two introductory lectures. Each week, one student will present a research paper from the reading list and lead the discussion. At the end of the semester, each student has the option to submit an essay surveying related literature to obtain extra credit.