392231 Project: Helping Robots Understand Complex Tasks by Discovering Sub-Tasks with a 3D-printed Arm (Pj) (WiSe 2025/2026)

Contents, comment

Complex robotic tasks like assembling objects or stacking blocks often involve multiple, smaller steps or phases. Humans naturally split these complex actions into clear sub-steps ("pick up block", "place block on stack"), but explicitly labeling and teaching robots each step individually is tedious and costly. This project explores unsupervised methods, based on pretrained visual representations, to automatically identify sub-task boundaries from unlabeled video demonstrations. You will collect simulated demonstrations of a three-block stacking task, identify potential phase boundaries based on visual embedding similarities, and integrate these discovered phases into a reset-free reinforcement learning framework. The goal is to significantly reduce external resets during training and speed up policy convergence.

This project is particularly valuable for students interested in task decomposition, representation learning, and hierarchical RL. It provides practical experience in a highly relevant research area and a solid foundation for future thesis or project work. When applicable, your results can be published as a short benchmark note or as an appendix to an existing paper.

For more details or to apply, feel free to contact me directly via email or in-person.

Requirements for participation, required level

You should be familiar with a python-based simulation framework and visual embeddings (e.g., CLIP and ResNets). Experience with PyTorch and basic RL knowledge will be beneficial. The project provides you with a prebuilt 3D printed WidowX arm, ready-to-use MuJoCo simulation environments, baseline RL implementations, and all necessary computational resources (though bringing your own GPU is a plus).

Teaching staff

Dates ( Calendar view )

Frequency Weekday Time Format / Place Period  
by appointment n.V.   13.10.2025-06.02.2026

Subject assignments

Module Course Requirements  
39-M-Inf-P Projekt Projekt Ungraded examination
Student information

The binding module descriptions contain further information, including specifications on the "types of assignments" students need to complete. In cases where a module description mentions more than one kind of assignment, the respective member of the teaching staff will decide which task(s) they assign the students.


Automatically discover sub-task boundaries within complex manipulation tasks to enable efficient long-horizon Reinforcement Learning.

No eLearning offering available
Address:
WS2025_392231@ekvv.uni-bielefeld.de
This address can be used by teaching staff, their secretary's offices as well as the individuals in charge of course data maintenance to send emails to the course participants. IMPORTANT: All sent emails must be activated. Wait for the activation email and follow the instructions given there.
If the reference number is used for several courses in the course of the semester, use the following alternative address to reach the participants of exactly this: VST_568327276@ekvv.uni-bielefeld.de
Notes:
Additional notes on the electronic mailing lists
Last update basic details/teaching staff:
Tuesday, June 24, 2025 
Last update times:
Sunday, June 15, 2025 
Last update rooms:
Sunday, June 15, 2025 
Type(s) / SWS (hours per week per semester)
project (Pj) / 2
Language
This lecture is taught in english
Department
Faculty of Technology
Questions or corrections?
Questions or correction requests for this course?
Planning support
Clashing dates for this course
Links to this course
If you want to set links to this course page, please use one of the following links. Do not use the link shown in your browser!
The following link includes the course ID and is always unique:
https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=568327276
Send page to mobile
Click to open QR code
Scan QR code: Enlarge QR code
ID
568327276