392265 Project: SignSpeak: Generating Sign Language Videos from Speech using Deep Generative Models (Pj) (SoSe 2026)

Contents, comment

Current AI systems can accurately transcribe speech and generate text-based translations. However, generating sign language videos directly from speech remains a challenging problem, as sign languages are complex visual languages that rely on hand gestures, facial expressions, and body movements.
This project explores how multimodal representations extracted from speech can be used to generate sign-language video sequences with deep generative models.
The goal is to design and implement a prototype pipeline that maps speech features to temporally coherent sign language representations and generates corresponding video output. The system will be trained and evaluated using multimodal datasets containing aligned speech, text, and sign language videos.
Depending on the number of students and the project scope, the project can also include:
• Evaluation of the intelligibility and realism of generated sign language videos
• Analysis of temporal alignment between speech and generated signs
• Comparison of direct speech-based vs. speech-to-text-to-sign pipelines

Requirements for participation, required level

• Good programming skills in Python (ideally PyTorch)
• Basic knowledge of machine learning / deep learning
• Interest in generative AI and multimodal systems
• (Preferably) experience or strong interest in speech processing or video processing (e.g., speech recognition, audio features, or temporal data such as video or pose sequences)
Upon completion of this project, we will work hand in hand to publish the results in a well-established conference or journal in Human–Computer Interaction (HCI) or Computer Vision (CV)

Teaching staff

Dates ( Calendar view )

Frequency Weekday Time Format / Place Period  
by appointment n.V.   13.04.-24.07.2026 Nach Vereinbarung, online, CITEC oder R.1

Subject assignments

Module Course Requirements  
39-M-Inf-AI-app-foc_a Applied Artificial Intelligence (focus) Applied Artificial Intelligence (focus) Applied Artificial Intelligence (focus): Project Study requirement
Student information
39-M-Inf-INT-app-foc_a Applied Interaction Technology (focus) Applied Interaction Technology (focus) Applied Interaction Technology (focus): Project Study requirement
Student information

The binding module descriptions contain further information, including specifications on the "types of assignments" students need to complete. In cases where a module description mentions more than one kind of assignment, the respective member of the teaching staff will decide which task(s) they assign the students.


No more requirements
No eLearning offering available
Address:
SS2026_392265@ekvv.uni-bielefeld.de
This address can be used by teaching staff, their secretary's offices as well as the individuals in charge of course data maintenance to send emails to the course participants. IMPORTANT: All sent emails must be activated. Wait for the activation email and follow the instructions given there.
If the reference number is used for several courses in the course of the semester, use the following alternative address to reach the participants of exactly this: VST_720235317@ekvv.uni-bielefeld.de
Notes:
Additional notes on the electronic mailing lists
Last update basic details/teaching staff:
Monday, April 27, 2026 
Last update times:
Saturday, April 25, 2026 
Last update rooms:
Saturday, April 25, 2026 
Type(s) / SWS (hours per week per semester)
project (Pj) / 2
Department
Faculty of Technology
Questions or corrections?
Questions or correction requests for this course?
Planning support
Clashing dates for this course
Links to this course
If you want to set links to this course page, please use one of the following links. Do not use the link shown in your browser!
The following link includes the course ID and is always unique:
https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=720235317
Send page to mobile
Click to open QR code
Scan QR code: Enlarge QR code
ID
720235317