392284 ISY Project: Unsupervised pretraining & transfer learning in pixel based deep reinforcement learning (Pj) (SoSe 2018)

Contents, comment

Pixel-based end-to-end reinforcement learning has recently attracted a lot of attention, particularly
as a result of the success in gaming (published by a Google team for Atari games in Nature) and
subsequent efforts follow in these footsteps. A drawback of such end-to-end learning for image-based
signals is that the internal representation of data is learned afresh for every new game. The goal of
the project is to investigate whether it is possible to learn a more universal representation by means of
deep learning (such as deep autoencoders) which enables to leverage end-to-end learning for
specific games by its particularly efficient initialization. Such transfer could leverage the efficiency
of reinforcement learning for one specific game but possibly also along different games where different
visual features ight be of importance. Within the project, the task is, based on a basic end-to-end RL
chain and standard autoencoders, to elucidate suitable scenarios in which such transfer of the learned
representation can be tested.

- In case this would not find enough interest for a team project, this project proposal would be also offered (in reduced/modified form)

The project team for this project has already been selected, no additional applications are considered.

Requirements for participation, required level

- Required skils (e.g. mandatory courses, if required)
Advanced Machine Learning skills and knowledge about deep learning technology (eg tensor flow) are required.

Teaching staff

Subject assignments

Module Course Requirements  
39-M-Inf-GP Grundlagenprojekt Intelligente Systeme Gruppenprojekt Ungraded examination
Student information

The binding module descriptions contain further information, including specifications on the "types of assignments" students need to complete. In cases where a module description mentions more than one kind of assignment, the respective member of the teaching staff will decide which task(s) they assign the students.


No more requirements
No eLearning offering available
Registered number: 4
This is the number of students having stored the course in their timetable. In brackets, you see the number of users registered via guest accounts.
Address:
SS2018_392284@ekvv.uni-bielefeld.de
This address can be used by teaching staff, their secretary's offices as well as the individuals in charge of course data maintenance to send emails to the course participants. IMPORTANT: All sent emails must be activated. Wait for the activation email and follow the instructions given there.
If the reference number is used for several courses in the course of the semester, use the following alternative address to reach the participants of exactly this: VST_124014825@ekvv.uni-bielefeld.de
Coverage:
2 Students to be reached directly via email
Notes:
Additional notes on the electronic mailing lists
Last update basic details/teaching staff:
Monday, February 12, 2018 
Last update times:
?
Last update rooms:
?
Type(s) / SWS (hours per week per semester)
project (Pj) / 4
Department
Faculty of Technology
Questions or corrections?
Questions or correction requests for this course?
Planning support
Clashing dates for this course
Links to this course
If you want to set links to this course page, please use one of the following links. Do not use the link shown in your browser!
The following link includes the course ID and is always unique:
https://ekvv.uni-bielefeld.de/kvv_publ/publ/vd?id=124014825
Send page to mobile
Click to open QR code
Scan QR code: Enlarge QR code
ID
124014825