Start my eKVV Studieninformation Lernräume Prüfungsverwaltung Anmelden

392115 Deep Reinforcement Learning (S) (WiSe 2017/2018)

Inhalt, Kommentar

Deep Reinforcement Learning
Seminar im WS 2015/16
Dr. Malte Schilling mschilli@techfak.uni-bielefeld.de Office 1.414 (CITEC Building)

Summary
Many seemingly simple tasks are hard to describe in a formal way. Nonetheless, humans are quite good to solve such problems and manage such tasks. Crucial is the ability to learn which is a characteristic and prerequisite for intelligent behavior. Machine Learning in general aims at learning how to solve a task through training instead of relying on a formal description.
Neural Networks provide a learning approach that addresses the problem of (very) high dimensional input data and how to project it into lower dimensional spaces. A neuronal network can therefore be understood as approximating a function in high dimensional spaces. Modern deep learning provides a very powerful framework for supervised learning. By adding more layers and more units within a layer, a deep network can represent functions of increasing complexity. Most tasks that consist of mapping an input vector to an output vector, and that are easy for a person to do rapidly, can be accom-plished via deep learning, given sufficiently large models and sufficiently large datasets of labeled training examples. Oth-er tasks, that can not be described as associating one vector to another, or that are difficult enough that a person would require time to think and reflect in order to accomplish the task, remain beyond the scope of deep learning for now.
Reinforcement learning (RL) is usually about sequential decision making, solving problems in a wide range of fields in science, engineering and arts (Sutton and Barto, 2017). With recent exciting achievements of deep learning (LeCun et al., 2015; Goodfellow et al., 2016), benefiting from big data, powerful computation and new algorithmic techniques, we have been witnessing the renaissance of reinforcement learning (Krakovsky, 2016), especially, the combination of reinforce-ment learning and deep neural networks, i.e., deep reinforcement learning (deep RL).
The seminar focus is on current approaches to deep reinforcement learning as has been explored widely in the last cou-ple of years in the area of learning decisions in computer games. Further, current work extends these approaches to more real world problems as grasping in robotics.
The seminar will give an introduction into the theoretical background of Neural Networks, Deep Learning and Reinforce-ment Learning. It will afterwards deal with state of the art methods and research literature presenting those methods. The seminar aims at comparing different approaches and providing an overview on current evolving principles and questions.

Lehrende

Termine (Kalendersicht )

Rhythmus Tag Uhrzeit Ort Zeitraum  

Zeige vergangene Termine >>

Klausuren

  • Keine gefunden

Fachzuordnungen

Modul Veranstaltung Leistungen  
39-Inf-EGMI Ergänzungsmodul Informatik vertiefendes Seminar 1 unbenotete Prüfungsleistung
Studieninformation
vertiefendes Seminar 2 unbenotete Prüfungsleistung
Studieninformation
vertiefendes Seminar 3 unbenotete Prüfungsleistung
Studieninformation
vertiefendes Seminar 4 unbenotete Prüfungsleistung
Studieninformation

Die verbindlichen Modulbeschreibungen enthalten weitere Informationen, auch zu den "Leistungen" und ihren Anforderungen. Sind mehrere "Leistungsformen" möglich, entscheiden die jeweiligen Lehrenden darüber.

Konkretisierung der Anforderungen

Aktive Teilnahme
• regelmäßige Teilnahme an dem Seminar,
• aktive Beteiligung an Diskussionen,
• Vorbereitung der Sitzungen und lesen der Literatur (oder der für den jeweiligen Termin vorgeschlagenen Auszüge aus einem wissenschaftlichen Artikel).
• Zusammenfassung eines Artikels im Seminar.
• Vorstellen eines Artikels in einer Präsentation. Inklusive Vorbereitung einer Diskussion, d.h. Formulierung von Fra-gen als Diskussionsgrundlage.

Lernraum
TeilnehmerInnen
Automatischer E-Mailverteiler der Veranstaltung
Änderungen/Aktualität der Veranstaltungsdaten
Sonstiges