Courses

Winter Term 2016/2017

Previous Lectures

For a complete list of previous lectures and student projects please click here.

Finished Thesis Projects

Download links for PDFs are only available to users who are logged in. They can also be sent to visitors via email on request; please contact one of the supervisors if you are interested.

Spotlights of Recent Theses

Daniel Stratz: Khepera III Navigation mit internen Sensoren

In dieser Arbeit soll eine Implementierung vorgestellt werden, die den Khepera möglichst genau entlang der Wand einer Testumgebung fahren lässt und hierbei die Ecken der Testumgebung erkennt und die Position dieser bestimmt.

Bachelor Thesis
Supervisors: Martin Riedmiller, Manuel Blum
Jan Wülfing: Feature Learning for Music Classification

In the area of music information retrieval, automatic categorization of music pieces into categories such as mood, artist or genre is a widely studied topic. Those categorization taks basically consist of two steps, feature selection/extraction and classification. In the past, a wide range of features have been proposed, most of them hand designed. More recently, there has been a growing interest in methods that learn those features from data. In his masters thesis, Jan works on a feature learning method that has been successfully used for object recognition in images, and investigates the applicability of this method for genre classification of music pieces.

Master Thesis
Supervisors: Martin Riedmiller
Tobias Springenberg: Feature Learning using Temporal Coherence

In his thesis Tobias develops feature learning algorithms that capture the relation between input samples. Most current feature learning algorithms were developed for object recognition in still images and are thus unable to capture this temporal information. This problem becomes apparent in many applications for which feature learning might be desireable such as: human motion and action recognition in video data, tracking and stereo vision. Apart from these applications features learned using temporal information can also be used to improve object recognition performance under complex image transformations.

Master Thesis
Supervisors: Martin Riedmiller
Hadi Kheyruri: Ball detection for RoboCup

Hadi develops computer vision algorithms, that are able to detect an arbitrary soccer ball in a RoboCup setting. Previously RoboCup balls were limited to a specific color and texture which is known before a competition. In this work Hadi tries to expand the state-of-the-art techniques in two ways. First by the collection of a well suited data-set, to perform as a benchmark for other researchers interested in investigating ball detection. Second part of this work focuses on the development and implementation of detection algorithms.

Master Thesis
Supervisors: Martin Riedmiller, Thomas Brox
Jeremias Holub: Approximate Dynamic Programming with a Fuzzy Parameterization

In this thesis, Jeremias implements and analyzes Fuzzy Q-iteration, a Q-iteration algorithm with a fuzzy parameterization of the state space and a discretization of the action space. A Fuzzy Q-iteration controller is implemented using the open source software CLSquare. The performance is tested within a two-link manipulator control problem and is compared with other state-of-the-art reinforcement learning algorithms.

Bachelor Thesis
Supervisors: Manuel Blum, Martin Riedmiller
Jan MattnerJan Mattner: Application of Visual Deep Learning to a Real-World System

Jan's thesis focuses on the problem of delays and inaccuracies which arises when controlling a dynamic system on basis of raw camera images. An augmented state space is used to overcome these difficulties. His experiments empirically demonstrate learning on automatically constructed feature spaces to be in the same order of complexity as learning on low-dimensional sensory data.

Bachelor Thesis, finished 10/2011
Supervisors: Sascha Lange, Martin Riedmiller

PDF

Jan MattnerManuel Watter: Object Tracking using Reinforcement Learning with a Pan-Tilt Camera

This thesis explores the challenge to focus and follow a moving object in a actuated Pan-Tilt Camera. An algorithm was developed to center a general object in the camera and to learn an augmentation add-on controller to be able to follow once this object starts moving on a predefined path.

Bachelor Thesis, finished 9/2011
Supervisors: Stefan Welker, Martin Riedmiller

PDF

Nikolaus MayerNikolaus Mayer: Globale Explorationsstrategien (German)

This thesis analyzes the effects of globally directed exploration on a learning agent's performance. We introduce the BELT growing-batch algorithm which, for the first time, employs a learning procedure to create an independent policy for exploration itself. Compared to undirected and locally directed exploration, we find global exploration to accelerate the learning of good policies.

Bachelor Thesis, finished 09/2011
Supervisors: Sascha Lange, Martin Riedmiller

PDF

Arne VoigtländerArne Voigtländer: Using Visual Deep Reinforcement Learning to Control a Real-World Dynamical System

In his thesis, Arne applies our ground-breaking Deep Fitted Q Algorithm (DFQ) to a real-world dynamical system. DFQ combines batch reinforcement learning methods with deep neural networks in order to solve visual reinforcement learning tasks. With his thesis Arne has demonstrated that it is possible to learn near-optimal control policies from raw camera data just by trial and error.

Master Thesis, finished 07/2010
Supervisors: Sascha Lange, Martin Riedmiller

PDF

Full List of Theses

Student Title Type Completion
Christopher Schröder Sentimentanalyse deutschsprachiger Twitter-Nachrichten Bachelor 2014
Florian Geißer General Game Playing under Uncertainty Master 2013
Markus Grützner Detektion von Multis in MMO durch Analyse von Handelsaktivitäten Master 2013
Max Schumacher A study of DFQ on playing pong Bachelor 2013
Denis Stier Semantic Attachments for Decision-Theoretic Planning Master 2013
Jonas Delleske Verbesserung des client-side Caching der Spielkarte im Browserspiel Wack-a-doo mit Techniken des Machine Learnings Bachelor 2013
Daniel Stratz Khepera III Navigation mit internen Sensoren Bachelor 2013
Tatiana Jimenez Cardenas Training Pose Specific Detectors Master 2013
Mesbahuddin Anwari Learning binding preferences of RNA-binding proteins using in vitro affinities and in vivo binding sites Master 2012
Stefan Band Ein KI-Ansatz für Strategiespiele Bachelor 2012
Urban Himmelsbach Entwicklung eines autonom fahrenden Fahrzeugs Master 2012
Jost Tobias Springenberg Feature Learning using Temporal Coherence Master 2012
Jan Manuel Wülfing Unsupervised Feature Learning for Music Classification Master 2012
Martin Gloderer Optimizing Sequences of Learned Movement Primitives Using Data Gathered from the Demonstrations Master 2012
Hadi Kheyruri Ball Detection Based on Histogram of Features on an own collected Dataset for Soccer Robots Master 2012
Jeremias Holub Implementierung eines approximativen Q-Iterationsalgorithmus mit einer unscharfen Parametrisierung Bachelor 2012
Dragos Alexandru Sorescu De Novo Molecular Design Using Graph Kernels Master 2012
Johannes Andreas Stork Learning Social Navigation Behaviors with Dynamic Features and Inverse Reinforcement Learning Master 2012
Jens Silva Santisteban Socially Acceptable Robot Navigation: A Learning Approach Master 2012
Kiran Kumar Telukunta Secondary structure motif determination in ncRNA via graph kernel based computational models Master 2012
Julian Schmid Kontrollieren von Einheiten in Starcraft mit Reinforcement Lernen Bachelor 2012
Jörg Holschuh Zustandskodierung für Reinforcement Learning mittels neuronaler Netze Bachelor 2012
Johannes Aldinger Algorithms for the Canadian Traveler´s Problem with Remote Sensing Master 2011
Jan Peter Mattner Application of Visual Deep Learning to a Real-World System Bachelor 2011
Manuel Meyer Überwachtes Lernen in pixelbasierter Objekterkennung Bachelor 2011
Manuel Watter Objektverfolgung mittels Reinforcement Learning am bewegten Kamerakopf Bachelor 2011
Nikolaus Mayer Globale Explorationsstrategien Bachelor 2011
Li Zhang RNA-Protein Interaction Prediction with Graph Kernels Master 2011
Elmar Haussmann Contextual Sentence Decomposition with Applications to Semantic Full-Text Search Master 2011
Thomas Huber Evaluation einer dateneffizienten neuronalen Lernmethode am Katana Roboterarm Master 2011
Thorsten Engesser Generalisierendes Neural Fitted Q Learning im TORCS-Competition-Framework Bachelor 2011
Benedikt Mendorf Analyse und Gegenüberstellung von selbstlernender neuronaler Reglern und PID-Reglern im pneumatischen Positionierungssystem Bachelor 2011
Philipp Blohm Maschinelle Lernverfahren in Information Retrieval Master 2011
Oliver Mitevski Semantic Indexing in Information Retrieval Master 2011
Alexandru-Dorin Oprea Recommender System for Research Papers Master 2010
Roxana Dolores Bersan Plan-based Control of Characters in Video Games Master 2010
Norman Kohler Bewegungsplanung für autonome Luftschiffe Diploma 2010
Denis Stier Einsatz von Reinforcment Learning zur Geschwindigkeitsregulierung in mikroskopischer Verkehrsimulation Bachelor 2010
Christian Lutz Autonom lernende Agenten in Computerspielen: Anwendung von Reinforcement-Lernverfahren in Star Ships Learning Framework Bachelor 2010
Stefan Keller Priorisierung von Lernzielen im Nebenfach Medizin für Informatiker Diploma 2010
Manuel Blum Merkmalsextraktion bei Klassifikationsproblemen Master 2010
Thomas Lampe Visual Servoing durch Neuronales Reinforcement Learning Master 2010
Andreas Witsch
(Uni Kassel)
Policy Gradient Methods Diploma 2010
Jonas Gehring Intelligente Objekterkennung für ein lernfähiges Carrerabahn-System Bachelor 2009
Nathan Franklin Vision Based Navigation Master 2009
Hannes Schulz Euclidean Embedding of Co-Proven Queries Master 2009
Matthias Laquai Intelligente Steuerung autonomer Carrera-Rennwagen mittels Lernverfahren Diploma 2009
Jost Tobias Springenberg Machine Learning on Massively Parallel Architectures - A Case Study Bachelor 2009
Philipp Blohm Beurteilung von Stärken und Schwächen von neuroevolutionären Verfahren zum Einsatz im Bereich des Reinforcement Learnings Bachelor 2008
Manuel Blum Klassifikation von Bilddaten mittels rekurrenter neuronaler Netze zur Steuerung eines Torwarts im Roboterfußball Bachelor 2008
Christian Müller Reinforcement-Lernen mit Pac-Man Bachelor 2007
Sebastian Schnock Implementation und Evaluation von Neuroevolution in CLS Bachelor 2007
Hussain Iqbal Evolutionary Algorithm using neuro-evolutionary techniques in dynamic environment Master 2007
Michael Wiegand
(FH Karlsruhe)
Vorhersage der Bewegung eines autonomen Roboters Diploma 2007
Tim Kitzmann View-Based Object Recognition: A Biologically Motivated Approach using iGRLVQ Bachelor 2006
Andrea Albrecht Erkennung und Klassifizierung von Objekten mit Neuronalen Netzen Bachelor 2007
Srinivas Gudipally Survey: Approaches of Artificial Intelligence with the focus on Machine Learning within the RoboCup domain Master 2006
Andreas G. Nie RoboPECS - Autonome Fußballfans im RoboCup Entwicklung eines Multiagentensystems zur Simulation von Fußballfans Magister 2005