Publication date | Title | Author(s) |
2009 | A case study on improving defense behavior in soccer simulation 2D: The NeuroHassle approach | Gabel, T.; Riedmiller, M.; Trost, F. |
2006 | Abstract state spaces with history | Timmer, S.; Riedmiller, M. |
2007 | An analysis of case-based value function approximation by approximating state transition graphs | Gabel, T.; Riedmiller, M. |
2007 | Appearance-based robot discrimination using eigenimages | Lange, S.; Riedmiller, M. |
2005 | Comparing different methods to speed up reinforcement learning in a complex domain | Riedmiller, M.; Withopf, D. |
2005 | Effective methods for reinforcement learning in large multi-agent domains [Leistungsfähige verfahren für das reinforcement lernen in komplexen multi-agenten-umgebungen] | Riedmiller, M.; Withopf, D. |
2008 | Evaluation of batch-mode reinforcement learning methods for solving DEC-MDPs with changing action sets | Gabel, T.; Riedmiller, M. |
2007 | Evaluation of policy gradient methods and variants on the cart-pole benchmark | Riedmiller, M.; Peters, J.; Schaal, S. |
2007 | Fitted Q iteration with CMACs | Timmer, S.; Riedmiller, M. |
2008 | Increasing precision of credible case-based inference | Gabel, T.; Riedmiller, M. |
2008 | Joint equilibrium policy search for multi-agent scheduling problems | Gabel, T.; Riedmiller, M. |
2005 | Learning policies for abstract state spaces | Timmer, S.; Riedmiller, M. |
2008 | Learning to dribble on a real robot by success and failure | Riedmiller, M.; Hafner, R.; Lange, S.; Lauer, M. |
2007 | Learning to drive a real car in 20 minutes | Riedmiller, M.; Montemerlo, M.; Dahlkamp, H. |
2007 | Making a robot learn to play soccer using reward and punishment | Müller, H.; Lauer, M.; Hafner, R.; Lange, S.; Merke, A.; Riedmiller, M. |
2007 | Neural reinforcement learning controllers for a real robot application | Hafner, R.; Riedmiller, M. |
2005 | Neural reinforcement learning to swing-up and balance a real pole | Riedmiller, M. |
2007 | On a successful application of multi-agent reinforcement learning to operations research benchmarks | Gabel, T.; Riedmiller, M. |
2007 | On experiences in a complex and competitive gaming domain: Reinforcement learning meets RoboCup | Riedmiller, M.; Gabel, T. |
2006 | Reducing policy degradation in neuro-dynamic programming | Gabel, T.; Riedmiller, M. |