Publications on Hierarchical Methods

Effective Control Knowledge Transfer Through Learning Skill and Representation Hierarchies
International Joint Conference on Artificial Intelligence 2007 (pdf - 192 KB) Abstract:
Learning capabilities of computer systems still lag far behind biological systems. One of the reason...

Caironi, Pierguido ( caironi@elet.polimi.it)

Gradient-Based Reinforcement Learning: Learning Combinations of Control Policies
Technical Report 97.50, Dip. Elettronica e Informazione, Politecnico di Milano ( gzipped Postscript - 381 KB) Abstract:
This report presents two innovative model-based reinforcement learning algorithms for continuous st...

Dietterich, Thomas ( tgd@cs.orst.edu)

The MAXQ Method for Hierarchical Reinforcement Learning
Proceedings of the International Conference on Machine Learning, 1998 ( gzipped Postscript - 53Kb) Abstract:
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decompo...

Dietterich, Thomas ( tgd@cs.orst.edu)

Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition
journal version; under review ( gzipped Postscript - 192Kb) Abstract:
This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decompo...

Dietterich, Thomas

E-mail: tgd@cs.orst.edu
State abstraction in MAXQ hierarchical reinforcement learning
unpublished ( gzipped Postscript - 102Kb) Abstract:
Many researchers have explored methods for hierarchical reinforcement learning (RL) with tempora...

Dietterich, Thomas ( tg@cs.orst.edu)

Hierarchical reinforcement learning with the MAXQ value function decomposition.
unpublished ( gzipped Postscript - 360Kb) Abstract:
Note: Supersedes previous version with same title. This paper presents a new approach to hier...

Diuk, Carlos , Alexander Strehl, Michael Littman( cdiuk@cs.rutgers.edu)

A Hierarchical Approach to Efficient Reinforcement Learning in Deterministic Domains
AAMAS 2006 (PDF - 140KB) Abstract:
Factored representations, model-based learning, and hierar- chies are well-studied techniques for i...

Faihe, Yassine , Jean-Pierre Muller( yfaihe@acm.org)

Behaviors Coordination Using Restless Bandits Allocation Indexes
Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (SAB98) ( gzipped Postscript - 166KB) Abstract:
In order to remain viable and to reproduce an animal has to continuously deal with the problem of ...

Hengst, Bernhard ( bernhardh@cse.unsw.edu.au)

Generating Hierarchical Structure in Reinforcement Learning from State Variables
Springer-Verlag (copyright) as a volume in their Lecture Notes in Artificial Intelligence series. (2000) ( gzipped Postscript - 11 pages) Abstract:
This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by ...

Kalmar, Zsolt , Cs. Szepesvári and A. Lorincz( kalmar@mindmaker.kfkipark.hu)

Module Based Reinforcement Learning for a Real Robot
Proceedings of the 6th European Workshop on Learning Robots, 22-32, 1997 ( gzipped Postscript - 755 Kb) Abstract:
This is the shortest version of our Module-Based RL paper. The behaviour of reinforcement learnin...

Kalmar, Zsolt , Cs. Szepesvari and A. Lorincz ( kalmar@mindmaker.kfkipark.hu)

Generalized Dynamic Concept Model as a Route to Construct Adaptive Autonomous Agents
Neural Network World 3:353-360, 1995 ( gzipped Postscript - ) Abstract:
This is an early approach to introduce generalization in RL. A model of adaptive autonomous agent...

Kalmár, Zsolt , Cs. Szepesvári and A. Lorincz( kalmar@mindmaker.kfkipark.hu)

Module-Based Reinforcement Learning: Experiments with a Real Robot
Machine Learning ( gzipped Postscript - 755) Abstract:
The behavior of reinforcement learning (RL) algorithms is best understood in completely observable, ...

Laurent, Guillaume , Emmanuel Piat( mel@guillaume-laurent.levillage.org)

Parallel Q-Learning for a block-pushing problem
Conference proceedings (pdf - 679KB) Abstract:
Our approach is based on reinforcement learning algorithm (Q-Learning). We propose an original archi...

McGovern, Amy , Richard S. Sutton( amy@cs.umass.edu)

Macro-Actions in Reinforcement Learning: An Empirical Analysis
technical report ( gzipped Postscript - 440K) Abstract:
Several researchers have proposed reinforcement learning methods that obtain advantages in le...

McGovern, Amy , Doina Precup, Balaraman Ravindran, Satinder Singh, Richard S Sutton( amy@cs.umass.edu)

Hierarchical Optimal Control of MDP's
Proceedings of the 10th Yale Workshop on Adaptive and Learning systems. ( gzipped Postscript - 600 KB) Abstract:
In this paper we survey a new approach to reinforcement learning in which high and low-level decis...

Munos, Remi

E-mail: munos@cs.cmu.edu
Finite-Element methods with local triangulation refinement for continuous Reinforcement Learning problems
European Conference on Machine Learning, 1997 (compressed Postscript - 283Kb) Abstract:
This paper presents a reinforcement learning algorithm for generating an adaptive control for a ...

Parr, Ron , Stuart Russell( parr@cs.berkeley.edu )

Reinforcement Learning with Hierarchies of Machines
NIPS'97 ( gzipped Postscript - ) Abstract:
We present a new approach to reinforcement learning in which the policies considered by the learning...

Parr, Ronald ( parr@cs.stanford.edu)

Hierarchical Control and Learning for Markov Decision Processes
PhD Thesis ( gzipped Postscript - 440 KB ) Abstract:
This dissertation investigates the use of hierarchy and problem decomposition as a means of solv...

Schmidhuber, Juergen ( juergen@idsia.ch)

A whole bunch of papers on hierarchical reinforcement learning (since 1990)
journal papers and conference papers (HTML - 100 KB) Abstract:
There is no teacher providing useful intermediate subgoals for our hierarchical reinforcement learni...

Strehl, Alexander , Carlos Diuk, Michael Littman( strehl@cs.rutgers.edu)

Efficient Structure Learning in Factored-state MDPs
AAAI 2007 (PDF - 115KB) Abstract:
We consider the problem of reinforcement learning in factored-state MDPs in the setting in which le...

Wang, Gang , Sridhar Mahadevan( wanggan1@cse.msu.edu)

Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
International Conference on Machine Learning (ICML-99) ( gzipped Postscript - 250 KB) Abstract:
Manufacturing is a challenging real-world domain for applying MDP-based reinforcement learning algo...

Wiering, M. and Schmidhuber, Jurgen , ( juergen@isdia.ch)

HQ-Learning: Discovering Markovian subgoals for non-Markovian reinforcement learning
Technical Report IDSIA-95-96, October 1996 ( gzipped Postscript - 111 KB) Abstract:
To solve partially observable Markov decision problems, we introduce HQ-learning, a hierarchical ex...

Singh, Satinder ( baveja@cs.colorado.edu)

Reinforcement Learning with a Hierarchy of Abstract Models
Appears in Proceedings of the Tenth National Conference on Artificial Intelligence, 1992 ( gzipped Postscript - 105 KB) Abstract:
Reinforcement learning (RL) algorithms have traditionally been thought of as trial and error learni...

Thrun, Sebastian , Anton Schwartz( thrun+@heaven.learning.cs.cmu.edu>

Finding Structure in Reinforcement Learning
Advances in Neural Information Processing Systems (NIPS) 7, 1995. ( gzipped Postscript - 149 KB) Abstract:
Reinforcement learning addresses the problem of learning to select actions$ maximize one...