Publications on Industrial Applications

Crites, Robert , Andrew Barto( crites@cs.umass.edu)

Improving elevator performance using reinforcement learning
unpublished (compressed Postscript - 58 KB) Abstract:
This paper describes the application of reinforcement learning (RL) to the difficult real world pro...

Damien, Ernst , Mevludin Glavic and Louis Wehenkel( dernst@ulg.ac.be)

Power Systems Stability Control : Reinforcement Learning Framework
IEEE transactions on Power Systems Abstract:
In this paper we explore how a computational approach to learning from interactions, called Reinfo...

Dietterich, Thomas , W. Zhang( tgd@cs.orst.edu)

A Reinforcement Learning Approach to Job-shop Scheduling
Proceedings of IJCAI95 ( gzipped Postscript - ) Abstract:
We apply reinforcement learning methods to learn domain-specific heuristics for job shop scheduling...

Ernst, Damien ( dernst@ulg.ac.be)

Power Systems Stability Control : Reinforcement Learning Framework
IEEE transactions on Power Systems Abstract:
In this paper we explore how a computational approach to learning from interactions, called Reinfo...

Ernst, Damien , Pierre Geurts, Mevludin Glavic, Louis Wehenkel

E-mail: ernst@montefiore.ulg.ac.be
Approximate value iteration in the reinforcement learning context. Application to electrical power system control
International Journal of Emerging Electric Power Systems (.pdf - 780) Abstract:
In this paper we explain how to design intelligent agents able to process the information acquired f...

Ernst, Damien , Guy-Bart Stan, Jorge Goncalves and L. Wehenkel( ernst@montefiore.ulg.ac.be)

Clinical data based optimal STI strategies for HIV: a reinforcement learning approach
Proceedings of Benelearn 2006, 11-12 May 2006, Ghent, Belgium (pdf - 172 KB) Abstract:
This paper addresses the problem of computing optimal structured treatment interruption strategies f...

Likas, Aristidis ( arly@cs.uoi.gr)

A Reinforcement Learning Approach to On-line Clustering
Neural Computation, to appear ( gzipped Postscript - 80KB) Abstract:
A general technique is proposed for embedding on-line clustering algorithms based on competitive l...

Mahadevan, Sridhar , Nicholas Marchalleck, Tapas Das, and Abhijit Gosavi( mahadeva@cps.msu.edu)

Self-Improving Factory Simulation using Continuous-Time Average-Reward Reinforcement Learning
Proceedings of the 14th International Conference on Machine Learning (IMLC '97), Nashville, TN, July 1997. ( gzipped Postscript - ) Abstract:
Many factory optimization problems, from inventory control to scheduling and reliability, can be f...

Mahadevan, Sridhar , Nikfar Khaleeli, Nicholas Marchalleck (mahadeva@cps.msu.edu)

Designing Agent Controllers using Discrete Event Markov Models
AAAI Fall Symposium on Model-Directed Autonomous Systems, Nov. 5-7, Cambridge, MA (gzipped Postscript - 200 kb)
Abstract: This paper describes the use of discrete-event Markov decision process models to design robust age...

Syam, Syafiie , F. Tadeo, E. Martinez( syam@autom.uva.es)

Model Free Intelligent Control Using Reinforcement Learning and Temporal Abstraction-applied to pH Control.
IFAC 2005 (pdf - 300) Abstract:
This article presents a solution to pH control based on model-free intelligent control (MFIC) using ...

Walker, Marilyn

E-mail: walker@research.att.com
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
Journal of Artificial Intelligence Research, Vol 12., pp. 387-416, 2000. (Postscript - 340K) Abstract:
In the past several years, it has become possible to build spoken dialogue systems that can communi...

Wang, Gang , Sridhar Mahadevan

E-mail: wanggan1@cse.msu.edu
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
International Conference on Machine Learning (ICML-99) ( gzipped Postscript - 250) Abstract:
Manufacturing is a challenging real-world domain for applying MDP-based reinforcement learning algo...