Publications on Industrial Applications
Crites,
Robert
, Andrew Barto( crites@cs.umass.edu)
Improving elevator performance using reinforcement learning
unpublished
(compressed Postscript - 58 KB)
Abstract:
This paper describes the application of reinforcement learning (RL)
to the difficult real world pro...
Damien,
Ernst
, Mevludin Glavic and Louis Wehenkel( dernst@ulg.ac.be)
Power Systems Stability Control : Reinforcement Learning Framework
IEEE transactions on Power Systems
Abstract:
In this paper we explore how a computational approach to learning from
interactions, called Reinfo...
Dietterich,
Thomas
, W. Zhang( tgd@cs.orst.edu)
A Reinforcement
Learning Approach to Job-shop Scheduling
Proceedings of IJCAI95
( gzipped Postscript - )
Abstract:
We apply reinforcement learning methods to learn domain-specific
heuristics for job shop scheduling...
Ernst,
Damien
( dernst@ulg.ac.be)
Power Systems Stability Control : Reinforcement Learning Framework
IEEE transactions on Power Systems
Abstract:
In this paper we explore how a computational approach to learning from
interactions, called Reinfo...
Ernst,
Damien
, Pierre Geurts, Mevludin Glavic, Louis WehenkelE-mail: ernst@montefiore.ulg.ac.be
Approximate value iteration in the reinforcement learning context. Application to electrical power system control
International Journal of Emerging Electric Power Systems
(.pdf - 780)
Abstract:
In this paper we explain how to design intelligent agents able to process the information acquired f...
Ernst,
Damien
, Guy-Bart Stan, Jorge Goncalves and L. Wehenkel( ernst@montefiore.ulg.ac.be)
Clinical data based optimal STI strategies for HIV: a reinforcement learning approach
Proceedings of Benelearn 2006, 11-12 May 2006, Ghent, Belgium
(pdf - 172 KB)
Abstract:
This paper addresses the problem of computing optimal structured treatment interruption strategies f...
Likas,
Aristidis
( arly@cs.uoi.gr)
A Reinforcement Learning Approach to On-line Clustering
Neural Computation, to appear
( gzipped Postscript - 80KB)
Abstract:
A general technique is proposed for
embedding on-line clustering algorithms based on competitive
l...
Mahadevan,
Sridhar
, Nicholas Marchalleck, Tapas Das, and Abhijit Gosavi( mahadeva@cps.msu.edu)
Self-Improving Factory Simulation using
Continuous-Time Average-Reward Reinforcement Learning
Proceedings of the 14th International Conference on Machine Learning (IMLC '97), Nashville,
TN, July 1997.
( gzipped Postscript - )
Abstract:
Many factory optimization problems, from inventory control to
scheduling and reliability, can be f...
Mahadevan, Sridhar , Nikfar Khaleeli, Nicholas Marchalleck
(mahadeva@cps.msu.edu)
Designing Agent Controllers using Discrete Event Markov Models
AAAI Fall Symposium on Model-Directed Autonomous Systems, Nov. 5-7, Cambridge, MA
(gzipped Postscript - 200 kb)
Abstract:
This
paper describes the use of discrete-event Markov decision
process models to design robust age...
Syam,
Syafiie
, F. Tadeo, E. Martinez( syam@autom.uva.es)
Model Free Intelligent Control Using Reinforcement Learning and Temporal Abstraction-applied to pH Control.
IFAC 2005
(pdf - 300)
Abstract:
This article presents a solution to pH control based on model-free intelligent control (MFIC) using ...
Walker,
Marilyn
E-mail: walker@research.att.com
An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email
Journal of Artificial Intelligence Research, Vol 12., pp. 387-416, 2000.
(Postscript - 340K)
Abstract:
In the past several years, it has become possible to build spoken
dialogue systems that can communi...
Wang,
Gang
, Sridhar MahadevanE-mail: wanggan1@cse.msu.edu
Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
International Conference on Machine Learning (ICML-99)
( gzipped Postscript - 250)
Abstract:
Manufacturing is a challenging real-world domain for applying
MDP-based reinforcement learning algo...