Article Contents

Article Navigation> JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY> 1999> 8(3): 245-250

Hu Guanghua, Wu Cangpu. Incremental Multi Step R Learning [J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 1999, 8(3): 245-250.

Citation:

Hu Guanghua, Wu Cangpu. Incremental Multi Step R Learning [J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 1999, 8(3): 245-250.

Hu Guanghua, Wu Cangpu. Incremental Multi Step R Learning [J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 1999, 8(3): 245-250.

Citation:

Hu Guanghua, Wu Cangpu. Incremental Multi Step R Learning [J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 1999, 8(3): 245-250.

PDF( 0 KB)

Incremental Multi Step R Learning 

Department of Automatic Control, Beijing Institute of Technology, Beijing 100081

Received Date:1998-10-13

Abstract

Abstract

Aim To investigate the model free multi step average reward reinforcement learning algorithm. Methods By combining the R learning algorithms with the temporal difference learning (TD( λ ) learning) algorithms for average reward problems, a novel incremental algorithm, called R( λ ) learning, was proposed. Results and Conclusion The proposed algorithm is a natural extension of the Q( λ) learning, the multi step discounted reward reinforcement learning algorithm, to the average reward cases. Simulation results show that the R( λ ) learning with intermediate λ values makes significant performance improvement over the simple R learning.
- reinforcement learning,
- average reward,
- R-learning,
- Markov decision processes,
- temporal difference learning

FullText(HTML)

References (2)

References

[1]

Schwartz A. Areinforcement learning method for maximizing undiscounted rewards. In: SaittaL,ed. Proceeding of the Tenth International Conference on Machine Learning. Amherst: Morgan Kaufmann, 1993.298- 305
[2] Mahadevan S. Average reward reinforcement learning: foundations, algorithms and empirical results. Machine Learning, 1996, 22:159-195
[3] Tadepalli P, Ok D. Model??based average reward reinforcement learning. Artificial Intelligence, 1998, 100:177-224

[2]

Peng J, Williams R J. Increment multi step Q-learning. Machine learning, 1996, 22:283-290
[5] Bertsekas D P. Dynamic programming: Deterministic and stochastic methods. Englewood Cliffs: Prentice Hall, 1987
[6] Cichosz P, Mulawka J J. Fast and efficient reinforcement learning with truncated temporal differences. In: Prieditis A, Russell S, ed. Proceeding of the Twelfth International Conference on Ma??chine Learning. San Francisco: Mo rgan Kaufmann, 1995.99-107

Relative Articles

Supplements (0)

Cited By

Proportional views

Proportional views

通讯作者:陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Get Citation

PDF

XML

Article Metrics

Article views (326) PDF downloads(1)

Incremental Multi Step R Learning 

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Incremental Multi Step R Learning 

Abstract

References

Proportional views

Catalog

通讯作者:陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Export File

Citation

Format

Content