Welcome to Journal of Beijing Institute of Technology
Volume 8, Issue 3
Hu Guanghua, Wu Cangpu. Incremental Multi Step R Learning [J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 1999, 8(3): 245-250.

Incremental Multi Step R Learning 

  • Received Date: 1998-10-13
  • Aim: To investigate model-free, multi-step, average-reward reinforcement learning. Methods: By combining R-learning with temporal-difference learning (TD(λ)) for average-reward problems, a novel incremental algorithm, called R(λ)-learning, is proposed. Results and Conclusion: The proposed algorithm is a natural extension of Q(λ)-learning, the multi-step discounted-reward reinforcement learning algorithm, to the average-reward case. Simulation results show that R(λ)-learning with intermediate values of λ significantly outperforms simple R-learning.
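
The abstract does not give the update rules, so the following is only a minimal sketch of how R-learning might be combined with eligibility traces in the tabular case. It is not the authors' R(λ) algorithm: the abstract describes an extension of Q(λ)-learning, whereas this sketch swaps in a simpler Watkins-style trace cut on exploratory actions. The function name `r_lambda_step`, the step sizes `beta` and `alpha`, and the `greedy` flag are all hypothetical names introduced for illustration.

```python
from collections import defaultdict


def r_lambda_step(R, e, rho, s, a, r, s_next, actions,
                  beta=0.1, alpha=0.01, lam=0.5, greedy=True):
    """One tabular update of an R-learning variant with eligibility traces.

    Assumed (hypothetical) conventions, not taken from the paper:
      R   -- defaultdict(float) of relative action values, keyed by (state, action)
      e   -- defaultdict(float) of eligibility traces, keyed the same way
      rho -- current estimate of the average reward per step
    Returns the updated rho; R and e are modified in place.
    """
    best_next = max(R[(s_next, b)] for b in actions)
    best_here = max(R[(s, b)] for b in actions)

    # Average-reward TD error: no discount factor, rho is subtracted instead.
    delta = r - rho + best_next - R[(s, a)]

    # Watkins-style cut: an exploratory action ends credit to earlier pairs.
    if not greedy:
        e.clear()
    e[(s, a)] += 1.0  # accumulating trace for the pair just visited

    # Spread the TD error over all traced pairs, then decay traces by lambda.
    for key in list(e):
        R[key] += beta * delta * e[key]
        e[key] *= lam

    # As in simple R-learning, rho is adjusted only after greedy actions.
    if greedy:
        rho += alpha * (r - rho + best_next - best_here)
    return rho
```

With `lam=0` every trace decays to zero after one update, so under these assumptions the sketch falls back to a one-step R-learning update; intermediate values of `lam` let a single TD error adjust several recently visited state-action pairs.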

