Welcome to Journal of Beijing Institute of Technology
Volume 15Issue 2
.
Turn off MathJax
Article Contents
TONG Liang, LU Ji-lian. Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction[J]. JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2006, 15(2): 133-137.
Citation: TONG Liang, LU Ji-lian. Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction[J].JOURNAL OF BEIJING INSTITUTE OF TECHNOLOGY, 2006, 15(2): 133-137.

Multi-Agent Reinforcement Learning Algorithm Based on Action Prediction

Funds:Sponsored bythe Ministerial Level Foundation (70302)
  • Received Date:2004-10-26
  • Multi-agent reinforcement learning algorithms are studied.A prediction-based multi-agent reinforcement learning algorithm is presented for multi-robot cooperation task.The multi-robot cooperation experiment based on multi-agent inverted pendulum is made to test the efficency of the new algorithm,and the experiment results show that the new algorithm can achieve the cooperation strategy much faster than the primitive multi-agent reinforcement learning algorithm.
  • loading
  • [1]
    Sutton R S. Learning to predict by the methods of tem por al differ ence[J]. Machine Learning, 1988, 7(3): 9-44.
    [2]
    Littman M L. Markov games as a framework for multi agent reinforcement learning[Z]. 11th International Con ference on Machine Lear ning, San Francisco, CA, 1994.
    [3]
    Hu Junling, Wellman M P. Multiagent r einforcementlearning: T heoretical framew ork and an alg orithm [Z]. Fifteenth International Conf on Machine Learning, Wis consin, 1998.
    [4]
    Claus C, Boutilier C. The dyanmics of r einforcementlearning in cooperative multiagent systems [A]. MenloPar k. Proceedings of the Fifteenth Natio nal Conferenceon Artificial Intelligence [c]. [s. l.]: AAAI Press,1998. 746-752.
    [5]
    Watkins C, Dayan P. Q lear ning [J]. Machine Learn ing, 1992, 8(4): 279-292.
    [6]
    Sutton R S. Temporal cr edit assignment in r einforcementlearning [D]. Amherst, MA: Universit y of Mas sachusetts, 1984.
    [7]
    Oh C H, Nakashima T, Ishibuchi H. Initialization of Q v alues by fuzzy rules for accelerating Q lear ning [Z]. In ternational Joint Confer ence on Neural Networks, An chor ag e, Alaska, 1998.
  • 加载中

Catalog

    通讯作者:陈斌, bchen63@163.com
    • 1.

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article views (174) PDF downloads(0) Cited by()
    Proportional views
    Related

    /

      Return
      Return
        Baidu
        map