مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Verion

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

video

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

sound

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Persian Version

مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

View:

1,076
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Download:

0
مرکز اطلاعات علمی Scientific Information Database (SID) - Trusted Source for Research and Academic Resources

Cites:

Information Journal Paper

Title

IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION

Pages

  137-146

Abstract

 Q-learning is a one of the most popular and frequently used model-free reinforcement learning method. Among the advantages of this method is independent in its prior knowledge and there is a proof for its convergence to the optimal policy. One of the main limitations of this method is its low CONVERGENCE SPEED, especially when the dimension is high. Accelerating convergence of this method is a challenge. Q-LEARNING can be accelerated the convergence by the notion of OPPOSITE ACTION. Since two Q-values are updated simultaneously at each learning step. In this paper, ADAPTIVE POLICY and the notion of OPPOSITE ACTION are used to speed up the learning process by integrated approach. The methods are simulated for the grid world problem. The results demonstrate a great advance in the learning in terms of success rate, the percent of optimal states, the number of steps to goal, and average reward.

Cites

  • No record.
  • References

  • No record.
  • Cite

    APA: Copy

    POUYAN, M., GOLZARI, S., MOUSAVI, A., & HATAM, A.. (2016). IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION. NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR, 14(2), 137-146. SID. https://sid.ir/paper/228376/en

    Vancouver: Copy

    POUYAN M., GOLZARI S., MOUSAVI A., HATAM A.. IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION. NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR[Internet]. 2016;14(2):137-146. Available from: https://sid.ir/paper/228376/en

    IEEE: Copy

    M. POUYAN, S. GOLZARI, A. MOUSAVI, and A. HATAM, “IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION,” NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR, vol. 14, no. 2, pp. 137–146, 2016, [Online]. Available: https://sid.ir/paper/228376/en

    Related Journal Papers

    Related Seminar Papers

  • No record.
  • Related Plans

  • No record.
  • Recommended Workshops






    Move to top
    telegram sharing button
    whatsapp sharing button
    linkedin sharing button
    twitter sharing button
    email sharing button
    email sharing button
    email sharing button
    sharethis sharing button