Information Journal Paper
APA:
POUYAN, M., GOLZARI, S., MOUSAVI, A., & HATAM, A. (2016). IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION. NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR, 14(2), 137-146. SID. https://sid.ir/paper/228376/en
Vancouver:
POUYAN M., GOLZARI S., MOUSAVI A., HATAM A. IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION. NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR [Internet]. 2016;14(2):137-146. Available from: https://sid.ir/paper/228376/en
IEEE:
M. POUYAN, S. GOLZARI, A. MOUSAVI, and A. HATAM, “IMPROVING Q-LEARNING USING SIMULTANEOUS UPDATING AND ADAPTIVE POLICY BASED ON OPPOSITE ACTION,” NASHRIYYAH -I MUHANDISI -I BARQ VA MUHANDISI -I KAMPYUTAR -I IRAN, B- MUHANDISI -I KAMPYUTAR, vol. 14, no. 2, pp. 137–146, 2016, [Online]. Available: https://sid.ir/paper/228376/en