└── README.md

/README.md:
--------------------------------------------------------------------------------
# Multi-agent Reinforcement Learning
This repository maintains a reading list for multi-agent reinforcement learning.

# Papers

- Multi-agent reinforcement learning as a rehearsal for decentralized planning. L. Kraemer and B. Banerjee. Neurocomputing, 190:82–94, 2016. [pdf](http://www.ifaamas.org/Proceedings/aamas2013/docs/p1291.pdf).

- Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving. Shai Shalev-Shwartz, Shaked Shammah, Amnon Shashua. 2016. [pdf](https://arxiv.org/pdf/1610.03295v1.pdf).

- Multi-Agent Deep Reinforcement Learning. Maxim Egorov. 2016. [pdf](http://cs231n.stanford.edu/reports2016/122_Report.pdf).

- Learning to communicate to solve riddles with deep distributed recurrent q-networks. J. N. Foerster, Y. M. Assael, N. de Freitas, and S. Whiteson. arXiv preprint arXiv:1602.02672, 2016. [pdf](https://arxiv.org/pdf/1602.02672.pdf).

- Learning to Communicate with Deep Multi-Agent Reinforcement Learning. Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson. 2016. [pdf](https://arxiv.org/pdf/1605.06676v2.pdf).

- Multiagent cooperation and competition with deep reinforcement learning. Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, Raul Vicente. 2015. [pdf](https://arxiv.org/pdf/1511.08779v1).

- Empirically evaluating multiagent learning algorithms. E. Zawadzki, A. Lipson, and K. Leyton-Brown. 2014. [pdf](https://arxiv.org/pdf/1401.8074v1).

- Coordinating multi-agent reinforcement learning with limited communication. C. Zhang and V. Lesser. In AAMAS, volume 2, pages 1101–1108, 2013. [pdf](http://www.aamas-conference.org/Proceedings/aamas2013/docs/p1101.pdf).

- A novel multi-agent reinforcement learning approach for job scheduling in Grid computing. J. Wu, X. Xu, P. Zhang, C. Liu. 2011.

- Multi-agent reinforcement learning: An overview. L. Buşoniu, R. Babuška, and B. De Schutter. 2010. [pdf](http://www.dcsc.tudelft.nl/~bdeschutter/pub/rep/10_003.pdf).

- A Distributed Actor-Critic Algorithm and Applications to Mobile Sensor Network Coordination Problems. Paris Pennesi and Ioannis Ch. Paschalidis. IEEE Transactions on Automatic Control, vol. 55, no. 2, Feb. 2010. [html link](http://ieeexplore.ieee.org/document/5382498/#full-text-section)

- Hierarchical Multi-Agent Reinforcement Learning. Mohammad Ghavamzadeh, Sridhar Mahadevan, Rajbala Makar. JAAMAS, 2006. [pdf](http://www-anw.cs.umass.edu/pubs/2006/ghavamzadeh_mm_JAAMAS06.pdf)

- Reinforcement Learning for RoboCup-Soccer Keepaway. Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Adaptive Behavior, 2005. [pdf](http://www.cs.utexas.edu/users/pstone/Papers/bib2html-links/AB05.pdf)

- Multi-agent patrolling with reinforcement learning. Hugo Santana, Geber Ramalho, Vincent Corruble, Bohdana Ratitch. AAMAS, 2004. [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.307.6566&rep=rep1&type=pdf)

- Multi-Agent Reinforcement Learning: a critical survey. Yoav Shoham, Rob Powers, and Trond Grenager. 2003. [pdf](http://www.cc.gatech.edu/~isbell/classes/2009/cs7641_spring/handouts/MALearning_ACriticalSurvey_2003_0516.pdf)

- Learning competitive pricing strategies by multi-agent reinforcement learning. Erich Kutschinski, Thomas Uthmann, Daniel Polani. Journal of Economic Dynamics and Control, 2003. [html link](http://www.sciencedirect.com/science/article/pii/S0165188902001227)

- An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems. Martin Lauer, Martin Riedmiller. ICML, 2000. [html version](https://www.researchgate.net/publication/225815648_Multi-agent_Reinforcement_Learning_An_Overview)

- Multi-agent Reinforcement Learning for Traffic Light Control. Marco Wiering. ICML, 2000. [pdf](http://www.dcsc.tudelft.nl/~sc4081/assign/pap/Reinforcement_Learning.pdf)

- The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems. Caroline Claus and Craig Boutilier. AAAI, 1998. [pdf](https://www.aaai.org/Papers/AAAI/1998/AAAI98-106.pdf)

- Elevator Group Control Using Multiple Reinforcement Learning Agents. Robert H. Crites, Andrew G. Barto. Machine Learning, 33, 235–262, 1998. [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.464.6183&rep=rep1&type=pdf)

- Reinforcement learning in the multi-robot domain. M. J. Matarić. Autonomous Robots, 4, 73–83, 1997. [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.454.1747&rep=rep1&type=pdf)

- Multiagent reinforcement learning in the iterated prisoner's dilemma. T. W. Sandholm, R. H. Crites. Biosystems, 1996. [pdf](http://opim.wharton.upenn.edu/~sok/papers/s/sandholm-biosystems95.pdf)

- Markov games as a framework for multi-agent reinforcement learning. Michael L. Littman. ICML, 1994. [pdf](https://www.researchgate.net/profile/Michael_Littman2/publication/2799903_Markov_Games_as_a_Framework_for_Multi-Agent_Reinforcement_Learning/links/54b66cbb0cf24eb34f6d19de.pdf)

- Multi-agent reinforcement learning: Independent vs. cooperative agents. Ming Tan. ICML, 1993. [pdf](http://web.mit.edu/16.412j/www/html/Advanced%20lectures/2004/Multi-AgentReinforcementLearningIndependentVersusCooperativeAgents.pdf)

--------------------------------------------------------------------------------