└── README.md


/README.md:
--------------------------------------------------------------------------------
 1 | # Multi-agent Reinforcement Learning
 2 | This maintains a reading list for multi-agent reinforcement learning. 
 3 | 
 4 | # Papers
 5 | 
 6 | - Multi-agent reinforcement learning as a rehearsal for decentralized planning. L. Kraemer and B. Banerjee. Neurocomputing, 190:82–94, 2016. [pdf](http://www.ifaamas.org/Proceedings/aamas2013/docs/p1291.pdf).
 7 | 
 8 | - Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving. Shai Shalev-Shwartz, Shaked Shammah, Amnon Shashua. 2016. [pdf](https://arxiv.org/pdf/1610.03295v1.pdf).
 9 | 
10 | - Multi-Agent Deep Reinforcement Learning. Maxim Egorov. 2016. [pdf](http://cs231n.stanford.edu/reports2016/122_Report.pdf).
11 | 
12 | - Learning to communicate to solve riddles with deep distributed recurrent q-networks. J. N. Foerster, Y. M. Assael, N. de Freitas, and S. Whiteson. arXiv preprint arXiv:1602.02672, 2016. [pdf](https://arxiv.org/pdf/1602.02672.pdf). 
13 | 
14 | - Learning to Communicate with Deep Multi-Agent Reinforcement Learning. Jakob N. Foerster, Yannis M. Assael, Nando de Freitas, Shimon Whiteson. 2016. [pdf](https://arxiv.org/pdf/1605.06676v2.pdf).
15 | 
16 | - Multiagent cooperation and competition with deep reinforcement learning. Ardi Tampuu, Tambet Matiisen, Dorian Kodelja, Ilya Kuzovkin, Kristjan Korjus, Juhan Aru, Jaan Aru, Raul Vicente. 2015. [pdf](https://arxiv.org/pdf/1511.08779v1).
17 | 
18 | - Empirically evaluating multiagent learning algorithms. E. Zawadzki, A. Lipson, and K. Leyton-Brown. 2014. [pdf](https://arxiv.org/pdf/1401.8074v1).
19 | 
20 | - Coordinating multi-agent reinforcement learning with limited communication. C. Zhang and V. Lesser.  In AAMAS, volume 2, pages 1101–1108, 2013. [pdf](http://www.aamas-conference.org/Proceedings/aamas2013/docs/p1101.pdf).
21 | 
22 | - A novel multi-agent reinforcement learning approach for job scheduling in Grid computing, J Wu, X Xu, P Zhang, C Liu, 
23 | [pdf](A novel multi-agent reinforcement learning approach for job scheduling in Grid computing). 2011. 
24 | 
25 | - Multi-agent reinforcement learning: An overview. L. Bus¸oniu, R. Babuska, and B. De Schutter. 2010. [pdf](http://www.dcsc.tudelft.nl/~bdeschutter/pub/rep/10_003.pdf).
26 | 
27 | - A Distributed Actor-Critic Algorithm and Applications to Mobile Sensor Network Coordination Problems. Paris et al. IEEE TRANS. AUTOMATIC CONTROL, VOL. 55, NO. 2, FEB. 2010.  [html Link](http://ieeexplore.ieee.org/document/5382498/#full-text-section)
28 | 
29 | - Hierarchical Multi-Agent Reinforcement Learning, Mohammad Ghavamzadeh, Sridhar Mahadevan, Rajbala Makar, JAAMAS, 2006. [pdf](http://www-anw.cs.umass.edu/pubs/2006/ghavamzadeh_mm_JAAMAS06.pdf)
30 | 
31 | - Reinforcement Learning for RoboCup-Soccer Keepaway, Peter Stone, Richard S. Sutton, and Gregory Kuhlmann. Adaptive Behavior, 2005. [pdf](http://www.cs.utexas.edu/users/pstone/Papers/bib2html-links/AB05.pdf)
32 | 
33 | - Multi-agent patrolling with reinforcement learning,Hugo Santana, Geber Ramalho, Vincent Corruble, Bohdana Ratitch, AAMAS, 2004. [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.307.6566&rep=rep1&type=pdf)
34 | 
35 | - Multi-Agent Reinforcement Learning: a critical survey. Yoav Shoham, Rob Powers and Trond Grenager, 2003. [pdf](http://www.cc.gatech.edu/~isbell/classes/2009/cs7641_spring/handouts/MALearning_ACriticalSurvey_2003_0516.pdf)
36 | 
37 | - Learning competitive pricing strategies by multi-agent reinforcement learning, E Kutschinski, T Uthmann, D Polani - Journal of Economic Dynamics and finance, Erich Kutschinskia, , , Thomas Uthmannb, , Daniel Polani, 2003. [html link](http://www.sciencedirect.com/science/article/pii/S0165188902001227) 
38 | 
39 | - An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems,  Martin Lauer , Martin Riedmiller, ICML, 2000. [html version](https://www.researchgate.net/publication/225815648_Multi-agent_Reinforcement_Learning_An_Overview)
40 | 
41 | - Multi-agent Reinforcement Learning for Traffic Light Control. Marco Weiring, ICML, 2000. [pdf](http://www.dcsc.tudelft.nl/~sc4081/assign/pap/Reinforcement_Learning.pdf)
42 | 
43 | - The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, Caroline Claus and Craig Boutilier. AAAI, 1998. [pdf](https://www.aaai.org/Papers/AAAI/1998/AAAI98-106.pdf)
44 | 
45 | - Elevator Group Control Using Multiple Reinforcement Learning Agents, ROBERT H. CRITES, ANDREW G. BARTO, Machine Learning, 33, 235–262 (1998), [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.464.6183&rep=rep1&type=pdf)
46 | 
47 | - Reinforcement learning in the multi-robot domain, MJ Matarić, Autonomous Robots 4, 73–83 (1997), [pdf](http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.454.1747&rep=rep1&type=pdf)
48 | 
49 | - Multiagent reinforcement learning in the iterated prisoner's dilemma,TW Sandholm, RH Crites, Biosystems, 1996.  [pdf](http://opim.wharton.upenn.edu/~sok/papers/s/sandholm-biosystems95.pdf)
50 | 
51 | - Markov games as a framework for multi-agent reinforcement learning, Michael L. Littman, ICML, 1994 [pdf](https://www.researchgate.net/profile/Michael_Littman2/publication/2799903_Markov_Games_as_a_Framework_for_Multi-Agent_Reinforcement_Learning/links/54b66cbb0cf24eb34f6d19de.pdf)
52 | 
53 | - Multi-agent reinforcement learning: Independent vs. cooperative agents, ICML, 1993, Ming Tan. [pdf](http://web.mit.edu/16.412j/www/html/Advanced%20lectures/2004/Multi-AgentReinforcementLearningIndependentVersusCooperativeAgents.pdf)
54 | 


--------------------------------------------------------------------------------