├── .gitattributes ├── .github └── FUNDING.yml ├── .gitignore ├── Causal └── PR Causal Introduction.md ├── Control ├── 1_LQR.md ├── 1_LQR_code.py ├── 2_DMP.md └── 3_DMP2.md ├── Efficient ├── PR Efficient Ⅰ:机器人中的数据高效强化学习.md ├── PR Efficient Ⅱ:Bayesian Transfer RL with prior knowledge.md ├── PR Efficient Ⅲ:Efficient RL for Multi-Step Visual Tasks with Sim2real.md ├── PR Efficient Ⅳ:五分钟内让四足机器人学会行走.md ├── PR Efficient Ⅴ:DERL with self-predictive representations.md ├── PR Efficient Ⅵ:从RL的五个方面分析Sample Efficient.md └── PR Efficient Ⅶ:Efficient RL 中表征学习的理论基础.md ├── Federated Learning └── FLⅠ:联邦学习(Federated Learning)入门指南.md ├── Imitation learning ├── 01Introduction.md ├── 02DAgger.assets │ ├── image-20200514200538762.png │ ├── 微信截图_20200514220002.png │ ├── 微信截图_20200514220137.png │ ├── 微信截图_20200514220221.png │ ├── 微信截图_20200514220412.png │ ├── 微信截图_20200514220508.png │ └── 微信截图_20200514220651.png ├── 02DAgger.md ├── 03EnsembleDAgger.assets │ ├── image-20200514223244922.png │ └── 微信截图_20200514225330.png ├── 03EnsembleDAgger.md ├── EnsembleDAgger.assets │ ├── image-20200514131357018.png │ └── image-20200514131458469.png └── Introduction.assets │ ├── 1_9gdENk_iThuoha-ZJK4oOQ.jpeg │ ├── 1_P076bt-xcC3mKyYCINzSFg.jpeg │ ├── 1_RkCKUyRW68fAuysDgWhuMA.png │ ├── 1_UuY1bsit07pwijg1pSOQgQ.jpeg │ ├── image-20200513095032814.png │ ├── image-20200513095203592.png │ ├── image-20200513110523699.png │ ├── stacking_demo.gif │ ├── v2-61bb833f9464f5c7fc088045f26c909d_1440w.png │ └── v2-9ad04f29683121bb7870e0589b8ec389_1440w.png ├── LICENSE ├── MARL ├── MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets │ ├── 2020-09-06_23-17-55.jpg │ ├── 2020-09-08_12-50-46.jpg │ ├── image-20200906172748898.png │ └── image-20200906225644703.png ├── MARL Ⅰ:A Selective Overview of Theories and Algorithms.md ├── MARL Ⅱ:QD-learning.assets │ └── image-20200914210232950.png └── MARL Ⅱ:QD-learning.md ├── MBRL ├── Model-Based RL Ⅲ 从源码读懂PILCO.md └── img │ ├── equation.svg │ ├── image-20200505172434162.png │ ├── 微信截图_20200505223246.png │ └── 微信截图_20200505223309.png ├── Memory └── Memory systems 2018 – towards a new paradigm.md ├── Paper Reading ├── Bayesian Relational Memory for Semantic Visual Navigation.assets │ ├── image-20200703144120242.png │ ├── image-20200703144159048.png │ ├── image-20200703144238614.png │ ├── image-20200703151212485.png │ ├── image-20200703154225724.png │ └── image-20200703154359760.png ├── Bayesian Relational Memory for Semantic Visual Navigation.md ├── Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets │ ├── image-20200705111106393.png │ ├── image-20200705111939601.png │ ├── image-20200705123909206.png │ ├── image-20200705130626188.png │ ├── image-20200705131656750.png │ ├── image-20200705131744033.png │ ├── image-20200705131747173.png │ ├── image-20200705131859781.png │ ├── image-20200705131926516.png │ └── image-20200705132141819.png ├── Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.md ├── Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning.md ├── Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets │ ├── image-20200702211649011.png │ ├── image-20200702213128415.png │ ├── image-20200703101618018.png │ ├── image-20200703101914142.png │ ├── image-20200703104319711.png │ ├── image-20200703121626950.png │ ├── image-20200703121644122.png │ ├── image-20200703121752526.png │ └── image-20200703121906914.png ├── Learning to learn how to learn Self-adaptive visual navigation using meta-learning.md ├── Long Range Neural Navigation Policies for the Real World.md ├── Scene memory transformer for embodied agents in long-horizon tasks.assets │ ├── image-20200706152627903.png │ ├── image-20200706153724992.png │ ├── image-20200706165235163.png │ ├── image-20200706175359541.png │ ├── image-20200706182841042.png │ ├── image-20200706182939318.png │ ├── image-20200706183146814.png │ ├── image-20200706183211120.png │ └── image-20200706190803028.png ├── Scene memory transformer for embodied agents in long-horizon tasks.md ├── Semi-parametric topological memory for navigation.assets │ ├── image-20200706125057832.png │ ├── image-20200706125603144.png │ ├── image-20200706140848398.png │ ├── image-20200706140947547.png │ └── image-20200706141720545.png ├── Semi-parametric topological memory for navigation.md ├── Target driven visual navigation exploiting object relationships.assets │ ├── image-20200702140946820.png │ ├── image-20200702151238953.png │ ├── image-20200702154153781.png │ ├── image-20200702161215057.png │ ├── image-20200702163714131.png │ ├── image-20200702164548441.png │ └── image-20200702172920825.png ├── Target driven visual navigation exploiting object relationships.md └── Uncertainty-Aware Reinforcement Learning for Collision Avoidance.md ├── Perspective ├── PR Perspective Ⅰ:Embodied AI 的新浪潮.md └── PR Perspective Ⅱ:Robot Learning思考.md ├── Preliminary ├── A Simple Guide for NN.assets │ ├── 20160707204048899.gif │ ├── 853467-20160630141449671-1058672778.png │ ├── 853467-20160630152018906-1524325812.png │ ├── 853467-20160630154317562-311369571.png │ ├── equation-1584587262781.svg │ ├── equation-1584587282205.svg │ ├── equation-1584587725413.svg │ └── equation.svg ├── A Simple Guide for NN.md ├── Preliminary RL basic knowledge.assets │ ├── 131433102201.jpg │ └── 132312526273.jpg ├── Preliminary of RL 1.md ├── Preliminary of RL 2.assets │ ├── 1042406-20180812184148124-1485684702.jpg │ ├── 1560008119444.png │ ├── 201019414696.png │ ├── 201019447506.png │ ├── 201019462191.png │ ├── 221402112851854.png │ ├── 221402155049842.png │ ├── 221402163881216.png │ ├── 221402175506201.png │ ├── interview-14.png │ ├── v2-111ca0554c4504c7aefc9a14d0d92d2f_1440w.jpg │ ├── v2-6c24d01db0b8b94589b2fe6a6efcc7b2_1440w.jpg │ ├── v2-ef32f6901c6a5b8f6eafd8d478ff83ef_1440w.jpg │ ├── v2-f3c12050c797196c7c37b003905a8d30_1440w.jpg │ └── 屏幕快照 2016-01-05 下午9.48.30.png ├── Preliminary of RL 2.md ├── Preliminary of RL 3.assets │ └── equation.svg ├── Preliminary of RL 3.md ├── Preliminary of RL 5.md ├── Reinforcement Learning Notes.assets │ ├── 0_kt9_Z41qxgiI0CDl │ ├── 0_kt9_Z41qxgiI0CDl-1575448739547 │ ├── 0_kt9_Z41qxgiI0CDl-1575448742639 │ ├── 0_kt9_Z41qxgiI0CDl-1575448756568 │ ├── 0_oh-lF13hYWt2Bd6V_ │ ├── 1564474069516.png │ ├── 1564549789614.png │ ├── 76a319586cd215c8f2075b938fc6f6e07c81714b.svg │ ├── 8795d42bd263dcbe55d123e7466b2dd5091490a7.svg │ ├── 9ed1a541005a48d51b624c3b329897064ec2c065.svg │ ├── a325c9e05fa2ccce85eb2384ca00b4888d1c7824.svg │ ├── a5132668c0af8733656505c5fb6c1dff4a7907a1.svg │ ├── dc4621f81a5205e6ae31a35b87c54316e043deda-1575549924223.svg │ ├── dc4621f81a5205e6ae31a35b87c54316e043deda-1575549965545.svg │ ├── dc4621f81a5205e6ae31a35b87c54316e043deda.svg │ ├── image-20191204161236516.png │ ├── image-20191204164022284.png │ ├── image-20191204200910603.png │ ├── image-20191204201005583.png │ ├── image-20191204204627884.png │ ├── image-20191204205459823.png │ ├── image-20191205092649257.png │ ├── image-20191205102211427.png │ ├── image-20191205103234091.png │ ├── image-20191205103636621.png │ ├── image-20191205103810941.png │ ├── image-20191205104741531.png │ ├── image-20191205105318993.png │ ├── image-20191205110708645.png │ ├── image-20191205111303995.png │ ├── image-20191205121930328.png │ └── image-20191205190844049.png ├── Reinforcement Learning Notes.md └── img │ ├── 1558592857137.png │ ├── 1558614556514.png │ ├── 1560008119444.png │ ├── 2019-04-10 19-14-32 的屏幕截图.png │ ├── 2019-04-10 19-17-30 的屏幕截图.png │ ├── 2019-04-10 21-00-18 的屏幕截图.png │ ├── 3-3-1.png │ ├── 3-3-2.png │ ├── 4-1-1-1554948278323.jpg │ ├── 4-1-1.jpg │ ├── 4-5-4.png │ ├── 4155986-e77eec1baba5aeea.webp │ ├── 5-1-1.png │ ├── DQN3.png │ ├── sl4.png │ └── 屏幕快照 2016-01-05 下午9.48.30.png ├── Probabilistic Robotics ├── PR GaussianProcessRegression.assets │ ├── 1_9xMQMnSPnAFkWqIY2jvIpQ.png │ ├── 1_IdGgdrY_n_9_YfkaCh-dag.png │ ├── 1_YAPmNXea5gKoH3uyRrtITQ.png │ ├── 1_zNQg-o-C2JELQFQjEEDrLw.png │ ├── Illustration-of-a-bivariate-Gaussian-distribution-The-marginal-and-joint-probability.png │ ├── Sun, 10 May 2020 143250.png │ ├── image-20200510155719330.png │ ├── v2-3c25a927c217f13a055794377635faaf_1440w.jpg │ └── 微信图片编辑_20200510130051.jpg ├── PR HMC&MH&Gibbs.assets │ ├── 2018-05-09-gibbs-100.png │ ├── gibbssampler-2dnormal1.png │ ├── image__14_.png │ ├── true.jpg │ └── v2-08cb302ac37b757ee390705d822f87f2_1440w.jpg ├── PR HMC&MH&Gibbs.md ├── PR HMM.md ├── PR IS&MCMC.assets │ ├── 1_3nBb4AqcriLcENdpBp4fpQ@2x.png │ ├── 1_AZBh2kDanLoTFmb3yzErGQ@2x.png │ ├── 1_HclnWfZrh7Nzuj2_aHkPCQ.png │ ├── 1_hKQcryMc6fbcS7r-g0sriQ.png │ ├── 20190511000705.png │ ├── 5cdd91aec0102b09bad70aff4bd0e9b2.jpg │ ├── importance_sampling_concept.png │ ├── v2-9514f7703820b5bf99c98405eb413359_1440w.jpg │ └── v2-eb0945aa2185df958f4568e58300e77a_1440w.gif ├── PR IS&MCMC.md ├── PR Ⅰ MLE&MAP.md ├── PR Ⅱ Bayesian ├── PR Ⅱ MCMC&EM.md ├── PR Ⅲ Bayesian_MCMC.assets │ ├── 1_3nBb4AqcriLcENdpBp4fpQ@2x.png │ ├── 1_AZBh2kDanLoTFmb3yzErGQ@2x.png │ ├── Beta_9_7.png │ └── Example-of-Bayesian-inference-with-a-prior-distribution-a-posterior-distribution-and.png ├── PR Ⅲ GaussianProcessRegression.md ├── PR Ⅳ BayesFilter.md ├── PR Ⅳ BayesNeuralNetwork.assets │ ├── bayes_nn.png │ ├── bayesian_statistics.jpg │ └── extrapolation_graph.png ├── PR Ⅳ BayesNeuralNetwork.md ├── PR Ⅴ GMM.assets │ ├── aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NDQyMjcy.jfif │ └── aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NTQ5ODc3.jfif ├── PR Ⅴ GMM.md ├── PR Ⅵ BayesGraph.md ├── PR Ⅶ VariationalInference.md ├── PR Ⅷ MeanField.md ├── PR Ⅸ Entropy.assets │ ├── OIP.jfif │ ├── image-20200519155921660.png │ ├── image-20200523201708073.png │ └── multicolored-abstract-painting-1095624-710x210.jpg ├── PR Ⅸ Entropy.md ├── Probabilistic in Robotic (PR).png └── Probabilistic in Robotic (PR).xmind ├── README.md ├── RL from Demonstration ├── Deep_Q_From_Demonstration.assets │ ├── image-20200522121546661.png │ ├── image-20200522121626682.png │ └── image-20200522122022338.png ├── Deep_Q_From_Demonstration.md ├── RLfrom_Imperfect_Demonstration.assets │ ├── image-20200524205516760.png │ ├── image-20200524213207117.png │ ├── image-20200524215212011.png │ ├── image-20200524215647493.png │ ├── image-20200524221043525.png │ ├── image-20200524221144933.png │ ├── image-20200524221938482.png │ └── image-20200524222429294.png └── RLfrom_Imperfect_Demonstration.md ├── ROS ├── ROS Ⅰ:An Introduction.md ├── ROS Ⅱ:报错解决方案集锦.md ├── ROS Ⅲ:ROS 话题.md ├── ROS Ⅳ:ROS 消息&服务.md └── ROS 机器人实战Ⅰ:TurtleBot3 Simulation SLAM + Navigation.md ├── Reasoning ├── PR Reasoning Ⅰ:Bandit问题与 UCB UCT AlphaGo.md ├── PR Reasoning Ⅱ:Inductive bias 归纳偏置及其在深度学习中的应用.md ├── PR Reasoning Ⅲ:基于图表征的关系推理框架 —— Graph Network.md ├── PR Reasoning Ⅳ:数理逻辑(命题逻辑、谓词逻辑)知识整理.md ├── PR Reasoning Ⅴ:命题推理与First Order Logic Reasoning.md ├── PR Reasoning Ⅵ:Counterfactual Reasoning 反事实推理及其在深度学习中的应用.md ├── PR Reasoning Ⅶ:Graph Reasoning 基于图的推理.md ├── PR Reasoning 序:Reasoning Robotics 推理机器人.md └── Relational inductive biases, deep learning, and graph networks.md ├── Related Works ├── A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets │ ├── image-20200528152451409.png │ ├── image-20200528201113694.png │ ├── image-20200528213920267.png │ ├── image-20200529151635542.png │ ├── image-20200529151707215.png │ └── image-20200529154918336.png ├── A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.md ├── Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates.md ├── End-to-End Robotic Reinforcement Learning without Reward Engineering.assets │ ├── image-20191208214135640.png │ ├── image-20191209102302915.png │ └── image-20191209104206015.png ├── End-to-End Robotic Reinforcement Learning without Reward Engineering.md ├── IROS2019速读(一).md ├── IROS2019速读(三).assets │ ├── image-20191221113203207.png │ ├── image-20191221114139768.png │ ├── image-20191221114316092.png │ ├── image-20191221200948469.png │ └── image-20191221234256181.png ├── IROS2019速读(三).md ├── IROS2019速读(二).assets │ └── image-20191219185107956.png ├── IROS2019速读(二).md ├── IROS2019速读(五).md ├── IROS2019速读(四).assets │ ├── dream_2.gif │ ├── dream_6.gif │ ├── image-20191222112555208.png │ ├── image-20191222123748477.png │ └── image-20191222144804275.png ├── IROS2019速读(四).md ├── IROS2019速读.assets │ ├── image-20191216203301798.png │ ├── image-20191216211111079.png │ ├── image-20191216212441606.png │ └── image-20191217203423494.png ├── Meta learning An Introduction.assets │ ├── 1_AcaPiikZErVv_iFJzWekQg.gif │ ├── NTM.png │ ├── combine-slow-fast-weights.png │ ├── equation-1577535254476.svg │ ├── equation.svg │ ├── few-shot-classification.png │ ├── image-20191226194740259.png │ ├── image-20191226200939808.png │ ├── image-20191226202123590.png │ ├── image-20191226202326065.png │ ├── lstm-meta-learner.png │ ├── maml-algo.png │ ├── maml.png │ ├── mann-meta-learning.png │ ├── matching-networks.png │ ├── meta-network.png │ ├── prototypical-networks-1577419209148.png │ ├── prototypical-networks.png │ ├── relation-network.png │ ├── reptile-algo.png │ ├── reptile_vs_FOMAML.png │ ├── siamese-conv-net.png │ ├── train-meta-learner.png │ └── v2-2d61ff11eb1a5a9e52d6c12eb333eb4b_hd.jpg ├── Meta learning An Introduction.md ├── Meta-Reinforcement-Learning An Introduction.md ├── Overcoming Exploration in Reinforcement Learning with Demonstrations.assets │ └── image-20191211211229554.png ├── Overcoming Exploration in Reinforcement Learning with Demonstrations.md ├── The Predictron End-To-End Learning and Planning.assets │ ├── image-20191211163322741.png │ ├── image-20191211201710251.png │ ├── image-20191211202507981.png │ ├── image-20191212102538657.png │ └── image-20191212102714873.png ├── The Predictron End-To-End Learning and Planning.md ├── When to Trust Your Model Model-Based Policy Optimization.assets │ └── image-20191215201141993.png ├── When to Trust Your Model Model-Based Policy Optimization.md ├── 智源大会笔记.assets │ ├── image-20200624101204649.png │ ├── image-20200624101305253.png │ ├── image-20200624101402324.png │ ├── image-20200624101416544.png │ ├── image-20200624101704078.png │ ├── image-20200624101836768.png │ ├── image-20200624102016044.png │ ├── image-20200624102239093.png │ ├── image-20200624102756731.png │ ├── image-20200624102845779.png │ ├── image-20200624102945812.png │ ├── image-20200624103018972.png │ ├── image-20200624103109295.png │ ├── image-20200624103145785.png │ ├── image-20200624104915752.png │ ├── image-20200624105312169.png │ ├── image-20200624105434860.png │ ├── image-20200624105534892.png │ ├── image-20200624110446944.png │ └── image-20200624110634783.png └── 智源大会笔记.md ├── Representation └── Repre 1:Introduction.md ├── Robotics └── Bimanual coordination.md ├── Simulator ├── MuJoCo机器人建模教程.assets │ ├── download.html │ ├── grid1.png │ ├── grid2.png │ ├── grid2pin.png │ ├── openai-robotics-hand-with-cube-solved-crop-2000w.jpg │ ├── particle2.png │ └── unnamed.png ├── MuJoCo机器人建模教程.md ├── MuJoCo详细使用指南.md ├── PyBullet详细使用指南.md └── Sim2real.md ├── Structured ├── PR Structure Ⅱ .assets │ ├── image-20200719135624489.png │ ├── image-20200719135706275.png │ ├── image-20200719141215552.png │ ├── image-20200719141544422.png │ ├── image-20200719141645589.png │ ├── image-20200719142143932.png │ ├── image-20200719160240612.png │ ├── image-20200719160620909.png │ ├── image-20200719161117095.png │ ├── image-20200719161134980.png │ ├── image-20200719163653512.png │ ├── image-20200719164027556.png │ ├── image-20200719164332203.png │ └── image-20200719164745855.png ├── PR Structured Ⅰ GNN.assets │ ├── image-20200712132319666.png │ ├── image-20200712152008297.png │ ├── image-20200712152040834.png │ ├── image-20200712152050882.png │ ├── image-20200712154103209.png │ ├── image-20200712155738229.png │ ├── image-20200712161616515.png │ ├── image-20200712165321733.png │ ├── image-20200712170431243.png │ ├── image-20200712171214741.png │ ├── image-20200712172004136.png │ ├── image-20200712172227154.png │ ├── image-20200712174617755.png │ ├── image-20200712175300499.png │ ├── image-20200712175439303.png │ ├── image-20200712175812940.png │ ├── image-20200712180112851.png │ ├── image-20200712195813501.png │ ├── image-20200712203453039.png │ ├── image-20200712212800180.png │ ├── image-20200712212854141.png │ ├── image-20200712213349638.png │ ├── image-20200712213614662.png │ ├── image-20200712214932503.png │ ├── image-20200712215137652.png │ ├── image-20200712230809276.png │ ├── image-20200712231606429.png │ └── image-20200719135706275.png ├── PR Structured Ⅱ:Structured Probabilistic Model Ⅰ.md ├── PR Structured Ⅲ:马尔可夫、隐马尔可夫 HMM 、条件随机场 CRF 全解析及其python实现.md ├── PR Structured Ⅳ:General Conditional Random Field (CRF).md ├── PR Structured Ⅴ:GraphRNN——依次生成节点和边的图生成模型.md └── PR StructuredⅠ:Graph Neural Network An Introduction .md ├── Tools ├── Atlas │ ├── Atlas 使用指南.assets │ │ └── Atlas软硬件架构.png │ ├── Atlas 使用指南.md │ ├── Atlas安装配置流程.eps │ ├── Atlas安装配置流程.png │ ├── Atlas软硬件架构.eps │ └── Atlas软硬件架构.png ├── C++部署Pytorch模型方法.docx ├── Docker │ ├── Docker Ⅰ:安装与测试指南.md │ ├── Docker Ⅱ:管理与使用命令手册.md │ ├── Docker Ⅲ:Nvidia Docker安装与测试指南.md │ ├── Docker Ⅳ:Nvidia Docker使用命令手册.md │ └── Docker Ⅴ:Docker与Nvidia Docker踩坑与解决方案记录集.md ├── Habitat │ └── Habitat Challenge提交指南.md ├── Tools 1:Qt 转 PyQt5 的 Pycharm 插件.md ├── Tools 3:python socket 服务器与客户端双向通信.md ├── Tools 4:Python三行转并行——真香.md ├── Tools 5:Python三行转并行后续——全局变量.md ├── Tools 6:如何用Readthedoc写一份优雅的技术文档.md ├── Tools 7:Python颜色设置.md ├── Tools 8:Tex符号大全.md ├── Tools 9:Zotero使用指南.md ├── Ubuntu │ └── Ubuntu系统问题.md └── color.py ├── Utils ├── HTML2PDF.py ├── PDFselector.py └── basic_plot.py └── img └── image-20230825121432059.png /.gitattributes: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/.gitattributes -------------------------------------------------------------------------------- /.github/FUNDING.yml: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/.github/FUNDING.yml -------------------------------------------------------------------------------- /.gitignore: -------------------------------------------------------------------------------- 1 | Reasoning/BOOKS 2 | 3 | *.pdf 4 | .idea/ 5 | .vscode/ 6 | 7 | .DS_Store 8 | -------------------------------------------------------------------------------- /Causal/PR Causal Introduction.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Causal/PR Causal Introduction.md -------------------------------------------------------------------------------- /Control/1_LQR.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Control/1_LQR.md -------------------------------------------------------------------------------- /Control/1_LQR_code.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Control/1_LQR_code.py -------------------------------------------------------------------------------- /Control/2_DMP.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Control/2_DMP.md -------------------------------------------------------------------------------- /Control/3_DMP2.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Control/3_DMP2.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅰ:机器人中的数据高效强化学习.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅰ:机器人中的数据高效强化学习.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅱ:Bayesian Transfer RL with prior knowledge.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅱ:Bayesian Transfer RL with prior knowledge.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅲ:Efficient RL for Multi-Step Visual Tasks with Sim2real.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅲ:Efficient RL for Multi-Step Visual Tasks with Sim2real.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅳ:五分钟内让四足机器人学会行走.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅳ:五分钟内让四足机器人学会行走.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅴ:DERL with self-predictive representations.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅴ:DERL with self-predictive representations.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅵ:从RL的五个方面分析Sample Efficient.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅵ:从RL的五个方面分析Sample Efficient.md -------------------------------------------------------------------------------- /Efficient/PR Efficient Ⅶ:Efficient RL 中表征学习的理论基础.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Efficient/PR Efficient Ⅶ:Efficient RL 中表征学习的理论基础.md -------------------------------------------------------------------------------- /Federated Learning/FLⅠ:联邦学习(Federated Learning)入门指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Federated Learning/FLⅠ:联邦学习(Federated Learning)入门指南.md -------------------------------------------------------------------------------- /Imitation learning/01Introduction.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/01Introduction.md -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/image-20200514200538762.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/image-20200514200538762.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220002.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220002.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220137.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220137.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220221.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220221.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220412.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220412.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220508.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220508.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.assets/微信截图_20200514220651.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.assets/微信截图_20200514220651.png -------------------------------------------------------------------------------- /Imitation learning/02DAgger.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/02DAgger.md -------------------------------------------------------------------------------- /Imitation learning/03EnsembleDAgger.assets/image-20200514223244922.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/03EnsembleDAgger.assets/image-20200514223244922.png -------------------------------------------------------------------------------- /Imitation learning/03EnsembleDAgger.assets/微信截图_20200514225330.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/03EnsembleDAgger.assets/微信截图_20200514225330.png -------------------------------------------------------------------------------- /Imitation learning/03EnsembleDAgger.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/03EnsembleDAgger.md -------------------------------------------------------------------------------- /Imitation learning/EnsembleDAgger.assets/image-20200514131357018.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/EnsembleDAgger.assets/image-20200514131357018.png -------------------------------------------------------------------------------- /Imitation learning/EnsembleDAgger.assets/image-20200514131458469.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/EnsembleDAgger.assets/image-20200514131458469.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/1_9gdENk_iThuoha-ZJK4oOQ.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/1_9gdENk_iThuoha-ZJK4oOQ.jpeg -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/1_P076bt-xcC3mKyYCINzSFg.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/1_P076bt-xcC3mKyYCINzSFg.jpeg -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/1_RkCKUyRW68fAuysDgWhuMA.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/1_RkCKUyRW68fAuysDgWhuMA.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/1_UuY1bsit07pwijg1pSOQgQ.jpeg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/1_UuY1bsit07pwijg1pSOQgQ.jpeg -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/image-20200513095032814.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/image-20200513095032814.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/image-20200513095203592.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/image-20200513095203592.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/image-20200513110523699.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/image-20200513110523699.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/stacking_demo.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/stacking_demo.gif -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/v2-61bb833f9464f5c7fc088045f26c909d_1440w.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/v2-61bb833f9464f5c7fc088045f26c909d_1440w.png -------------------------------------------------------------------------------- /Imitation learning/Introduction.assets/v2-9ad04f29683121bb7870e0589b8ec389_1440w.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Imitation learning/Introduction.assets/v2-9ad04f29683121bb7870e0589b8ec389_1440w.png -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/LICENSE -------------------------------------------------------------------------------- /MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/2020-09-06_23-17-55.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/2020-09-06_23-17-55.jpg -------------------------------------------------------------------------------- /MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/2020-09-08_12-50-46.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/2020-09-08_12-50-46.jpg -------------------------------------------------------------------------------- /MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/image-20200906172748898.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/image-20200906172748898.png -------------------------------------------------------------------------------- /MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/image-20200906225644703.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.assets/image-20200906225644703.png -------------------------------------------------------------------------------- /MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅰ:A Selective Overview of Theories and Algorithms.md -------------------------------------------------------------------------------- /MARL/MARL Ⅱ:QD-learning.assets/image-20200914210232950.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅱ:QD-learning.assets/image-20200914210232950.png -------------------------------------------------------------------------------- /MARL/MARL Ⅱ:QD-learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MARL/MARL Ⅱ:QD-learning.md -------------------------------------------------------------------------------- /MBRL/Model-Based RL Ⅲ 从源码读懂PILCO.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MBRL/Model-Based RL Ⅲ 从源码读懂PILCO.md -------------------------------------------------------------------------------- /MBRL/img/equation.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MBRL/img/equation.svg -------------------------------------------------------------------------------- /MBRL/img/image-20200505172434162.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MBRL/img/image-20200505172434162.png -------------------------------------------------------------------------------- /MBRL/img/微信截图_20200505223246.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MBRL/img/微信截图_20200505223246.png -------------------------------------------------------------------------------- /MBRL/img/微信截图_20200505223309.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/MBRL/img/微信截图_20200505223309.png -------------------------------------------------------------------------------- /Memory/Memory systems 2018 – towards a new paradigm.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Memory/Memory systems 2018 – towards a new paradigm.md -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144120242.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144120242.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144159048.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144159048.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144238614.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703144238614.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703151212485.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703151212485.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703154225724.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703154225724.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703154359760.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.assets/image-20200703154359760.png -------------------------------------------------------------------------------- /Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Bayesian Relational Memory for Semantic Visual Navigation.md -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705111106393.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705111106393.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705111939601.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705111939601.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705123909206.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705123909206.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705130626188.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705130626188.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131656750.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131656750.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131744033.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131744033.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131747173.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131747173.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131859781.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131859781.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131926516.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705131926516.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705132141819.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.assets/image-20200705132141819.png -------------------------------------------------------------------------------- /Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships.md -------------------------------------------------------------------------------- /Paper Reading/Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning.md -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200702211649011.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200702211649011.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200702213128415.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200702213128415.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703101618018.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703101618018.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703101914142.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703101914142.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703104319711.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703104319711.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121626950.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121626950.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121644122.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121644122.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121752526.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121752526.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121906914.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.assets/image-20200703121906914.png -------------------------------------------------------------------------------- /Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Learning to learn how to learn Self-adaptive visual navigation using meta-learning.md -------------------------------------------------------------------------------- /Paper Reading/Long Range Neural Navigation Policies for the Real World.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Long Range Neural Navigation Policies for the Real World.md -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706152627903.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706152627903.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706153724992.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706153724992.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706165235163.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706165235163.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706175359541.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706175359541.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706182841042.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706182841042.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706182939318.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706182939318.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706183146814.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706183146814.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706183211120.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706183211120.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706190803028.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.assets/image-20200706190803028.png -------------------------------------------------------------------------------- /Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Scene memory transformer for embodied agents in long-horizon tasks.md -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706125057832.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706125057832.png -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706125603144.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706125603144.png -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706140848398.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706140848398.png -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706140947547.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706140947547.png -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706141720545.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.assets/image-20200706141720545.png -------------------------------------------------------------------------------- /Paper Reading/Semi-parametric topological memory for navigation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Semi-parametric topological memory for navigation.md -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702140946820.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702140946820.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702151238953.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702151238953.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702154153781.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702154153781.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702161215057.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702161215057.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702163714131.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702163714131.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702164548441.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702164548441.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702172920825.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.assets/image-20200702172920825.png -------------------------------------------------------------------------------- /Paper Reading/Target driven visual navigation exploiting object relationships.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Target driven visual navigation exploiting object relationships.md -------------------------------------------------------------------------------- /Paper Reading/Uncertainty-Aware Reinforcement Learning for Collision Avoidance.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Paper Reading/Uncertainty-Aware Reinforcement Learning for Collision Avoidance.md -------------------------------------------------------------------------------- /Perspective/PR Perspective Ⅰ:Embodied AI 的新浪潮.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Perspective/PR Perspective Ⅰ:Embodied AI 的新浪潮.md -------------------------------------------------------------------------------- /Perspective/PR Perspective Ⅱ:Robot Learning思考.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Perspective/PR Perspective Ⅱ:Robot Learning思考.md -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/20160707204048899.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/20160707204048899.gif -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/853467-20160630141449671-1058672778.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/853467-20160630141449671-1058672778.png -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/853467-20160630152018906-1524325812.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/853467-20160630152018906-1524325812.png -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/853467-20160630154317562-311369571.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/853467-20160630154317562-311369571.png -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/equation-1584587262781.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/equation-1584587262781.svg -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/equation-1584587282205.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/equation-1584587282205.svg -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/equation-1584587725413.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/equation-1584587725413.svg -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.assets/equation.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.assets/equation.svg -------------------------------------------------------------------------------- /Preliminary/A Simple Guide for NN.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/A Simple Guide for NN.md -------------------------------------------------------------------------------- /Preliminary/Preliminary RL basic knowledge.assets/131433102201.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary RL basic knowledge.assets/131433102201.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary RL basic knowledge.assets/132312526273.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary RL basic knowledge.assets/132312526273.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 1.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 1.md -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/1042406-20180812184148124-1485684702.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/1042406-20180812184148124-1485684702.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/1560008119444.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/1560008119444.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/201019414696.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/201019414696.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/201019447506.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/201019447506.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/201019462191.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/201019462191.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/221402112851854.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/221402112851854.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/221402155049842.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/221402155049842.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/221402163881216.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/221402163881216.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/221402175506201.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/221402175506201.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/interview-14.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/interview-14.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/v2-111ca0554c4504c7aefc9a14d0d92d2f_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/v2-111ca0554c4504c7aefc9a14d0d92d2f_1440w.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/v2-6c24d01db0b8b94589b2fe6a6efcc7b2_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/v2-6c24d01db0b8b94589b2fe6a6efcc7b2_1440w.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/v2-ef32f6901c6a5b8f6eafd8d478ff83ef_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/v2-ef32f6901c6a5b8f6eafd8d478ff83ef_1440w.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/v2-f3c12050c797196c7c37b003905a8d30_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/v2-f3c12050c797196c7c37b003905a8d30_1440w.jpg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.assets/屏幕快照 2016-01-05 下午9.48.30.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.assets/屏幕快照 2016-01-05 下午9.48.30.png -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 2.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 2.md -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 3.assets/equation.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 3.assets/equation.svg -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 3.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 3.md -------------------------------------------------------------------------------- /Preliminary/Preliminary of RL 5.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Preliminary of RL 5.md -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448739547: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448739547 -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448742639: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448742639 -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448756568: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/0_kt9_Z41qxgiI0CDl-1575448756568 -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/0_oh-lF13hYWt2Bd6V_: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/0_oh-lF13hYWt2Bd6V_ -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/1564474069516.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/1564474069516.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/1564549789614.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/1564549789614.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/76a319586cd215c8f2075b938fc6f6e07c81714b.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/76a319586cd215c8f2075b938fc6f6e07c81714b.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/8795d42bd263dcbe55d123e7466b2dd5091490a7.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/8795d42bd263dcbe55d123e7466b2dd5091490a7.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/9ed1a541005a48d51b624c3b329897064ec2c065.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/9ed1a541005a48d51b624c3b329897064ec2c065.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/a325c9e05fa2ccce85eb2384ca00b4888d1c7824.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/a325c9e05fa2ccce85eb2384ca00b4888d1c7824.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/a5132668c0af8733656505c5fb6c1dff4a7907a1.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/a5132668c0af8733656505c5fb6c1dff4a7907a1.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda-1575549924223.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda-1575549924223.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda-1575549965545.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda-1575549965545.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/dc4621f81a5205e6ae31a35b87c54316e043deda.svg -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204161236516.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204161236516.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204164022284.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204164022284.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204200910603.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204200910603.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204201005583.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204201005583.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204204627884.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204204627884.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191204205459823.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191204205459823.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205092649257.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205092649257.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205102211427.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205102211427.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205103234091.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205103234091.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205103636621.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205103636621.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205103810941.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205103810941.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205104741531.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205104741531.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205105318993.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205105318993.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205110708645.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205110708645.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205111303995.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205111303995.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205121930328.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205121930328.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.assets/image-20191205190844049.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.assets/image-20191205190844049.png -------------------------------------------------------------------------------- /Preliminary/Reinforcement Learning Notes.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/Reinforcement Learning Notes.md -------------------------------------------------------------------------------- /Preliminary/img/1558592857137.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/1558592857137.png -------------------------------------------------------------------------------- /Preliminary/img/1558614556514.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/1558614556514.png -------------------------------------------------------------------------------- /Preliminary/img/1560008119444.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/1560008119444.png -------------------------------------------------------------------------------- /Preliminary/img/2019-04-10 19-14-32 的屏幕截图.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/2019-04-10 19-14-32 的屏幕截图.png -------------------------------------------------------------------------------- /Preliminary/img/2019-04-10 19-17-30 的屏幕截图.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/2019-04-10 19-17-30 的屏幕截图.png -------------------------------------------------------------------------------- /Preliminary/img/2019-04-10 21-00-18 的屏幕截图.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/2019-04-10 21-00-18 的屏幕截图.png -------------------------------------------------------------------------------- /Preliminary/img/3-3-1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/3-3-1.png -------------------------------------------------------------------------------- /Preliminary/img/3-3-2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/3-3-2.png -------------------------------------------------------------------------------- /Preliminary/img/4-1-1-1554948278323.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/4-1-1-1554948278323.jpg -------------------------------------------------------------------------------- /Preliminary/img/4-1-1.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/4-1-1.jpg -------------------------------------------------------------------------------- /Preliminary/img/4-5-4.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/4-5-4.png -------------------------------------------------------------------------------- /Preliminary/img/4155986-e77eec1baba5aeea.webp: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/4155986-e77eec1baba5aeea.webp -------------------------------------------------------------------------------- /Preliminary/img/5-1-1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/5-1-1.png -------------------------------------------------------------------------------- /Preliminary/img/DQN3.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/DQN3.png -------------------------------------------------------------------------------- /Preliminary/img/sl4.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/sl4.png -------------------------------------------------------------------------------- /Preliminary/img/屏幕快照 2016-01-05 下午9.48.30.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Preliminary/img/屏幕快照 2016-01-05 下午9.48.30.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/1_9xMQMnSPnAFkWqIY2jvIpQ.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/1_9xMQMnSPnAFkWqIY2jvIpQ.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/1_IdGgdrY_n_9_YfkaCh-dag.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/1_IdGgdrY_n_9_YfkaCh-dag.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/1_YAPmNXea5gKoH3uyRrtITQ.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/1_YAPmNXea5gKoH3uyRrtITQ.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/1_zNQg-o-C2JELQFQjEEDrLw.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/1_zNQg-o-C2JELQFQjEEDrLw.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/Illustration-of-a-bivariate-Gaussian-distribution-The-marginal-and-joint-probability.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/Illustration-of-a-bivariate-Gaussian-distribution-The-marginal-and-joint-probability.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/Sun, 10 May 2020 143250.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/Sun, 10 May 2020 143250.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/image-20200510155719330.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/image-20200510155719330.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/v2-3c25a927c217f13a055794377635faaf_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/v2-3c25a927c217f13a055794377635faaf_1440w.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR GaussianProcessRegression.assets/微信图片编辑_20200510130051.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR GaussianProcessRegression.assets/微信图片编辑_20200510130051.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.assets/2018-05-09-gibbs-100.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.assets/2018-05-09-gibbs-100.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.assets/gibbssampler-2dnormal1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.assets/gibbssampler-2dnormal1.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.assets/image__14_.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.assets/image__14_.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.assets/true.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.assets/true.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.assets/v2-08cb302ac37b757ee390705d822f87f2_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.assets/v2-08cb302ac37b757ee390705d822f87f2_1440w.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMC&MH&Gibbs.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR HMC&MH&Gibbs.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR HMM.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/1_3nBb4AqcriLcENdpBp4fpQ@2x.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/1_3nBb4AqcriLcENdpBp4fpQ@2x.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/1_AZBh2kDanLoTFmb3yzErGQ@2x.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/1_AZBh2kDanLoTFmb3yzErGQ@2x.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/1_HclnWfZrh7Nzuj2_aHkPCQ.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/1_HclnWfZrh7Nzuj2_aHkPCQ.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/1_hKQcryMc6fbcS7r-g0sriQ.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/1_hKQcryMc6fbcS7r-g0sriQ.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/20190511000705.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/20190511000705.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/5cdd91aec0102b09bad70aff4bd0e9b2.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/5cdd91aec0102b09bad70aff4bd0e9b2.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/importance_sampling_concept.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/importance_sampling_concept.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/v2-9514f7703820b5bf99c98405eb413359_1440w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/v2-9514f7703820b5bf99c98405eb413359_1440w.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.assets/v2-eb0945aa2185df958f4568e58300e77a_1440w.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.assets/v2-eb0945aa2185df958f4568e58300e77a_1440w.gif -------------------------------------------------------------------------------- /Probabilistic Robotics/PR IS&MCMC.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR IS&MCMC.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅰ MLE&MAP.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅰ MLE&MAP.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅱ Bayesian: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅱ Bayesian -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅱ MCMC&EM.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅱ MCMC&EM.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/1_3nBb4AqcriLcENdpBp4fpQ@2x.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/1_3nBb4AqcriLcENdpBp4fpQ@2x.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/1_AZBh2kDanLoTFmb3yzErGQ@2x.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/1_AZBh2kDanLoTFmb3yzErGQ@2x.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/Beta_9_7.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/Beta_9_7.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/Example-of-Bayesian-inference-with-a-prior-distribution-a-posterior-distribution-and.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅲ Bayesian_MCMC.assets/Example-of-Bayesian-inference-with-a-prior-distribution-a-posterior-distribution-and.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅲ GaussianProcessRegression.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅲ GaussianProcessRegression.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅳ BayesFilter.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/bayes_nn.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/bayes_nn.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/bayesian_statistics.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/bayesian_statistics.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/extrapolation_graph.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.assets/extrapolation_graph.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅳ BayesNeuralNetwork.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅴ GMM.assets/aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NDQyMjcy.jfif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅴ GMM.assets/aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NDQyMjcy.jfif -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅴ GMM.assets/aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NTQ5ODc3.jfif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅴ GMM.assets/aHR0cDovL2ltZy5ibG9nLmNzZG4ubmV0LzIwMTcwMzAyMTc1NTQ5ODc3.jfif -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅴ GMM.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅴ GMM.md -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅵ BayesGraph.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅶ VariationalInference.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅷ MeanField.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅸ Entropy.assets/OIP.jfif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅸ Entropy.assets/OIP.jfif -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅸ Entropy.assets/image-20200519155921660.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅸ Entropy.assets/image-20200519155921660.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅸ Entropy.assets/image-20200523201708073.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅸ Entropy.assets/image-20200523201708073.png -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅸ Entropy.assets/multicolored-abstract-painting-1095624-710x210.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅸ Entropy.assets/multicolored-abstract-painting-1095624-710x210.jpg -------------------------------------------------------------------------------- /Probabilistic Robotics/PR Ⅸ Entropy.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/PR Ⅸ Entropy.md -------------------------------------------------------------------------------- /Probabilistic Robotics/Probabilistic in Robotic (PR).png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/Probabilistic in Robotic (PR).png -------------------------------------------------------------------------------- /Probabilistic Robotics/Probabilistic in Robotic (PR).xmind: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Probabilistic Robotics/Probabilistic in Robotic (PR).xmind -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/README.md -------------------------------------------------------------------------------- /RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522121546661.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522121546661.png -------------------------------------------------------------------------------- /RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522121626682.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522121626682.png -------------------------------------------------------------------------------- /RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522122022338.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/Deep_Q_From_Demonstration.assets/image-20200522122022338.png -------------------------------------------------------------------------------- /RL from Demonstration/Deep_Q_From_Demonstration.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/Deep_Q_From_Demonstration.md -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524205516760.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524205516760.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524213207117.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524213207117.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524215212011.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524215212011.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524215647493.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524215647493.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221043525.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221043525.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221144933.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221144933.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221938482.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524221938482.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524222429294.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.assets/image-20200524222429294.png -------------------------------------------------------------------------------- /RL from Demonstration/RLfrom_Imperfect_Demonstration.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/RL from Demonstration/RLfrom_Imperfect_Demonstration.md -------------------------------------------------------------------------------- /ROS/ROS Ⅰ:An Introduction.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/ROS/ROS Ⅰ:An Introduction.md -------------------------------------------------------------------------------- /ROS/ROS Ⅱ:报错解决方案集锦.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/ROS/ROS Ⅱ:报错解决方案集锦.md -------------------------------------------------------------------------------- /ROS/ROS Ⅲ:ROS 话题.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /ROS/ROS Ⅳ:ROS 消息&服务.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /ROS/ROS 机器人实战Ⅰ:TurtleBot3 Simulation SLAM + Navigation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/ROS/ROS 机器人实战Ⅰ:TurtleBot3 Simulation SLAM + Navigation.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅰ:Bandit问题与 UCB UCT AlphaGo.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅰ:Bandit问题与 UCB UCT AlphaGo.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅱ:Inductive bias 归纳偏置及其在深度学习中的应用.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅱ:Inductive bias 归纳偏置及其在深度学习中的应用.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅲ:基于图表征的关系推理框架 —— Graph Network.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅲ:基于图表征的关系推理框架 —— Graph Network.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅳ:数理逻辑(命题逻辑、谓词逻辑)知识整理.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅳ:数理逻辑(命题逻辑、谓词逻辑)知识整理.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅴ:命题推理与First Order Logic Reasoning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅴ:命题推理与First Order Logic Reasoning.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅵ:Counterfactual Reasoning 反事实推理及其在深度学习中的应用.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅵ:Counterfactual Reasoning 反事实推理及其在深度学习中的应用.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning Ⅶ:Graph Reasoning 基于图的推理.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning Ⅶ:Graph Reasoning 基于图的推理.md -------------------------------------------------------------------------------- /Reasoning/PR Reasoning 序:Reasoning Robotics 推理机器人.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/PR Reasoning 序:Reasoning Robotics 推理机器人.md -------------------------------------------------------------------------------- /Reasoning/Relational inductive biases, deep learning, and graph networks.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Reasoning/Relational inductive biases, deep learning, and graph networks.md -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528152451409.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528152451409.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528201113694.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528201113694.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528213920267.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200528213920267.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529151635542.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529151635542.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529151707215.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529151707215.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529154918336.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.assets/image-20200529154918336.png -------------------------------------------------------------------------------- /Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/A_survey_on_PS_for_Learning_Robot_controllers_in_a_Handful_of_trials.md -------------------------------------------------------------------------------- /Related Works/Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates.md -------------------------------------------------------------------------------- /Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191208214135640.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191208214135640.png -------------------------------------------------------------------------------- /Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191209102302915.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191209102302915.png -------------------------------------------------------------------------------- /Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191209104206015.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.assets/image-20191209104206015.png -------------------------------------------------------------------------------- /Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/End-to-End Robotic Reinforcement Learning without Reward Engineering.md -------------------------------------------------------------------------------- /Related Works/IROS2019速读(一).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(一).md -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).assets/image-20191221113203207.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).assets/image-20191221113203207.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).assets/image-20191221114139768.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).assets/image-20191221114139768.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).assets/image-20191221114316092.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).assets/image-20191221114316092.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).assets/image-20191221200948469.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).assets/image-20191221200948469.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).assets/image-20191221234256181.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).assets/image-20191221234256181.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(三).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(三).md -------------------------------------------------------------------------------- /Related Works/IROS2019速读(二).assets/image-20191219185107956.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(二).assets/image-20191219185107956.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(二).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(二).md -------------------------------------------------------------------------------- /Related Works/IROS2019速读(五).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(五).md -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).assets/dream_2.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).assets/dream_2.gif -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).assets/dream_6.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).assets/dream_6.gif -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).assets/image-20191222112555208.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).assets/image-20191222112555208.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).assets/image-20191222123748477.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).assets/image-20191222123748477.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).assets/image-20191222144804275.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).assets/image-20191222144804275.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读(四).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读(四).md -------------------------------------------------------------------------------- /Related Works/IROS2019速读.assets/image-20191216203301798.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读.assets/image-20191216203301798.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读.assets/image-20191216211111079.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读.assets/image-20191216211111079.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读.assets/image-20191216212441606.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读.assets/image-20191216212441606.png -------------------------------------------------------------------------------- /Related Works/IROS2019速读.assets/image-20191217203423494.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/IROS2019速读.assets/image-20191217203423494.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/1_AcaPiikZErVv_iFJzWekQg.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/1_AcaPiikZErVv_iFJzWekQg.gif -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/NTM.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/NTM.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/combine-slow-fast-weights.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/combine-slow-fast-weights.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/equation-1577535254476.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/equation-1577535254476.svg -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/equation.svg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/equation.svg -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/few-shot-classification.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/few-shot-classification.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/image-20191226194740259.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/image-20191226194740259.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/image-20191226200939808.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/image-20191226200939808.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/image-20191226202123590.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/image-20191226202123590.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/image-20191226202326065.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/image-20191226202326065.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/lstm-meta-learner.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/lstm-meta-learner.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/maml-algo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/maml-algo.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/maml.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/maml.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/mann-meta-learning.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/mann-meta-learning.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/matching-networks.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/matching-networks.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/meta-network.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/meta-network.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/prototypical-networks-1577419209148.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/prototypical-networks-1577419209148.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/prototypical-networks.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/prototypical-networks.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/relation-network.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/relation-network.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/reptile-algo.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/reptile-algo.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/reptile_vs_FOMAML.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/reptile_vs_FOMAML.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/siamese-conv-net.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/siamese-conv-net.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/train-meta-learner.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/train-meta-learner.png -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.assets/v2-2d61ff11eb1a5a9e52d6c12eb333eb4b_hd.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.assets/v2-2d61ff11eb1a5a9e52d6c12eb333eb4b_hd.jpg -------------------------------------------------------------------------------- /Related Works/Meta learning An Introduction.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta learning An Introduction.md -------------------------------------------------------------------------------- /Related Works/Meta-Reinforcement-Learning An Introduction.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Meta-Reinforcement-Learning An Introduction.md -------------------------------------------------------------------------------- /Related Works/Overcoming Exploration in Reinforcement Learning with Demonstrations.assets/image-20191211211229554.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Overcoming Exploration in Reinforcement Learning with Demonstrations.assets/image-20191211211229554.png -------------------------------------------------------------------------------- /Related Works/Overcoming Exploration in Reinforcement Learning with Demonstrations.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/Overcoming Exploration in Reinforcement Learning with Demonstrations.md -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211163322741.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211163322741.png -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211201710251.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211201710251.png -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211202507981.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191211202507981.png -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191212102538657.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191212102538657.png -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191212102714873.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.assets/image-20191212102714873.png -------------------------------------------------------------------------------- /Related Works/The Predictron End-To-End Learning and Planning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/The Predictron End-To-End Learning and Planning.md -------------------------------------------------------------------------------- /Related Works/When to Trust Your Model Model-Based Policy Optimization.assets/image-20191215201141993.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/When to Trust Your Model Model-Based Policy Optimization.assets/image-20191215201141993.png -------------------------------------------------------------------------------- /Related Works/When to Trust Your Model Model-Based Policy Optimization.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/When to Trust Your Model Model-Based Policy Optimization.md -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101204649.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101204649.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101305253.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101305253.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101402324.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101402324.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101416544.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101416544.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101704078.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101704078.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624101836768.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624101836768.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624102016044.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624102016044.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624102239093.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624102239093.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624102756731.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624102756731.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624102845779.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624102845779.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624102945812.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624102945812.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624103018972.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624103018972.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624103109295.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624103109295.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624103145785.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624103145785.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624104915752.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624104915752.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624105312169.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624105312169.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624105434860.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624105434860.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624105534892.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624105534892.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624110446944.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624110446944.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.assets/image-20200624110634783.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.assets/image-20200624110634783.png -------------------------------------------------------------------------------- /Related Works/智源大会笔记.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Related Works/智源大会笔记.md -------------------------------------------------------------------------------- /Representation/Repre 1:Introduction.md: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Robotics/Bimanual coordination.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Robotics/Bimanual coordination.md -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/download.html: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/download.html -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/grid1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/grid1.png -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/grid2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/grid2.png -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/grid2pin.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/grid2pin.png -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/openai-robotics-hand-with-cube-solved-crop-2000w.jpg: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/openai-robotics-hand-with-cube-solved-crop-2000w.jpg -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/particle2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/particle2.png -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.assets/unnamed.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.assets/unnamed.png -------------------------------------------------------------------------------- /Simulator/MuJoCo机器人建模教程.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo机器人建模教程.md -------------------------------------------------------------------------------- /Simulator/MuJoCo详细使用指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Simulator/MuJoCo详细使用指南.md -------------------------------------------------------------------------------- /Simulator/PyBullet详细使用指南.md: -------------------------------------------------------------------------------- 1 | # PyBullet 详细使用指南 -------------------------------------------------------------------------------- /Simulator/Sim2real.md: -------------------------------------------------------------------------------- 1 | # Sim2real in Robot Learning: An Introduction -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719135624489.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719135624489.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719135706275.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719135706275.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719141215552.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719141215552.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719141544422.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719141544422.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719141645589.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719141645589.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719142143932.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719142143932.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719160240612.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719160240612.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719160620909.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719160620909.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719161117095.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719161117095.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719161134980.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719161134980.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719163653512.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719163653512.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719164027556.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719164027556.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719164332203.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719164332203.png -------------------------------------------------------------------------------- /Structured/PR Structure Ⅱ .assets/image-20200719164745855.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structure Ⅱ .assets/image-20200719164745855.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712132319666.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712132319666.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712152008297.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712152008297.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712152040834.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712152040834.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712152050882.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712152050882.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712154103209.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712154103209.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712155738229.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712155738229.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712161616515.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712161616515.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712165321733.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712165321733.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712170431243.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712170431243.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712171214741.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712171214741.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712172004136.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712172004136.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712172227154.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712172227154.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712174617755.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712174617755.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712175300499.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712175300499.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712175439303.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712175439303.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712175812940.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712175812940.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712180112851.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712180112851.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712195813501.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712195813501.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712203453039.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712203453039.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712212800180.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712212800180.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712212854141.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712212854141.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712213349638.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712213349638.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712213614662.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712213614662.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712214932503.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712214932503.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712215137652.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712215137652.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712230809276.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712230809276.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200712231606429.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200712231606429.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅰ GNN.assets/image-20200719135706275.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅰ GNN.assets/image-20200719135706275.png -------------------------------------------------------------------------------- /Structured/PR Structured Ⅱ:Structured Probabilistic Model Ⅰ.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅱ:Structured Probabilistic Model Ⅰ.md -------------------------------------------------------------------------------- /Structured/PR Structured Ⅲ:马尔可夫、隐马尔可夫 HMM 、条件随机场 CRF 全解析及其python实现.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅲ:马尔可夫、隐马尔可夫 HMM 、条件随机场 CRF 全解析及其python实现.md -------------------------------------------------------------------------------- /Structured/PR Structured Ⅳ:General Conditional Random Field (CRF).md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅳ:General Conditional Random Field (CRF).md -------------------------------------------------------------------------------- /Structured/PR Structured Ⅴ:GraphRNN——依次生成节点和边的图生成模型.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR Structured Ⅴ:GraphRNN——依次生成节点和边的图生成模型.md -------------------------------------------------------------------------------- /Structured/PR StructuredⅠ:Graph Neural Network An Introduction .md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Structured/PR StructuredⅠ:Graph Neural Network An Introduction .md -------------------------------------------------------------------------------- /Tools/Atlas/Atlas 使用指南.assets/Atlas软硬件架构.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas 使用指南.assets/Atlas软硬件架构.png -------------------------------------------------------------------------------- /Tools/Atlas/Atlas 使用指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas 使用指南.md -------------------------------------------------------------------------------- /Tools/Atlas/Atlas安装配置流程.eps: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas安装配置流程.eps -------------------------------------------------------------------------------- /Tools/Atlas/Atlas安装配置流程.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas安装配置流程.png -------------------------------------------------------------------------------- /Tools/Atlas/Atlas软硬件架构.eps: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas软硬件架构.eps -------------------------------------------------------------------------------- /Tools/Atlas/Atlas软硬件架构.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Atlas/Atlas软硬件架构.png -------------------------------------------------------------------------------- /Tools/C++部署Pytorch模型方法.docx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/C++部署Pytorch模型方法.docx -------------------------------------------------------------------------------- /Tools/Docker/Docker Ⅰ:安装与测试指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Docker/Docker Ⅰ:安装与测试指南.md -------------------------------------------------------------------------------- /Tools/Docker/Docker Ⅱ:管理与使用命令手册.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Docker/Docker Ⅱ:管理与使用命令手册.md -------------------------------------------------------------------------------- /Tools/Docker/Docker Ⅲ:Nvidia Docker安装与测试指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Docker/Docker Ⅲ:Nvidia Docker安装与测试指南.md -------------------------------------------------------------------------------- /Tools/Docker/Docker Ⅳ:Nvidia Docker使用命令手册.md: -------------------------------------------------------------------------------- 1 | # Docker Ⅳ:Nvidia Docker使用命令手册 2 | 3 | [[TOC]] 4 | 5 | > 工欲善其事,必先利其器 -------------------------------------------------------------------------------- /Tools/Docker/Docker Ⅴ:Docker与Nvidia Docker踩坑与解决方案记录集.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Docker/Docker Ⅴ:Docker与Nvidia Docker踩坑与解决方案记录集.md -------------------------------------------------------------------------------- /Tools/Habitat/Habitat Challenge提交指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Habitat/Habitat Challenge提交指南.md -------------------------------------------------------------------------------- /Tools/Tools 1:Qt 转 PyQt5 的 Pycharm 插件.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 1:Qt 转 PyQt5 的 Pycharm 插件.md -------------------------------------------------------------------------------- /Tools/Tools 3:python socket 服务器与客户端双向通信.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 3:python socket 服务器与客户端双向通信.md -------------------------------------------------------------------------------- /Tools/Tools 4:Python三行转并行——真香.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 4:Python三行转并行——真香.md -------------------------------------------------------------------------------- /Tools/Tools 5:Python三行转并行后续——全局变量.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 5:Python三行转并行后续——全局变量.md -------------------------------------------------------------------------------- /Tools/Tools 6:如何用Readthedoc写一份优雅的技术文档.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 6:如何用Readthedoc写一份优雅的技术文档.md -------------------------------------------------------------------------------- /Tools/Tools 7:Python颜色设置.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 7:Python颜色设置.md -------------------------------------------------------------------------------- /Tools/Tools 8:Tex符号大全.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 8:Tex符号大全.md -------------------------------------------------------------------------------- /Tools/Tools 9:Zotero使用指南.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Tools 9:Zotero使用指南.md -------------------------------------------------------------------------------- /Tools/Ubuntu/Ubuntu系统问题.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/Ubuntu/Ubuntu系统问题.md -------------------------------------------------------------------------------- /Tools/color.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Tools/color.py -------------------------------------------------------------------------------- /Utils/HTML2PDF.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Utils/HTML2PDF.py -------------------------------------------------------------------------------- /Utils/PDFselector.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Utils/PDFselector.py -------------------------------------------------------------------------------- /Utils/basic_plot.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/Utils/basic_plot.py -------------------------------------------------------------------------------- /img/image-20230825121432059.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Skylark0924/Reinforcement-Learning-in-Robotics/HEAD/img/image-20230825121432059.png --------------------------------------------------------------------------------