└── README.md /README.md: -------------------------------------------------------------------------------- 1 | # Reinforcement-Learning 2 | List of our studies related to reinforcement learning 3 | 4 | ## Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk 5 | Dohyeong Kim and Songhwai Oh, "Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk," IEEE Robotics and Automation Letters, vol. 7, no. 3, pp. 7644-7651, Jul. 2022. [[paper](https://ieeexplore.ieee.org/document/9802647)][[code](https://github.com/rllab-snu/Off-Policy-TRC)] 6 | 7 | ## TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning 8 | Dohyeong Kim and Songhwai Oh, “TRC: Trust region conditional value at risk for safe reinforcement learning,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 2621–2628, Apr. 2022. [[paper](https://ieeexplore.ieee.org/document/9677982)][[code](https://github.com/rllab-snu/Trust-Region-CVaR)] 9 | 10 | ## Dynamics-Aware-Metric-Embedding 11 | Mineui Hong, Kyungjae Lee, Minjae Kang, Wonsuhk Jung, and Songhwai Oh,`Dynamics-Aware-Metric-Embedding: Metric Learning in a Latent Space for Visual Planning,' IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 3388-3395, Apr. 2022. [[paper](https://ieeexplore.ieee.org/document/9696367)][[code](https://github.com/rllab-snu/Dynamics-Aware-Metric-Embedding)] 12 | 13 | ## Adaptive Soft Actor Critic 14 | Jae In Kim, Mineui Hong, Kyungjae Lee, DongWook Kim, Yong-Lae Park, and Songhwai Oh, "Learning to Walk A Tripod Mobile Robot Using Nonlinear Soft Vibration Actuators with Entropy Adaptive Reinforcement Learning," IEEE International Conference on Robotics and Automation (ICRA), May 2020. (RA-L option) [[paper](https://ieeexplore.ieee.org/document/8978537)][[code](https://github.com/rllab-snu/Adaptive-Soft-Actor-Critic)] 15 | 16 | ## Soft Action Particle Method 17 | Minjae Kang*, Kyungjae Lee*, and Songhwai Oh, "Soft Action Particle Deep Reinforcement Learning for a Continuous Action Space," *in Proc. of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)*, Nov. 2019. 18 | [[paper](http://rllab.snu.ac.kr/publications/papers/2019_iros_sapdrl.pdf)] [[code](https://github.com/rllab-snu/soft_action_particle_method)] 19 | 20 | ## Tsallis Actor Critic 21 | Kyungjae Lee, Sungyub Kim, Sungbin Lim, Sungjoon Choi, and Songhwai Oh, "Tsallis Reinforcement Learning: A Unified Framework for Maximum Entropy Reinforcement Learning," *arXiv preprint arXiv:1902.00137*, Feb. 2019. 22 | [[paper](https://arxiv.org/pdf/1902.00137.pdf)] [[code](https://github.com/rllab-snu/tsallis_actor_critic_mujoco)] 23 | 24 | ## Robot Learning Packages 25 | Kyungjae Lee. [[code](https://github.com/rllab-snu/Deep-Reinforcement-Learning)] 26 | --------------------------------------------------------------------------------