This blog records some of my thoughts while studying deep reinforcement learning and related fields.
Posts
Tiny Project 2: Snake Game AI
Tutorial 3: RLlib (4) — Reinforcement Learning with RLlib in the Unity Game Engine
Tutorial 3: RLlib (3) — Scaling Multi-Agent Reinforcement Learning
Tutorial 3: RLlib (2) — A Gentle RLlib Tutorial
Tutorial 3: RLlib (1) — RLlib in 60 seconds
Paper 56: Unity: A General Platform for Intelligent Agents
Paper 55: AndroidEnv: A Reinforcement Learning Platform for Android
Paper 54: FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance
Tiny Project 1: Using Reinforcement Learning to Trade Stocks
Tutorial 2: Stable Baselines
Tutorial 1: Creating a Custom OpenAI Gym Environment for Stock Trading
Concept 9: Deep Reinforcement Learning for Trading (2)
Concept 9: Deep Reinforcement Learning for Trading (1)
Paper 53: Large-Scale Study of Curiosity-Driven Learning
Paper 52: Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning
Concept 8: Dual Gradient Descent
Paper 51: Learning to Walk in the Real World with Minimal Human Effort
Paper 50: Soft Q-Network (SQN)
Paper 49: Reinforcement Learning with Deep Energy-Based Policies (Soft Q Learning)
Paper 48: Soft Actor-Critic Algorithms and Applications (SAC)
Speech 13: QuantCon 2018-Tom Starke: Reinforcement Learning for Trading Practical Examples and Lessons Learned
Paper 47: Rainbow: Combining Improvements in Deep Reinforcement Learning
Paper 46: A Distributional Perspective on Reinforcement Learning (Categorical DQN)
Paper 45: Parameter Space Noise for Exploration
Paper 44: Noisy Networks for Exploration (Noisy DQN)
Paper 43: Dueling Network Architectures for Deep Reinforcement Learning (Dueling DQN)
Paper 42: A Solution to China Competitive Poker Using Deep Learning
Speech 12: Unity-Jeffrey Shih: Successfully Use Deep Reinforcement Learning in Testing and NPC Development
Paper 41: Attention Is All You Need
Paper 40: Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Paper 39: Deep Attention Recurrent Q-Network (DARQN)
Paper 38: Recurrent Models of Visual Attention (RAM)
Concept 7: Embedding
Paper 37: Dota 2 with Large Scale Deep Reinforcement Learning (OpenAI Five) — Appendix
Paper 37: Dota 2 with Large Scale Deep Reinforcement Learning (OpenAI Five) — Main Text
Paper 36: Fiber: A Platform for Efficient Development and Distributed Training for Reinforcement Learning and Population-Based Methods
Paper 35: Human-level Control through Deep Reinforcement Learning (DQN2015)
Paper 34: Playing Atari with Deep Reinforcement Learning (DQN2013)
Speech 11: Webinar: Defeating Bots with Machine Learning
Speech 10: Webinar: Optimize Your Game Architecture for AI
Speech 9: Webinar: Game Playing Bots for Game Development
Paper 33: Implementation Matters in Deep Policy Gradients: A Case Study on PPO and TRPO
Paper 32: A Closer Look at Deep Policy Gradients
Paper 31: Deep Reinforcement Learning that Matters
Paper 30: High-dimensional Continuous Control using Generalized Advantage Estimation (GAE)
Paper 29: Prioritized Experience Replay (PER)
Speech 8: OpenAI-Ilya Sutskever: Meta-Learning and Self-Play
Speech 7: DeepMind-Demis Hassabis: The Power of Self-Learning Systems
Paper 28: Distributed Distributional Deterministic Policy Gradients (D4PG)
Paper 27: Emergence of Locomotion Behaviours in Rich Environments (DPPO)
Paper 26: GA3C: GPU-based A3C for Deep Reinforcement Learning
Paper 25: Acme: A Research Framework for Distributed Reinforcement Learning
Paper 24: RLlib: Abstractions for Distributed Reinforcement Learning
Speech 6: NetEase-Tangjie Lv: Using Reinforcement Learning to Develop Game AI
Paper 23: Ray: A Distributed Framework for Emerging AI Applications
Speech 5: ScaledML 2020-Andrej Karpathy: AI for Full-Self Driving
Paper 22: Google Research Football: A Novel Reinforcement Learning Environment
Paper 21: SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Paper 20: Making Efficient Use of Demonstrations to Solve Hard Exploration Problems (R2D3)
Paper 19: Deep Recurrent Q-Learning for Partially Observable MDPs (DRQN)
Paper 18: Recurrent Experience Replay in Distributed Reinforcement Learning (R2D2)
Concept 6: RNN, LSTM and GRU
Speech 4: NeurIPS 2018-Joelle Pineau: Reproducible, Reusable, and Robust Reinforcement Learning
Speech 3: NeurIPS 2019-Katja Hofmann: Reinforcement Learning Past, Present, and Future Perspective
Speech 2: KHIPU 2019-Nando de Freitas: Reinforcement Learning
Paper 17: Distributed Prioritized Experience Replay (Ape-X)
Paper 16: IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Paper 15: Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor (SAC)
Paper 14: Addressing Function Approximation Error in Actor-Critic Methods (TD3)
Paper 13: Deep Reinforcement Learning with Double Q-learning (Double DQN)
Paper 12: Continuous Control with Deep Reinforcement Learning (DDPG)
Paper 11: Deterministic Policy Gradient Algorithms (DPG)
Paper 10: Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments (MADDPG)
Paper 9: Hierarchical Reinforcement Learning for Multi-agent MOBA Game (Honour of Kings)
Paper 8: Playing FPS Games with Deep Reinforcement Learning (ViZDoom)
Paper 7: Mastering Complex Control in MOBA Games with Deep Reinforcement Learning (Honour of Kings)
Paper 6: Hierarchical Macro Strategy Model for MOBA Game AI (Honour of Kings)
Paper 5: Curiosity-driven Exploration by Self-supervised Prediction (ICM)
Paper 4: Asynchronous Methods for Deep Reinforcement Learning (A3C)
Speech 1: DLRLSS 2019-James Wright: Multi-Agent Systems
Paper 3: Scalable Trust-Region Method for Deep Reinforcement Learning using Kronecker-Factored Approximation (ACKTR)
Paper 2: Proximal Policy Optimization Algorithms (PPO)
Paper 1: Trust Region Policy Optimization (TRPO)
Concept 5: Kullback–Leibler Divergence
Concept 4: Monte Carlo Tree Search
Concept 3: Artificial General Intelligence
Concept 2: Multimodal Distribution
Concept 1: Cross Entropy
Hello World