├── .gitignore ├── LICENSE ├── README.md ├── bwkucb.py ├── deterministic.py ├── egreedy.py ├── env_dpls.py ├── monte_carlo.py ├── project_report.pdf ├── qlearning.py ├── rand.py ├── sarsa.py └── utils.py /.gitignore: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/.gitignore -------------------------------------------------------------------------------- /LICENSE: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/LICENSE -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/README.md -------------------------------------------------------------------------------- /bwkucb.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/bwkucb.py -------------------------------------------------------------------------------- /deterministic.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/deterministic.py -------------------------------------------------------------------------------- /egreedy.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/egreedy.py -------------------------------------------------------------------------------- /env_dpls.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/env_dpls.py -------------------------------------------------------------------------------- /monte_carlo.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/monte_carlo.py -------------------------------------------------------------------------------- /project_report.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/project_report.pdf -------------------------------------------------------------------------------- /qlearning.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/qlearning.py -------------------------------------------------------------------------------- /rand.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/rand.py -------------------------------------------------------------------------------- /sarsa.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/sarsa.py -------------------------------------------------------------------------------- /utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/Lunj12/RL-Bandits-with-Knapsacks/HEAD/utils.py --------------------------------------------------------------------------------