├── .gitignore
├── CITATION.cff
├── LICENSE
├── MANIFEST.in
├── README.md
├── docs
    ├── documentation.md
    ├── results_plot_dark.png
    ├── results_plot_light.png
    └── terminal_interface.gif
├── example_a2c.py
├── example_short.py
├── example_standard.py
├── pyproject.toml
├── requirements.txt
├── run
    ├── run_a2c.py
    └── run_reinforce.py
└── unstable
    ├── __init__.py
    ├── _types.py
    ├── actor.py
    ├── buffers.py
    ├── collector.py
    ├── game_scheduler.py
    ├── learners
        ├── __init__.py
        ├── a2c_learner.py
        ├── base.py
        ├── reinforce_learner.py
        └── utils.py
    ├── model_registry.py
    ├── reward_transformations
        ├── __init__.py
        ├── transformation_final.py
        ├── transformation_sampling.py
        └── transformation_step.py
    ├── runtime.py
    ├── samplers
        ├── __init__.py
        ├── env_samplers.py
        └── model_samplers.py
    ├── terminal_interface.py
    ├── trackers.py
    └── utils
        ├── __init__.py
        ├── logging.py
        ├── misc.py
        └── templates.py


/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/.gitignore
--------------------------------------------------------------------------------
/CITATION.cff:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/CITATION.cff
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/LICENSE
--------------------------------------------------------------------------------
/MANIFEST.in:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/MANIFEST.in
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/README.md
--------------------------------------------------------------------------------
/docs/documentation.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/docs/documentation.md
--------------------------------------------------------------------------------
/docs/results_plot_dark.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/docs/results_plot_dark.png
--------------------------------------------------------------------------------
/docs/results_plot_light.png:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/docs/results_plot_light.png
--------------------------------------------------------------------------------
/docs/terminal_interface.gif:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/docs/terminal_interface.gif
--------------------------------------------------------------------------------
/example_a2c.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/example_a2c.py
--------------------------------------------------------------------------------
/example_short.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/example_short.py
--------------------------------------------------------------------------------
/example_standard.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/example_standard.py
--------------------------------------------------------------------------------
/pyproject.toml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/pyproject.toml
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/requirements.txt
--------------------------------------------------------------------------------
/run/run_a2c.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/run/run_a2c.py
--------------------------------------------------------------------------------
/run/run_reinforce.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/run/run_reinforce.py
--------------------------------------------------------------------------------
/unstable/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/__init__.py
--------------------------------------------------------------------------------
/unstable/_types.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/_types.py
--------------------------------------------------------------------------------
/unstable/actor.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/actor.py
--------------------------------------------------------------------------------
/unstable/buffers.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/buffers.py
--------------------------------------------------------------------------------
/unstable/collector.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/collector.py
--------------------------------------------------------------------------------
/unstable/game_scheduler.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/game_scheduler.py
--------------------------------------------------------------------------------
/unstable/learners/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/learners/__init__.py
--------------------------------------------------------------------------------
/unstable/learners/a2c_learner.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/learners/a2c_learner.py
--------------------------------------------------------------------------------
/unstable/learners/base.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/learners/base.py
--------------------------------------------------------------------------------
/unstable/learners/reinforce_learner.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/learners/reinforce_learner.py
--------------------------------------------------------------------------------
/unstable/learners/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/learners/utils.py
--------------------------------------------------------------------------------
/unstable/model_registry.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/model_registry.py
--------------------------------------------------------------------------------
/unstable/reward_transformations/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/reward_transformations/__init__.py
--------------------------------------------------------------------------------
/unstable/reward_transformations/transformation_final.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/reward_transformations/transformation_final.py
--------------------------------------------------------------------------------
/unstable/reward_transformations/transformation_sampling.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/reward_transformations/transformation_sampling.py
--------------------------------------------------------------------------------
/unstable/reward_transformations/transformation_step.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/reward_transformations/transformation_step.py
--------------------------------------------------------------------------------
/unstable/runtime.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/runtime.py
--------------------------------------------------------------------------------
/unstable/samplers/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/samplers/__init__.py
--------------------------------------------------------------------------------
/unstable/samplers/env_samplers.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/samplers/env_samplers.py
--------------------------------------------------------------------------------
/unstable/samplers/model_samplers.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/samplers/model_samplers.py
--------------------------------------------------------------------------------
/unstable/terminal_interface.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/terminal_interface.py
--------------------------------------------------------------------------------
/unstable/trackers.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/trackers.py
--------------------------------------------------------------------------------
/unstable/utils/__init__.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/utils/__init__.py
--------------------------------------------------------------------------------
/unstable/utils/logging.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/utils/logging.py
--------------------------------------------------------------------------------
/unstable/utils/misc.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/utils/misc.py
--------------------------------------------------------------------------------
/unstable/utils/templates.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/LeonGuertler/UnstableBaselines/HEAD/unstable/utils/templates.py
--------------------------------------------------------------------------------