DDPG-PyTorch

Deep Deterministic Policy Gradient || PyTorch || OpenAI Gym

Lorenzo Soligo, Ca' Foscari University of Venice. Project for the Artificial Intelligence: Machine Learning and Pattern Recognition course.

Instructions

Setup

Create the Conda environment: conda env create -f environment.yml
Activate the environment: conda activate deeprl
Install the requirements from pip: pip install -r requirements.txt

Running the code

python main.py

You can use the following flags:

--eval: will run an episode using an already saved model of the actor. Don't use this if you want to train the model.
--env: name of the OpenAI Gym environment to use. The default is LunarLanderContinuous-v2. Notice that DDPG is developed to be used with continuous action spaces.

Running a sample with LunarLander

Copy the models folder from results/lunarlander into the root of the project.
Run python main.py --eval to test LunarLander, or python main.py --eval --env "AnotherEnv" to test another environment
- beware that only LunarLander is provided.

Further information

This implementation does not precisely follow the one presented in the paper. As a matter of fact, I noticed that not using batch normalization and adding the actions in the critic's input layer drastically improved performance.
The results folder contains videos, Tensorboard logs and working models for LunarLander

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
models		models
results/lunarlander		results/lunarlander
src		src
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DDPG-PyTorch

Instructions

Setup

Running the code

Running a sample with LunarLander

Further information

About

Releases

Packages

Contributors 2

Languages

License

LolloneS/DDPG-PyTorch

Folders and files

Latest commit

History

Repository files navigation

DDPG-PyTorch

Instructions

Setup

Running the code

Running a sample with LunarLander

Further information

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages