Link to video summary of project’s progress and stories of our successes, challenges, and future works can be found here.

DuckieTown Project Status

Project Summary

We decided to work on the Duckietown project. The goal of our project is to develop an intelligent system for DuckieBot that enables self-navigation within the “Town” using reinforcement learning and machine learning. Our focus is on developing a simulation-based Duckietown environment and deploying a machine-learning-based solution to train the DuckieBot to detect lanes, follow them, recognize signals, stop when necessary, and avoid collisions with other objects such as walls, trees, and buses. Additionally, the system should respond effectively to environmental factors.

To achieve this, we will either purchase or create a DuckieBot equipped with a camera for sensor data collection. The DuckieBot should be capable of driving (accelerating), stopping, and turning. This is an exciting and novel project for us, and we believe it will significantly enhance our technical knowledge and teamwork skills. We are committed to collaborating effectively to ensure the successful completion of this project.

Approach

Since we aim to compare the performance of Soft Actor-Critic (SAC) and Proximal Policy Optimization (PPO), our team is split into two groups: one focusing on PPO and the other on SAC training.

SAC Approach

For SAC (not using Stable-Baselines3’s SAC implementation), we use a 5-dimensional state space:

SAC Rewards

The 2-dimensional action space consists of:

The reward structure is designed as follows:

PPO Approach

For PPO, we follow a CNN-based policy to process image-based observations. The key hyperparameters include:

Training is conducted for 100,000 timesteps, utilizing a vectorized environment setup with make_vec_env and VecTransposeImage to ensure proper input shape.

Evaluation

Quantitative Evaluation

We assess performance using key metrics:

SAC Rewards SAC Rewards SAC Rewards

SAC Rewards SAC Rewards SAC Rewards

Due to these challenges, we plan to:

Qualitative Evaluation

Remaining Goals and Challenges

Challenges

Goals

Resources Used

Libraries

GitHub Repositories

Papers and Websites