MicroRacer

A didactic car-racer micro environment for Deep Reinforcement Learning

Aim and motivation

MicroRacer is a simple environment inspired by car racing and especially meant for the didactics of Deep Reiforcement Learning. The complexity of the environment has been explicitly calibrated to allow to experiment with many different methods, networks and hyperparameters settings without the need of sophisticated software and no fear of getting bored by too long training times.

The MicroRacer environment

MicroRacer generates new random circular tracks at each episode. The Random track is defined by CubicSplines delimiting the inner and outer border; the number of turns and the width of the track are configurable. From this description, we derive a dense matrix of points of dimension 1300x1300 providing information about positions inside the track. This is the actual definition of the track used by the environment.

State and actions

MicroRacer does not intend to model realistic car dynamics. The model is explicitly meant to be as simple as possible, with the minimal amount of complexity that still makes learning interesting and challenging.

The state information available to actors is composed by:

a lidar-like vision of the track from the car's frontal perspective. This is an array of 19 values, expressing the distance of the car from the track's borders along uniformly spaced angles in the range -30°,+30°.
the car scalar velocity.

The actor (the car) has no global knowledge of the track, and no information about its absolute or relative position/direction w.r.t. the track.

The actor is supposed to answer with two actions, both in the range [-1,1]:

acceleration/deceleration
turning angle Maximum values for acceleration and turning angles can be configured.

Available learning models

We currently equip the code with basic actors trained with DDPG, PPO, SAC and DSAC (weights included). Students are supposed to develop their own models.

Requirements

The project just requires basic libraries: tensorflow, numpy, pyplotlib, scipy.interpolate (for Cubic Splines) and cython. A requirements file is available so you can easily install all the dependencies just using "pip install -r requirements.txt"

Plans for future work

We are extremely interested in collaborations, especially with colleagues teaching DRL at other Universities. We plan to organize soon a Championship.

Race examples

Racers in the legend in order from top to bottom: DDPG, TD3, SAC, PPO.

DDPG_TD3_SAC_PPO.mp4

Racers in the legend in order from top to bottom: SAC, DSAC.

SAC_DSAC.mp4

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
weights		weights
.gitignore		.gitignore
DDPG.py		DDPG.py
DDPG2.py		DDPG2.py
DSAC.py		DSAC.py
LICENSE		LICENSE
PPO.py		PPO.py
README.md		README.md
SAC.py		SAC.py
TD3.py		TD3.py
lidar.pyx		lidar.pyx
requirements.txt		requirements.txt
tracks.py		tracks.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MicroRacer

Aim and motivation

The MicroRacer environment

State and actions

Available learning models

Requirements

Plans for future work

Race examples

About

Releases

Packages

Contributors 2

Languages

License

asperti/MicroRacer

Folders and files

Latest commit

History

Repository files navigation

MicroRacer

Aim and motivation

The MicroRacer environment

State and actions

Available learning models

Requirements

Plans for future work

Race examples

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages