❓ Question
Hi, first off, great work on making Stable-Baselines3 an excellent resource for deep reinforcement learning practitioners.
I noticed that your DQN implementation features a target Q-network, which resembles Google DeepMind's paper "Deep Reinforcement Learning with Double Q-learning". Meanwhile, Neural Fitted Q Iteration (Riedmiller) computes the target using the current estimate of the Q-function. I am looking for clarification on whether this DQN is truly a Double DQN. I hope to use this information to accurately implement prioritized experience replay on top of your DQN implementation.
Thanks,
Oliver
Checklist
I have checked that there is no similar issue in the repo
It is not; the current DQN is vanilla DQN (cf. the docs). We are working (and welcome help) on #622, and there is a PR for PER.
However, we do provide QR-DQN in the SB3 Contrib repo.
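For reference, here is a minimal sketch (not SB3's actual code) of how the two one-step TD targets differ. The function and tensor names are placeholders; `q_net` / `q_net_target` are chosen to mirror SB3's naming, everything else is assumed for illustration:

```python
import torch

def dqn_target(q_net_target, rewards, next_obs, dones, gamma=0.99):
    # Vanilla DQN: the target network both selects and evaluates
    # the greedy next action.
    with torch.no_grad():
        next_q = q_net_target(next_obs).max(dim=1).values
    return rewards + gamma * (1.0 - dones) * next_q

def double_dqn_target(q_net, q_net_target, rewards, next_obs, dones, gamma=0.99):
    # Double DQN (van Hasselt et al.): the online network selects the action,
    # the target network evaluates it, reducing overestimation bias.
    with torch.no_grad():
        next_actions = q_net(next_obs).argmax(dim=1, keepdim=True)
        next_q = q_net_target(next_obs).gather(1, next_actions).squeeze(1)
    return rewards + gamma * (1.0 - dones) * next_q
```

Having a separate target network does not by itself make it Double DQN; the distinguishing detail is whether the greedy next action is selected by the online network (Double DQN) or by the target network itself (vanilla DQN).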