IQL results different with the paper #172

dssrgu · 2023-01-20T04:45:14Z

Hi,

The IQL results from this repo seem to differ from the original paper.

According to the README of the IQL example code here, IQL scores an average raw return of about 1500 on hopper-medium-expert with offline training:
https://github.com/rail-berkeley/rlkit/tree/master/examples/iql

However, the original paper notes that IQL scores 91.5 in normalized average return (which is about 2950 in raw return):
https://arxiv.org/pdf/2110.06169.pdf

Can you take a look at this and check what is causing the difference?

Thank you!

anair13 · 2024-06-17T17:38:58Z

Sorry for the late response, but the IQL experiments in the paper were run in jax and should be reproducible with the other repo: https://github.com/ikostrikov/implicit_q_learning

This reimplementation is pytorch is for convenience and likely has minor initialization differences etc.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

IQL results different with the paper #172

IQL results different with the paper #172

dssrgu commented Jan 20, 2023

anair13 commented Jun 17, 2024

IQL results different with the paper #172

IQL results different with the paper #172

Comments

dssrgu commented Jan 20, 2023

anair13 commented Jun 17, 2024