fix: unique file names and squashed neptune stdout #57

JemmaLDaniel · 2024-03-01T13:19:45Z

What?

Neptune was printing extra lines between tqdm progress bar updates, which has been suppressed so the progress bar displays as expected.
Where different runs are finishing at the same time, data_key is not unique. This causes two JSON files to be stored in one folder with the same data_key name and messes up plotting downstream. This has been fixed so that each JSON file is stored in a unique folder.

Why?

Prettier prints and easier to read.
JSON files are downloaded as marl-eval's concatenating and plotting tools require.

How?

During the download phase, set Neptune to only log messages at an error level or higher (i.e., we don't want the extra info).
Append a counter to each file name before creating the directory for the JSON file.

RuanJohn

Thank you for this @JemmaLDaniel! Just a minor style suggestion. I think we can use enumerate and manually control the tqdm progress bar. This way we do not have to set up and manually increment a counter.

I also think if we just initialise the download file with the counter variable we do not have to rename things, because every file will have a unique name when it is downloaded.

marl_eval/json_tools/json_utils.py

RuanJohn · 2024-03-01T14:03:28Z

marl_eval/json_tools/json_utils.py

+                # and doesn't need to be unzipped.
+                # Rename the file by appending run_counter to its existing name
+                renamed_file_path = f"{file_path}_{counter}"
+                os.rename(file_path, renamed_file_path)
            except Exception as e:
                print(f"An error occurred while unzipping or storing {file_path}: {e}")


Another minor detail here. I think it will be nice if we give the Neptune run ID here as well. This way someone could quickly know exactly which Neptune run is causing an issue and manually inspect JSON files if they wanted to.

I was doing this previously, but I opted for the run_counter in case run_id wasn't unique enough when there is more than one data_key per run_id. I can easily append it :)

Something that would still be nice to show is that the error happens when trying to pull data for run MAV-30546 for example. Then someone could easily inspect what these is is if they wanted to because file_parh and the error e will not make it easy to see which Neptune run is causing the issue.

run_id is appended to file_path instead of using i in the enumerator. Do you want it to be more explicit than that in the print message?

You are absolutely right. But since file_path will look something like <some_path>/<some_key>_MAV-30546_<j> and the most useful info for finding issues in Neptune runs will be just the run_id something that gives run id info and file path info could be:

Suggested change

print(f"An error occurred while unzipping or storing {file_path}: {e}")

print(

"The following error occurred while unzipping or storing JSON data for run "

f"{run_id} at path {file_path}: {e}"

)

This is a super nitpicky thing though, so we can also ignore it 😅

I agree, that is more readable! No need to lower the marl-eval standards 😎

RuanJohn

Thank you for this @JemmaLDaniel 🔥

fix: unique file names and squashed neptune stdout

2d0cddf

RuanJohn requested changes Mar 1, 2024

View reviewed changes

RuanJohn reviewed Mar 1, 2024

View reviewed changes

JemmaLDaniel added 2 commits March 1, 2024 17:24

chore: switch counter to enum

08bdb71

chore: more readable error message

e9db8c3

RuanJohn approved these changes Mar 4, 2024

View reviewed changes

JemmaLDaniel merged commit 63895c9 into instadeepai:main Mar 6, 2024
3 checks passed

JemmaLDaniel deleted the fix/pull-neptune-data branch March 6, 2024 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: unique file names and squashed neptune stdout #57

fix: unique file names and squashed neptune stdout #57

JemmaLDaniel commented Mar 1, 2024 •

edited

Loading

RuanJohn left a comment

RuanJohn Mar 1, 2024

JemmaLDaniel Mar 1, 2024

RuanJohn Mar 1, 2024 •

edited

Loading

JemmaLDaniel Mar 1, 2024 •

edited

Loading

RuanJohn Mar 4, 2024

JemmaLDaniel Mar 4, 2024 •

edited

Loading

RuanJohn left a comment

-                print(f"An error occurred while unzipping or storing {file_path}: {e}")
+                print(
+                    "The following error occurred while unzipping or storing JSON data for run "
+                    f"{run_id} at path {file_path}: {e}"
+                )

fix: unique file names and squashed neptune stdout #57

fix: unique file names and squashed neptune stdout #57

Conversation

JemmaLDaniel commented Mar 1, 2024 • edited Loading

What?

Why?

How?

RuanJohn left a comment

Choose a reason for hiding this comment

RuanJohn Mar 1, 2024

Choose a reason for hiding this comment

JemmaLDaniel Mar 1, 2024

Choose a reason for hiding this comment

RuanJohn Mar 1, 2024 • edited Loading

Choose a reason for hiding this comment

JemmaLDaniel Mar 1, 2024 • edited Loading

Choose a reason for hiding this comment

RuanJohn Mar 4, 2024

Choose a reason for hiding this comment

JemmaLDaniel Mar 4, 2024 • edited Loading

Choose a reason for hiding this comment

RuanJohn left a comment

Choose a reason for hiding this comment

JemmaLDaniel commented Mar 1, 2024 •

edited

Loading

RuanJohn Mar 1, 2024 •

edited

Loading

JemmaLDaniel Mar 1, 2024 •

edited

Loading

JemmaLDaniel Mar 4, 2024 •

edited

Loading