-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug(logger): wandb fails on sweep #1290
Comments
+1, I faced the same issue when using pytorch lightning with wandb sweeps. To summarize, wandb automatically logs hyperparams when we run the wandb sweep agent on a machine. Later, pytorch lightning again tries to log same hyperparams but due to precision error between lightning and wandb already logged hyperparams, wandb throws this error. Just a guess: wandb sweep agent might be using double format to generate new hyperparams and when lightning receives those args from command line, it converts them to float and tries to log it. I haven't digged in detail where these hyperparams get altered, it could be on wandb side or lightning side. I reported this issue to wandb and got the following response:
|
cc: @neggert, @Borda, @williamFalcon |
@borisdayma @calclavia pls ^^ |
The problem is that it tries to log this value twice and is probably called before automatically by pytorch-lightning. See an example of using pytorch-lightning with wandb (including sweeps) here: https://github.com/borisdayma/lightning-kitti I'll be adding it to the pytorch-lightning repo later but still need to push a PR related to the |
@borisdayma is it fixed now? |
overwrite run config parameters due to precision error fix Lightning-AI#1290
* fix(wandb): allow use of sweeps overwrite run config parameters due to precision error fix #1290 * docs(wandb): update changelog * test(wandb): update config test Co-authored-by: William Falcon <waf2107@columbia.edu>
🐛 Bug
When using
wandb
sweeps for hyperparameters search, I get this error:The reason is I ran:
Which I guess has some problem with floating-point numbers in high accuracy?
The text was updated successfully, but these errors were encountered: