Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

{lib}[foss/2022a] TensorFlow v2.10.1 w/ Python 3.10.4 #17168

Conversation

Flamefire
Copy link
Contributor

@Flamefire Flamefire commented Jan 20, 2023

(created using eb --new-pr)

I'm working on TF 2.11.0 but that is causing me some trouble, so adding this as an intermediate in addition to 2.9.1

Edit: I think I got TF 2.11.0 working and will open a PR at start of February as it needs polishing. So if that is short enough to skip TF 2.10.1 then this can be closed.

Edit2: TF 2.11.0 PR is at #17241

@boegelbot

This comment was marked as outdated.

@Flamefire
Copy link
Contributor Author

Test report by @Flamefire
FAILED
Build succeeded for 2 out of 3 (3 easyconfigs in total)
taurusi8028 - Linux CentOS Linux 7.9.2009, x86_64, AMD EPYC 7352 24-Core Processor (zen2), 8 x NVIDIA NVIDIA A100-SXM4-40GB, 470.57.02, Python 2.7.5
See https://gist.github.com/212c88f429c4ff50eecf7fbc5fbfc7cd for a full test report.

@Flamefire
Copy link
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
taurusml20 - Linux RHEL 7.6, POWER, 8335-GTX (power9le), 6 x NVIDIA Tesla V100-SXM2-32GB, 440.64.00, Python 2.7.5
See https://gist.github.com/6e3205a258932d259e9b9b7768e34df1 for a full test report.

@Flamefire
Copy link
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 3 out of 3 (3 easyconfigs in total)
taurusa12 - Linux CentOS Linux 7.7.1908, x86_64, Intel(R) Xeon(R) CPU E5-2603 v4 @ 1.70GHz (broadwell), 3 x NVIDIA GeForce GTX 1080 Ti, 460.32.03, Python 2.7.5
See https://gist.github.com/64fa1eb28489f488c5402fb393ca3e1b for a full test report.

@boegel boegel added the update label Jan 24, 2023
@boegel boegel added this to the 4.x milestone Jan 24, 2023
@easybuilders easybuilders deleted a comment from boegelbot Jan 24, 2023
@Flamefire Flamefire force-pushed the 20230120145341_new_pr_TensorFlow2101 branch from e01919d to 938ae06 Compare February 10, 2023 11:23
@boegelbot

This comment was marked as outdated.

@boegelbot

This comment was marked as outdated.

@Flamefire Flamefire force-pushed the 20230120145341_new_pr_TensorFlow2101 branch from 7f355ed to 33157ac Compare February 10, 2023 16:35
@smoors smoors mentioned this pull request Mar 8, 2023
4 tasks
@smoors
Copy link
Contributor

smoors commented Mar 14, 2023

@boegelbot: please test @ generoso

@boegelbot
Copy link
Collaborator

@smoors: Request for testing this PR well received on login1

PR test command 'EB_PR=17168 EB_ARGS= EB_CONTAINER= /opt/software/slurm/bin/sbatch --job-name test_PR_17168 --ntasks=4 ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 10470

Test results coming soon (I hope)...

- notification for comment with ID 1468375615 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@branfosj
Copy link
Member

I think the CUDA version will require the changes in easybuilders/easybuild-easyblocks#2854

@smoors
Copy link
Contributor

smoors commented Mar 14, 2023

Test report by @smoors
FAILED
Build succeeded for 5 out of 6 (3 easyconfigs in total)
node406.hydra.os - Linux CentOS Linux 7.9.2009, x86_64, AMD EPYC 7282 16-Core Processor, 1 x NVIDIA NVIDIA A100-PCIE-40GB, 515.48.07, Python 3.6.8
See https://gist.github.com/172b9493193381a875dfa95ac919077a for a full test report.

@smoors
Copy link
Contributor

smoors commented Mar 15, 2023

Test report by @smoors
Using easyblocks from PR(s) easybuilders/easybuild-easyblocks#2854
SUCCESS
Build succeeded for 1 out of 1 (1 easyconfigs in total)
node400.hydra.os - Linux CentOS Linux 7.9.2009, x86_64, AMD EPYC 7282 16-Core Processor, 1 x NVIDIA NVIDIA A100-PCIE-40GB, 515.48.07, Python 3.6.8
See https://gist.github.com/07e83804377fc37e815157ab0b42220f for a full test report.

@branfosj
Copy link
Member

branfosj commented Apr 1, 2023

Test report by @branfosj
FAILED
Build succeeded for 0 out of 1 (1 easyconfigs in total)
bear-pg0105u03a.bear.cluster - Linux RHEL 8.6, x86_64, Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz (icelake), Python 3.6.8
See https://gist.github.com/branfosj/4ac9519ae5c8bad6e1ad707a322212b3 for a full test report.

@Flamefire
Copy link
Contributor Author

@branfosj As we have TF 2.11 and 2.9 do we still want this? I'd like to just skip 2.10 as I don't see any benefit in spending more time on this. However it would be important that #17092 gets merged, so we actually have the fixed 2.9.1

@casparvl casparvl closed this Aug 10, 2023
@casparvl
Copy link
Contributor

Agreed, I'll close this.

@Flamefire Flamefire deleted the 20230120145341_new_pr_TensorFlow2101 branch August 10, 2023 08:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants