Add ORTModule distributed CI pipeline #6278
baijumeswani merged 7 commits into thiagofc/ortmodule-api
Conversation
801e128 to d68ab42
thiagocrepaldi left a comment
Changming has a better view on where to add new dependencies to the repo. DeepSpeed is an important one for testing, but it is not really necessary for the final package or Docker image.
I don't know if your change will affect any training Docker images in MCR. If it does, please provide a list of the packages that DeepSpeed depends on. You can start a new Docker container based on ubuntu:18.04, run the pip install, and then use "pip list" to get the list.
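One way to turn the before/after "pip list" comparison suggested above into a repeatable check is sketched below. It is only an illustration, not part of this PR: the helper names (`pip_snapshot`, `parse_freeze`, `new_packages`) are hypothetical, and it assumes the snapshots are taken with `pip list --format=freeze` from the same interpreter inside the container.

```python
import subprocess
import sys


def pip_snapshot():
    """Return the `pip list --format=freeze` output for the current interpreter."""
    result = subprocess.run(
        [sys.executable, "-m", "pip", "list", "--format=freeze"],
        check=True,
        stdout=subprocess.PIPE,
        universal_newlines=True,  # text mode; works on Python 3.6 as well
    )
    return result.stdout


def parse_freeze(text):
    """Parse `name==version` lines into a dict keyed by lower-cased name."""
    packages = {}
    for line in text.splitlines():
        line = line.strip()
        if "==" not in line:
            continue  # skip blank lines and anything not in freeze format
        name, version = line.split("==", 1)
        packages[name.lower()] = version
    return packages


def new_packages(before, after):
    """Return the packages present in `after` but absent from `before`."""
    old = parse_freeze(before)
    new = parse_freeze(after)
    return {name: ver for name, ver in new.items() if name not in old}
```

A possible workflow, assuming a fresh container: call `pip_snapshot()` once, install the test dependency, call it again, and pass both strings to `new_packages` to see what the install pulled in. Packages that were already present but changed version are deliberately not reported by this sketch.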
d68ab42 to a63a0de
This dependency is only for testing (and should only be part of the CI Docker images). I am hoping that installing
6d84559 to 9ff70df
c59b14e to 0b8ec74
orttraining/orttraining/test/python/how_to_install_training_test_dependencies.md
...uild/github/azure-pipelines/orttraining-linux-gpu-ortmodule-distributed-test-ci-pipeline.yml
Thanks.
* Add ortmodule distributed ci pipeline
Description: Adds infrastructure for the ORTModule distributed CI pipeline.