You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We moved them to M1 instances 2 days ago and things were fine in the PR but these jobs have been extremely flaky since then. E.g. the current failures on b0f9f7b
Conda build: 3.9 is passing but all other Python versions are failing on import torchvision.
import torchvision
File "/Users/ec2-user/runner/_work/_temp/conda_environment_9300373488/lib/python3.8/site-packages/torchvision/__init__.py", line 6, in <module>
from torchvision import _meta_registrations, datasets, io, models, ops, transforms, utils
File "/Users/ec2-user/runner/_work/_temp/conda_environment_9300373488/lib/python3.8/site-packages/torchvision/_meta_registrations.py", line 164, in <module>
def meta_nms(dets, scores, iou_threshold):
File "/Users/ec2-user/runner/_work/_temp/conda_environment_9300373488/lib/python3.8/site-packages/torch/library.py", line 639, in register
use_lib._register_fake(op_name, func, _stacklevel=stacklevel + 1)
File "/Users/ec2-user/runner/_work/_temp/conda_environment_9300373488/lib/python3.8/site-packages/torch/library.py", line 139, in _register_fake
handle = entry.abstract_impl.register(func_to_register, source)
File "/Users/ec2-user/runner/_work/_temp/conda_environment_9300373488/lib/python3.8/site-packages/torch/_library/abstract_impl.py", line 30, in register
if torch._C._dispatch_has_kernel_for_dispatch_key(self.qualname, "Meta"):
RuntimeError: operator torchvision::nms does not exist
This is probably related to these compilation warnings we're observing for every single op?
ld: warning: object file (/Users/ec2-user/runner/_work/vision/vision/pytorch/vision/build/temp.macosx-11.1-arm64-cpython-311/Users/ec2-user/runner/_work/vision/vision/pytorch/vision/torchvision/csrc/ops/autocast/nms_kernel.o) was built for newer macOS version (13.0) than being linked (11.1)
It's clear that the installed torch version is for MacOS 11 so if torchvision is being compiled on MacOS 13 this might be the cause?
Thanks a lot for looking into this @malfet and @huydhn , I was about to ping you both as I saw some related changes in the PyTorch repo from last week. I've opened #8478 to keep track of the MPS issue and with more details (I'll close this one if you don't mind as it contains a bit of noise from my investigations above).
We moved them to M1 instances 2 days ago and things were fine in the PR but these jobs have been extremely flaky since then. E.g. the current failures on b0f9f7b
import torchvision
.Maybe #8407 can help? EDIT: NOPE.
This is probably related to these compilation warnings we're observing for every single op?
It's clear that the installed torch version is for MacOS 11 so if torchvision is being compiled on MacOS 13 this might be the cause?
EDIT: NOPE. https://github.com/pytorch/vision/actions/runs/9288139810/job/25565749027?pr=8452 from yesterday has the exact same warnings but it runs fine.
The text was updated successfully, but these errors were encountered: