You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've been trying to fine-tune a mask2former model for a binary segmentation task. However every time I start the training the model starts only predicting null-tensors after a few epochs.
I first thought that I was getting unlucky and the majority of images that RandomCrop gave for training were completely empty. However after implementing code that guarantees that my object is always on screen the model still outputs null-tensors after a few epochs.
I also implemented low-rank adaptation to account for catastrophic forgetting but that didn't help either.
At this point I am convinced that it has to do something with the way I am initializing my image processor, model or LoraConfig so I'd appreciate some help here.
This is the initialization of the image_processor:
Hello community,
I've been trying to fine-tune a mask2former model for a binary segmentation task. However every time I start the training the model starts only predicting null-tensors after a few epochs.
I first thought that I was getting unlucky and the majority of images that RandomCrop gave for training were completely empty. However after implementing code that guarantees that my object is always on screen the model still outputs null-tensors after a few epochs.
I also implemented low-rank adaptation to account for catastrophic forgetting but that didn't help either.
At this point I am convinced that it has to do something with the way I am initializing my image processor, model or LoraConfig so I'd appreciate some help here.
This is the initialization of the image_processor:
This is my model initialization:
And lastly this is the initialization of LoraConfig and everything related to that:
Any insight would be appreciated since I am at my wit's end.
The text was updated successfully, but these errors were encountered: