Experiment Execution Challenge with RoFormer Model #38

Open
FedeVinc opened this issue Jan 13, 2025 · 1 comment

Comments

@FedeVinc

Hi,

I’ve been working with the RoFormer model recently and I'm specifically trying to train it using the Brassicales dataset. My goal is to compare its performance with a ConvNet model by running experiments on a GPU (configured using an sbatch file).

While everything works seamlessly with the ConvNet model, I’m encountering issues with RoFormer. Strangely, the debug phase runs without problems, but the training process fails when using the GPU.

I’ve attached the error details for reference. Have you had a chance to test the RoFormer model on a GPU, and if so, could you share any insights or configurations that worked for you?

I’m happy to provide additional details if needed. Thanks in advance for your help!

Best regards
[Attachment: Problem]

@gonzalobenegas
Collaborator

Hello, I'm sorry you are having issues. Here's a Colab notebook that trains a small RoFormer for a few steps:
https://colab.research.google.com/drive/1c3EHJG19zMmVpPYyLBH7OTRXciHCEd3A?usp=sharing
Does that work for you?

I can also try to help debug on your system. One note: I just updated the main branch of the repo with a small fix needed to run with the latest version of the transformers library.
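
Since the fix relates to the installed transformers version, it may help to check which versions are present in the environment the sbatch job actually uses, as that can differ from the one used for the debug run. A minimal sketch, assuming the relevant packages are `transformers` and `torch` (the `torch` entry and the `report_versions` helper are illustrative assumptions, not part of the repo):

```python
from importlib import metadata


def report_versions(packages=("transformers", "torch")):
    """Return {package: installed version or None} without importing the packages."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # not installed in this environment
    return versions


if __name__ == "__main__":
    # Print one line per package so the output can be pasted into the issue.
    for name, version in report_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Running this once interactively and once from inside the sbatch script would show whether the two environments differ.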
