
CUDA Error during CharacterEmbeddings #421

Closed
lisette-garciamoya opened this issue Jan 24, 2019 · 9 comments · Fixed by #434
Labels
bug Something isn't working

Comments

@lisette-garciamoya

My code

from flair.data import Sentence
from flair.embeddings import CharacterEmbeddings

sentence = Sentence('La casa es muy bonita.', use_tokenizer=True)

embedding = CharacterEmbeddings()
embedding.embed(sentence)

for token in sentence:
    print(token)
    print(token.embedding)

Errors:
(screenshot of the error attached)

Environment:

  • OS: Ubuntu 18.04.1
  • Version: code from master branch
  • Nvidia: (nvidia-smi screenshot attached)
@lisette-garciamoya added the bug label on Jan 24, 2019
@stefan-it
Member

stefan-it commented Jan 24, 2019

Thanks for reporting :) I think this could be fixed by:

Current code:

# chars for rnn processing
chars = torch.LongTensor(tokens_mask)
chars = chars.to(flair.device)

character_embeddings = self.char_embedding(chars).transpose(0, 1)

Fix:

# chars for rnn processing
chars = torch.LongTensor(tokens_mask)
chars = chars.to(flair.device)
chars = chars.detach().cpu()   # <-- added

character_embeddings = self.char_embedding(chars).transpose(0, 1)

@lisette-garciamoya
Author

Thank you. It solves the problem.

@alanakbik
Collaborator

Could you also try this fix?

# chars for rnn processing
chars = torch.LongTensor(tokens_mask)
chars = chars.cpu()   # <-- added (don't detach!)

character_embeddings = self.char_embedding(chars).transpose(0, 1)

I think that if you detach the tensor, gradients cannot flow into the character model during training, so the character features are never trained, i.e. they would stay random.

If you do not detach, as in the code above, training will be slower, but the character features are always trained on the downstream task, as proposed by Lample et al., 2016.
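The difference can be seen in a tiny torch-only sketch (the `Embedding` layer here just stands in for the character embedding; this is not flair code):

```python
import torch

# A tiny embedding layer standing in for the character embedding.
emb = torch.nn.Embedding(10, 4)
idx = torch.LongTensor([1, 2, 3])

# Without detach: backward() propagates gradients into the embedding
# weights, so the character features get trained on the downstream task.
emb(idx).sum().backward()
print(emb.weight.grad is not None)  # True

# With detach: the autograd graph is cut, so the weights would never
# receive gradients and the character features would stay random.
detached = emb(idx).detach()
print(detached.requires_grad)  # False
```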

@lisette-garciamoya
Author

It seemed to be fixed but now I get the same error while executing the real program.

(I tested it with .detach() and without .detach() -> same error)

@JieyuZhao

I think the same error occurred when I tried the BERT tutorial example.
It is also about "RuntimeError: Expected object of backend CPU but got backend CUDA for argument #3 'index'".

@alanakbik
Collaborator

Hello @lisette-garciamoya, what do you mean by "executing the real program"?

@alanakbik
Collaborator

Hi @lisette-garciamoya - I was able to understand where the error is coming from. In fact, the original code of the CharacterEmbeddings class is correct.

However, when you instantiate the CharacterEmbeddings, by default it is only instantiated on CPU. The ModelTrainer then puts it on GPU which is why the training works. But if you instantiate the CharacterEmbeddings yourself, it is only on CPU even if you are on a GPU machine, which causes the error.
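In plain torch terms (a minimal sketch, not flair code), the module's parameters live on whatever device the module was moved to, and the lookup indices must be on the same device:

```python
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

emb = torch.nn.Embedding(10, 4)  # parameters start on CPU by default
emb = emb.to(device)             # what ModelTrainer effectively does

# The index tensor must live on the same device as the weights; mixing
# a CPU-side embedding with CUDA indices (or vice versa) produces the
# "Expected object of backend CPU but got backend CUDA" RuntimeError.
idx = torch.LongTensor([1, 2, 3]).to(device)
out = emb(idx)
print(out.device == emb.weight.device)  # True
```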

For now, the simplest fix is to do this:

from flair.data import Sentence
from flair.embeddings import CharacterEmbeddings

sentence = Sentence('La casa es muy bonita.', use_tokenizer=True)

embedding = CharacterEmbeddings()
embedding = embedding.cuda()  # add this line to put the embeddings on CUDA
embedding.embed(sentence)

for token in sentence:
    print(token)
    print(token.embedding)

Could you test if this works for you?

We will also set up a PR that fixes this behavior. The default should be that embeddings are instantiated on CUDA if it is available.
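One way to get that default (a hypothetical sketch of the pattern, not the actual PR code; the class and dimensions are illustrative) is to pick the device in the constructor:

```python
import torch

class MyCharEmbeddings(torch.nn.Module):  # hypothetical class, illustrative only
    def __init__(self):
        super().__init__()
        self.char_embedding = torch.nn.Embedding(100, 25)
        # instantiate on CUDA when available, CPU otherwise
        self.to(torch.device("cuda" if torch.cuda.is_available() else "cpu"))

m = MyCharEmbeddings()
expected = "cuda" if torch.cuda.is_available() else "cpu"
print(m.char_embedding.weight.device.type == expected)  # True
```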

Thanks for finding this error and reporting it!

@alanakbik
Collaborator

Should be fixed by the latest PR. Feel free to reopen if there are still issues!

Thanks again for reporting the error!

@PaulZhangIsing

> [quotes @alanakbik's comment above in full]

For me, I used:

from flair.data import Sentence
from flair.embeddings import BertEmbeddings, FlairEmbeddings

# init embedding
flair_embedding_forward = FlairEmbeddings('news-forward')
bert_embedding = BertEmbeddings('bert-large-cased').cuda()

# create a sentence
sentence = Sentence('The grass is green .')

# embed words in sentence
x = bert_embedding.embed(sentence)

for token in sentence:
    print(token)
    print(token.embedding)

It works.
