
How do I resolve this: Token indices sequence length is longer than the specified maximum sequence length for this model (781 > 512)? #62

Open
starxuh opened this issue Jun 13, 2024 · 1 comment

Comments

@starxuh

starxuh commented Jun 13, 2024

I'm seeing the following warnings:
Token indices sequence length is longer than the specified maximum sequence length for this model (781 > 512). Running this sequence through the model will result in indexing errors
You're using a XLMRobertaTokenizerFast tokenizer. Please note that with a fast tokenizer, using the __call__ method is faster than using a method to encode the text followed by a call to the pad method to get a padded encoding.
How can this be resolved? Is it the model's token limit?

@shenlei1020
Collaborator

It's fine and doesn't affect the results. This is only a warning, and the BCEmbedding Python package already handles it internally.
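For reference, a minimal stdlib-only sketch of one common way embedding packages handle over-length inputs (splitting the token sequence into chunks that fit the model limit, then pooling the per-chunk embeddings). This is an illustration, not the actual BCEmbedding internals; the helper name `chunk_token_ids` is hypothetical.

```python
def chunk_token_ids(token_ids, max_length=512, overlap=0):
    """Split a token-id sequence into chunks no longer than max_length.

    Hypothetical helper illustrating why the 781 > 512 warning is
    harmless when the package chunks inputs: each chunk respects the
    model limit, and the per-chunk embeddings can be pooled afterwards.
    """
    if max_length <= overlap:
        raise ValueError("max_length must exceed overlap")
    step = max_length - overlap
    return [token_ids[i:i + max_length] for i in range(0, len(token_ids), step)]

# Example: a 781-token sequence becomes two chunks, each <= 512 tokens.
ids = list(range(781))
chunks = chunk_token_ids(ids, max_length=512)
print([len(c) for c in chunks])  # [512, 269]
```

If you call a Hugging Face tokenizer directly, passing `truncation=True, max_length=512` to its `__call__` method also keeps the sequence within the model limit and silences the warning, at the cost of dropping tokens past position 512.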
