use logger.warning_once to avoid massive outputs #27428
Conversation
Thanks for updating - this is a great suggestion!
Would you mind extending this to our other models in the library?
Sure! Happy to do that. If I understand correctly, this will only apply to …
@ranchlai Awesome - thanks! Yep, that's right. Feel free to ping with any questions if you're not sure about any of them.
@@ -264,7 +264,9 @@ def __init__(self, config, layer_id=0):
         try:
             load_wkv_cuda_kernel(config.context_length)
         except Exception:
-            logger.info("Could not load the custom CUDA kernel for RWKV attention.")
+            logger.warning_once(
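(For context, the hunk above swaps the per-layer logger.info fallback in the RWKV attention __init__ for logger.warning_once. The new call is truncated in the diff view; the sketch below completes it by reusing the same message string, so it is an assumption about the final code rather than a verbatim quote. load_wkv_cuda_kernel, config, and logger are the names already used in the hunk.)

```python
try:
    load_wkv_cuda_kernel(config.context_length)
except Exception:
    # With warning_once the fallback notice is emitted once per process,
    # instead of once per RWKV attention layer that fails to build the kernel.
    logger.warning_once("Could not load the custom CUDA kernel for RWKV attention.")
```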
@amyeroberts Not quite sure about this. This line is in the init of attention layers.
In this case, I'd say let's keep it as info, as it's really just the exception message, so we want it to be unaffected by the cache.
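(For readers following the thread: the cache referred to here is the memoization behind warning_once. Below is a minimal sketch of how a warning_once-style helper can be built with functools.lru_cache; transformers' own helper in utils/logging.py works along these lines, but the exact implementation and signature here are assumptions, not quotes from the library.)

```python
import functools
import logging

logger = logging.getLogger(__name__)


@functools.lru_cache(maxsize=None)
def warning_once(message: str) -> None:
    # Log `message` at WARNING level only the first time it is seen.
    # Repeated calls with the same string hit the lru_cache entry and return
    # immediately, which is why a message routed through this helper is
    # "affected by the cache" and stops repeating.
    logger.warning(message)


# First call logs; identical follow-up calls are swallowed by the cache.
warning_once("Could not load the custom CUDA kernel for RWKV attention.")
warning_once("Could not load the custom CUDA kernel for RWKV attention.")
```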
OK, will discard this commit
Force-pushed from 5e4076c to d223171
Thanks for adding all of these!
* use logger.warning_once to avoid massive outputs when training/finetuning longformer
* update more
What does this PR do?
This is a quick fix to avoid massive outputs when training/finetuning Longformer (for text classification) by using logger.warning_once rather than logger.info.
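To illustrate the behavioral difference, here is a hedged sketch of the kind of call site this PR touches. The helper name _pad_to_window_size and the message text only mirror Longformer's padding notice; the real call sites in modeling_longformer.py may differ.

```python
from transformers.utils import logging

logger = logging.get_logger(__name__)


def _pad_to_window_size(input_ids, attention_window: int):
    # Hypothetical stand-in for Longformer's padding step. During training it
    # runs on every batch, so a plain logger.info here floods the console;
    # logger.warning_once caches the formatted message and emits it only once.
    padding_len = (-input_ids.shape[-1]) % attention_window
    if padding_len > 0:
        logger.warning_once(
            f"Input ids are automatically padded to be a multiple of "
            f"`config.attention_window`: {attention_window}"
        )
    return input_ids  # the actual padding is omitted from this sketch
```

Called on every training step, this prints the padding notice a single time per process instead of once per batch.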