
refactor: change default block_size when block_size > max position embeddings #26069

Merged: 2 commits into huggingface:main on Sep 18, 2023
Conversation

@pphuc25 (Contributor) commented on Sep 9, 2023

Hi,
In the original code, everything appears to function correctly when the default block_size is set to 1024. However, I believe this setting may hinder training performance, so I have adjusted the default to max_position_embeddings for the case where the two values don't match.
I would like to cc @sanchit-gandhi to review my PR, thank you so much!

@pphuc25 (Contributor, Author) commented on Sep 12, 2023

Thank you for your helpful information.

@pphuc25 pphuc25 closed this Sep 12, 2023
@pphuc25 pphuc25 deleted the flax_token branch September 12, 2023 18:25
@sanchit-gandhi (Contributor) commented

No worries @pphuc25! Would you like to open a PR to set the block size to min(1024, config.max_position_embeddings)? This will prevent the error when the block size is set greater than max_position_embeddings.
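For concreteness, a minimal sketch of the suggested default is below. The helper `resolve_block_size` and its parameter names are hypothetical illustrations; the actual change lives inline in the Flax language-modeling example script.

```python
def resolve_block_size(requested_block_size, tokenizer_max_length, max_position_embeddings):
    """Hypothetical helper illustrating the suggested default; not the exact
    code merged into the Flax example script."""
    if requested_block_size is None:
        block_size = tokenizer_max_length
        if block_size > max_position_embeddings:
            # Cap the default so it never exceeds the model's
            # position-embedding limit, keeping 1024 as the usual ceiling.
            block_size = min(1024, max_position_embeddings)
        return block_size
    # An explicitly requested block_size is still clamped to the tokenizer's limit.
    return min(requested_block_size, tokenizer_max_length)


# With a model limited to 512 positions, the default becomes 512 rather
# than 1024, avoiding the block_size > max_position_embeddings error:
assert resolve_block_size(None, tokenizer_max_length=2048, max_position_embeddings=512) == 512
assert resolve_block_size(None, tokenizer_max_length=2048, max_position_embeddings=4096) == 2048
```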

@pphuc25 (Contributor, Author) commented on Sep 14, 2023

That's a cool idea, I will do it, thank you @sanchit-gandhi.

@pphuc25 pphuc25 restored the flax_token branch September 14, 2023 15:59
@pphuc25 pphuc25 reopened this Sep 14, 2023
@pphuc25 pphuc25 changed the title from "refactor: change default block_size when not initialize and not match case" to "refactor: change default block_size when block_size > max position embeddings" on Sep 14, 2023
@sanchit-gandhi (Contributor) left a comment

Lovely! Thanks @pphuc25 for the clean PR :)

@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@ArthurZucker (Collaborator) left a comment

Thanks for the catch! 🤗

@sanchit-gandhi sanchit-gandhi merged commit 8b5da9f into huggingface:main Sep 18, 2023
@pphuc25 pphuc25 deleted the flax_token branch September 18, 2023 17:07
This pull request was later referenced by commits in forks:

- MKhalusova pushed a commit to MKhalusova/transformers on Sep 19, 2023
- parambharat pushed a commit to parambharat/transformers on Sep 26, 2023
- blbadger pushed a commit to blbadger/transformers on Nov 8, 2023
- EduardoPach pushed a commit to EduardoPach/transformers on Nov 18, 2023

Each carries the same two commits (huggingface#26069):

* refactor: change default block_size when not initialize
* reformat: add the min of block size
Labels: none yet
Projects: none yet
4 participants