QoL: Default to using fast tokenizer for Llama models #3625

arnavgarg1 · 2023-09-17T15:41:18Z

Quality of life improvement - we used to set use_fast to False since the tokenizer was not natively written so it would take 2-4 minutes to load, however in newer transformers versions there is first class support so we no longer have to do this.

github-actions · 2023-09-17T16:21:58Z

Unit Test Results

  4 files ±0   4 suites ±0 31m 57s ⏱️ + 1m 59s
31 tests ±0 26 ✔️ ±0   5 💤 ±0 0 ❌ ±0
62 runs ±0 52 ✔️ ±0 10 💤 ±0 0 ❌ ±0

Results for commit 14b3534. ± Comparison against base commit 42723e3.

justinxzhao

Thanks for following up on this! I was wondering about this and why setting use_fast=True didn't seem faster.

Default to using fast tokenizer for Llama models

14b3534

arnavgarg1 requested review from tgaddair, justinxzhao and jeffkinnison September 17, 2023 15:51

justinxzhao approved these changes Sep 18, 2023

View reviewed changes

Fix failing test

bf90b24

arnavgarg1 merged commit 69c2c0b into master Sep 18, 2023

arnavgarg1 deleted the llama-tokenizer branch September 18, 2023 17:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QoL: Default to using fast tokenizer for Llama models #3625

QoL: Default to using fast tokenizer for Llama models #3625

arnavgarg1 commented Sep 17, 2023

github-actions bot commented Sep 17, 2023

justinxzhao left a comment

QoL: Default to using fast tokenizer for Llama models #3625

QoL: Default to using fast tokenizer for Llama models #3625

Conversation

arnavgarg1 commented Sep 17, 2023

github-actions bot commented Sep 17, 2023

Unit Test Results

justinxzhao left a comment

Choose a reason for hiding this comment