Support Llama3 8b/70b #256

Merged (2 commits merged into main, Apr 20, 2024)

Conversation

@wanchaol (Contributor):

This PR adds support for Llama3 8b/70b. Mainly it:

  • adds a tiktoken-based tokenizer (tiktokenizer) and instructions to download the tokenizer model; a rough sketch of the tiktoken wiring follows below
  • adds options to the llama model to support Llama3
  • adds Llama3 8b/70b configs
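
For readers new to the tiktoken flow, here is a minimal sketch of how a downloaded Llama3 tokenizer.model could be wrapped with tiktoken. It is an illustration only, not torchtitan's tiktokenizer: the function name, the simplified split pattern, and the reduced special-token set are assumptions; the path matches the 8B config quoted later in this thread.

```python
# Illustrative sketch only -- not torchtitan's tiktokenizer. The pat_str and the
# special-token subset below are placeholders/assumptions.
import tiktoken
from tiktoken.load import load_tiktoken_bpe


def build_llama3_encoding(model_path: str) -> tiktoken.Encoding:
    # Llama3's tokenizer.model is a tiktoken BPE rank file (bytes -> merge rank).
    mergeable_ranks = load_tiktoken_bpe(model_path)
    num_base_tokens = len(mergeable_ranks)
    # Special tokens take ids after the base vocabulary (only a subset shown here).
    special_tokens = {
        "<|begin_of_text|>": num_base_tokens,
        "<|end_of_text|>": num_base_tokens + 1,
    }
    return tiktoken.Encoding(
        name="llama3_sketch",
        pat_str=r"\S+|\s+",  # placeholder split regex, not the real Llama3 pattern
        mergeable_ranks=mergeable_ranks,
        special_tokens=special_tokens,
    )


# Example usage (path taken from the 8B config discussed below):
# enc = build_llama3_encoding("./torchtitan/datasets/tokenizer/original/tokenizer.model")
# ids = enc.encode("hello llama3")
# text = enc.decode(ids)
```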

facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Apr 19, 2024.
wanchaol force-pushed the llama3 branch 3 times, most recently from 7404baf to 1d037e3, on April 19, 2024 at 04:12.
Resolved review threads: README.md (two), requirements.txt.
@lessw2020 (Contributor) left a comment:

Awesome, thanks for adding this! I left a couple of minor grammar nits and one nit on naming, but overall it looks fantastic.

@gnadathur (Contributor) commented on Apr 19, 2024:

Thank you for adding llama3 support so fast.
Nit: to keep testing current with llama3, it would be good to move debug_model.toml to use llama3.

@tianyu-l (Contributor) left a comment:

Looks good! Can't wait to try running them. Had some inline comments.

Resolved review threads: README.md, torchtitan/datasets/tokenizer/base_tokenizer.py, torchtitan/datasets/tokenizer/tiktokenizer.py, torchtitan/models/__init__.py, train_configs/llama3_8b.toml.

Inline comment on train_configs/llama3_8b.toml:
[model]
name = "llama3"
flavor = "8B"
tokenizer_path = "./torchtitan/datasets/tokenizer/original/tokenizer.model"

Putting llama2 tokenizer.model under torchtitan/datasets/tokenizer/ and llama3 tokenizer.model under /torchtitan/datasets/tokenizer/original/ is confusing, especially because they share the same file name. Can we organize them better?
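
Purely to illustrate the suggestion above, one hypothetical layout that avoids the name collision could look like this (the llama2/ and llama3/ folder names are invented here, not something this PR ships):

```
torchtitan/datasets/tokenizer/
├── llama2/tokenizer.model   # sentencepiece model used by the Llama2 configs
└── llama3/tokenizer.model   # tiktoken BPE file used by the Llama3 configs
```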

Inline comment on requirements.txt:
datasets
tomli >= 1.1.0 ; python_version < "3.11"
tensorboard
sentencepiece
tiktoken==0.4.0
Contributor:

I was asked to pip install blobfile the first time tiktoken ran.

@wanchaol (author):

Is that still the case for tiktoken versions > 0.5.2, or do we still have to install blobfile?

Contributor:

That's a good question... I haven't tried.

On mast it requires an additional package, chardet, which was not required on a local devgpu.

@wanchaol (author):

hmmm I guess I'll try to remove the version pin for now
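
As an aside on the blobfile question, a quick local check like the sketch below makes the optional dependency visible before settling on a version pin. The path is assumed to match the config quoted earlier, and whether load_tiktoken_bpe pulls in blobfile depends on the installed tiktoken version.

```python
# Sketch only: load the Llama3 tiktoken BPE file and surface any missing
# optional dependency (e.g. blobfile) as an explicit message. The path is an
# assumption; point it at wherever tokenizer.model was downloaded.
from tiktoken.load import load_tiktoken_bpe

try:
    ranks = load_tiktoken_bpe(
        "./torchtitan/datasets/tokenizer/original/tokenizer.model"
    )
    print(f"loaded {len(ranks)} merge ranks")
except ImportError as err:
    # Some tiktoken versions import file helpers such as blobfile lazily when
    # reading local files, so a missing package only shows up at call time.
    print(f"missing optional dependency: {err}")
```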

A follow-up commit on the branch repeats the PR description in its message and adds: "Have to remove the integration test first; will add it back later once we figure out the details."
@wanchaol (author) commented on Apr 20, 2024:

(Quoting @gnadathur) "Thank you for adding llama3 support so fast. Nit: to keep testing current with llama3, it would be good to move debug_model.toml to use llama3."

@gnadathur I had to stop running debug_model.toml in the integration tests because of tokenizer.model, in order to support llama3. I'll try to add it back to the tests next week once we have clarified how to fetch it dynamically.

wanchaol merged commit 35470ca into main on Apr 20, 2024; 4 checks passed.
wanchaol deleted the llama3 branch on April 20, 2024 at 05:26.
tianyu-l pushed a commit to tianyu-l/torchtitan_intern24 that referenced this pull request Aug 16, 2024
philippguevorguian pushed a commit to YerevaNN/YNNtitan that referenced this pull request Aug 17, 2024