Add llama3.np / llama3.cuda port to README.md #525

likejazz · 2024-06-06T01:37:53Z

No description provided.

jameswdelancey · 2024-06-06T06:47:34Z

These appear to be llama2 ports vs llama3. They need to at least use tiktoken start tokens to work with the base, not 0 like sentencepiece, and if they work with tinystories, i suspect they are using sentencepiece tokenizer. Chat/instruct require a stack of tokens to start dialogue else you will be unimpressed with the output. There is a predictable output at zero temperature for both base/pretrained and chat/inspect that you can use to validate. You can copy this diff for the meat of what you need: jameswdelancey@7815cd3#diff-8935a7a088435e2ddf7315451f07fae16810932fb3a0a5d706a2eead1618af26R850-R854

Add llama3.np / llama3.cuda port to README.md

a84534a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add llama3.np / llama3.cuda port to README.md #525

Add llama3.np / llama3.cuda port to README.md #525

likejazz commented Jun 6, 2024

jameswdelancey commented Jun 6, 2024

Add llama3.np / llama3.cuda port to README.md #525

Are you sure you want to change the base?

Add llama3.np / llama3.cuda port to README.md #525

Conversation

likejazz commented Jun 6, 2024

jameswdelancey commented Jun 6, 2024