Add streaming inference & fix stopping at EOS #180
Merged
What's new?
When testing inference with the Falcon config file, the tokenizer's EOS token is `<|endoftext|>`, but `do_inference()` was previously hard-coding `</s>`. Because that token never matched, generation wouldn't stop early and always ran through the full 1024 `max_new_tokens`. This PR reads the EOS token from the tokenizer instead, and adds streaming so tokens are printed as they are generated.

Demo: https://www.loom.com/share/03f8c8903b42484ea7aad72ab07a6340
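A minimal sketch of the idea (the model name and variable names here are placeholders, not the exact `do_inference()` code in this repo): take `eos_token_id` from the tokenizer rather than a hard-coded string, and use `transformers.TextStreamer` to stream output.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, TextStreamer

# Falcon's tokenizer uses <|endoftext|> as its EOS token.
tokenizer = AutoTokenizer.from_pretrained("tiiuae/falcon-7b")
model = AutoModelForCausalLM.from_pretrained("tiiuae/falcon-7b")

prompt = "Write a haiku about streaming inference."
inputs = tokenizer(prompt, return_tensors="pt")

# TextStreamer prints tokens to stdout as soon as they are decoded.
streamer = TextStreamer(tokenizer, skip_prompt=True)

model.generate(
    **inputs,
    max_new_tokens=1024,
    # Stop at the tokenizer's own EOS instead of a hard-coded "</s>",
    # so Falcon's <|endoftext|> actually ends generation early.
    eos_token_id=tokenizer.eos_token_id,
    streamer=streamer,
)
```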