-
Notifications
You must be signed in to change notification settings - Fork 910
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[WIP] ngram spec #2886
base: main
Are you sure you want to change the base?
[WIP] ngram spec #2886
Conversation
Could you please share some performance results? |
|
I think it actually implemented a more general approach, this PR only support single branch strategy mentioned in the other PR. Ill close this one |
@XiaotongJiang The other branch implemented lookahead decoding, yours is more close to tok-1 proposer n-gram prompt lookup decoding. |
mvp passing unit test with torch backend