-
Notifications
You must be signed in to change notification settings - Fork 499
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add CFG integration to VLLM #788
base: main
Are you sure you want to change the base?
Conversation
61186b7
to
17aec4d
Compare
17aec4d
to
1ef5c73
Compare
Fixed the failing test! Could you add a simple test here ? |
@rlouf I see that you added the test, I guess it's done? |
Would you mind removing the |
1ef5c73
to
5e03849
Compare
5e03849
to
778b581
Compare
@rlouf Done. |
Ok I'll update the vLLM install so we can run these tests in CI. I'll edit your PR directly so don't touch anything for now! |
I get an error on the CFG test, it generates tokens that are not allowed. We need to determine if it's a problem with the CFGGuide or the integration itself. Can you reproduce this locally?
|
@rlouf Using the debugger, it seems that is something related to the parser. I'm going to have a look at that. |
That's what I feared |
@rlouf After inspecting a little bit, The Concerning the error, the problem comes from the built regex from I could reproduce the same type of error using |
I'm going to fix it, I'll open a draft PR when the time comes. |
Thank you! I'm sorry for lack of responsiveness here, I'm really busy with other things but will have more time soon :) |
No problem! |
I have partially debugged the bug in |
It's not a good idea to return |
IMO it's better to add another method, say Alternatively, we can somehow make |
Hi @miftahmoha , I didnt get you. Could you please expand more on above? Thanks! |
@ekagra-ranjan Sorry for the late answer, there are two mechanisms that are in the works here. First, the one that gives the next possible tokens following the CFG (tokens in the context of a lexer in compilers) which is There is a problem in the second mechanism. |
This test failed, but it's related to the
llama.cpp
integration.Fixes #780.