Skip to content

Conversation

@suquark
Copy link
Contributor

@suquark suquark commented Apr 16, 2023

Bug fixed and ready to run version

@zhuohan123
Copy link
Member

Close this PR since it's too diverged from the current main.

@zhuohan123 zhuohan123 closed this Jun 17, 2023
@zhuohan123 zhuohan123 deleted the prefix_siyuan branch June 18, 2023 07:30
@huangtingwei9988
Copy link

@suquark hi,Will it work properly when you complete this PR?

fxmarty pushed a commit to fxmarty/vllm-public that referenced this pull request Jun 12, 2024
joerunde added a commit to joerunde/vllm that referenced this pull request Jun 17, 2024
This includes some fixes for supporting vllm 0.4.3+.

Mostly the `generate` api changed, so we have to update our grpc server
accordingly

---------

Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
yukavio pushed a commit to yukavio/vllm that referenced this pull request Jul 3, 2024
@alixiaodi alixiaodi mentioned this pull request Aug 2, 2024
bigPYJ1151 added a commit to bigPYJ1151/vllm that referenced this pull request Aug 8, 2024
* fix rope

* add warming-up

* add shm gather
heheda12345 pushed a commit to heheda12345/vllm that referenced this pull request Sep 29, 2025
* Squashed commit of lwilkinson/decode-only changes relative to origin/dev

Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

* update FlashMLA

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

* fix non-spec error

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>

---------

Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: Matthew Bonanni <mbonanni@redhat.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants