[New Model]: Support Nemotron-4-340B #5722

dskhudia · 2024-06-20T20:02:36Z

🚀 The feature, motivation and pitch

Benchmarks on the Nvidia's latest nemotron model look great. Is there plan or already going work to support it?

Alternatives

No response

Additional context

https://huggingface.co/nvidia/Nemotron-4-340B-Instruct

mgoin · 2024-06-20T21:08:06Z

Once there is support in HF transformers, then it should be relatively straightforward to port into vLLM. It seems there aren't any efforts from searching the transformers issues/PRs

riverind · 2024-06-27T07:08:17Z

attention

natolambert · 2024-07-20T19:11:52Z

I started a paid bounty to close these issues. Already over $200 of support.
https://x.com/natolambert/status/1814735390877884823

dskhudia added the feature request label Jun 20, 2024

WoosukKwon added new model Requests to new models and removed feature request labels Jun 21, 2024

DarkLight1337 changed the title ~~[Feature]: Support Nemotron-4-340B~~ [Model]: Support Nemotron-4-340B Jun 27, 2024

DarkLight1337 changed the title ~~[Model]: Support Nemotron-4-340B~~ [New Model]: Support Nemotron-4-340B Jun 27, 2024

mgoin mentioned this issue Jul 21, 2024

[Model] Support Nemotron models (Nemotron-3, Nemotron-4, Minitron) #6611

Merged

mgoin closed this as completed in #6611 Jul 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[New Model]: Support Nemotron-4-340B #5722

[New Model]: Support Nemotron-4-340B #5722

dskhudia commented Jun 20, 2024

mgoin commented Jun 20, 2024

riverind commented Jun 27, 2024

natolambert commented Jul 20, 2024

[New Model]: Support Nemotron-4-340B #5722

[New Model]: Support Nemotron-4-340B #5722

Comments

dskhudia commented Jun 20, 2024

🚀 The feature, motivation and pitch

Alternatives

Additional context

mgoin commented Jun 20, 2024

riverind commented Jun 27, 2024

natolambert commented Jul 20, 2024