Add Transformers logits manipulators #241

0xymoro · 2023-11-02T01:59:52Z

Hi - really interesting work. We're currently using HF TGI in production and exploring using this instead, are there plans to add things like typical_p that transformers supports? Would greatly ease the transition. Thanks!

0xymoro · 2023-11-02T19:35:36Z

In particular typical p in production environments (our 300k users) has proved to create significantly more natural sequences. The Python code is at line 456 of https://github.com/huggingface/transformers/blob/main/src/transformers/generation/logits_process.py and it is a pretty simple entropy calculation & filtering out the high entropy (unpredictable/off the rails) and low entropy (boring and contributing nothing new) tokens.

I see the sampling is done at a much lower level here and it's pretty different but please let me know if I can help in making some PR. I'm not familiar with cuda programming as I am with python but happy to help if there's any way.

juney-nvidia · 2023-11-05T04:37:03Z

@jerryMeng100

Thanks for sharing the idea.

For sure it is more than welcome for you to make contribution to TensorRT-LLM to add the typical P support.
Currently, the community contribution process is(and the process may be iterated and improved based on the concrete feedback we receive):

Community members prepare the MR and do the validation in their local environment.
When the MR is ready, they can ping us for code review(like this one and this one). Dedicated NVIDIA engineers will be assigned to work with the community contributor to merge his or her MR into our internal repo and go through all the internal validation process.
When it is internally validated okay, the community contributed code will be incorporated as part of the next release(either to the main branch or the new release branch) commit(like this one), with explicitly acknowledging the community member's name, also in the release commit, the community member will be mentioned as the co-author. Thus to ensure the community contribution can be respected and acknowledged suitably.

Pls let us know whether it makes sense to you.

Thanks
June

nv-guomingz · 2024-11-18T14:37:23Z

@laikhtewari

jdemouth-nvidia assigned ncomly-nvidia Nov 2, 2023

jdemouth-nvidia added the feature request New feature or request label Nov 2, 2023

juney-nvidia self-assigned this Nov 5, 2023

juney-nvidia added Community want to contribute Sampling labels Nov 5, 2023

ncomly-nvidia added the triaged Issue has been triaged by maintainers label Nov 6, 2023

ncomly-nvidia mentioned this issue Dec 11, 2023

TensorRT-LLM Requests #632

Open

41 tasks

nv-guomingz assigned laikhtewari Nov 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Transformers logits manipulators #241

Add Transformers logits manipulators #241

0xymoro commented Nov 2, 2023

0xymoro commented Nov 2, 2023 •

edited

Loading

juney-nvidia commented Nov 5, 2023

nv-guomingz commented Nov 18, 2024

Add Transformers logits manipulators #241

Add Transformers logits manipulators #241

Comments

0xymoro commented Nov 2, 2023

0xymoro commented Nov 2, 2023 • edited Loading

juney-nvidia commented Nov 5, 2023

nv-guomingz commented Nov 18, 2024

0xymoro commented Nov 2, 2023 •

edited

Loading