✨ Text generation input inference data models #151

tharapalanivel · 2023-08-28T22:32:35Z

Supports #140

Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>

caikit_nlp/data_model/generation.py

Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>

alex-jw-brooks · 2023-08-29T17:01:33Z

caikit_nlp/data_model/generation.py

+    top_k: int
+    top_p: int
+    typical_p: float
+    seed: Optional[int]


It might be a good idea to set some default! TGIS defaults are here.

Most of the time this doesn't matter, because 0 temperature (in the IBM fork) indicates greedy decoding, so top_k, top_p, typical_p, etc won't be used, as they're sampling only.

TGI doesn't use temperature 0 as a toggle though, so it would be also be nice in case those APIs are ever more unified - currently there are some small divergences with stuff like prompt IDs. I'm not sure if our raw generation modules are compatible with it or not

Haven't seen us setting defaults on the data models themselves, only in the inference methods. I don't really have a strong opinion on this, trying to understand if that is the general direction caikit is moving in

I think even if we set defaults on the DM, they won't propagate to proto, so the default here would be guided by the .run function themselves.

Good point - my main concern with leaving it up to run is that it's easy for defaults to get out of sync if we have multiple modules relying on them.

I guess an alternate is to either have a building for getting these objects with their default values that make sense, or to have consts be passed to the run function 🤔 is the intent with this type to have a parameter that is this DM object type, or to take primitives and build this object in the requests?

gkumbhat · 2023-09-01T13:57:22Z

We decided to go with flattened API for now. This may be revisited, so we are keeping this PR open

✨ Text generation input inference data models

bfaf8f9

Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>

tharapalanivel requested a review from gkumbhat August 28, 2023 22:32

tharapalanivel requested review from alex-jw-brooks, evaline-ju and gabe-l-hart as code owners August 28, 2023 22:32

gkumbhat requested changes Aug 28, 2023

View reviewed changes

caikit_nlp/data_model/generation.py Outdated Show resolved Hide resolved

🐛 Renaming params

844fd27

Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>

alex-jw-brooks requested changes Aug 29, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ Text generation input inference data models #151

✨ Text generation input inference data models #151

tharapalanivel commented Aug 28, 2023

alex-jw-brooks Aug 29, 2023 •

edited

Loading

tharapalanivel Aug 29, 2023

gkumbhat Aug 29, 2023

alex-jw-brooks Aug 29, 2023

gkumbhat commented Sep 1, 2023

✨ Text generation input inference data models #151

Are you sure you want to change the base?

✨ Text generation input inference data models #151

Conversation

tharapalanivel commented Aug 28, 2023

alex-jw-brooks Aug 29, 2023 • edited Loading

Choose a reason for hiding this comment

tharapalanivel Aug 29, 2023

Choose a reason for hiding this comment

gkumbhat Aug 29, 2023

Choose a reason for hiding this comment

alex-jw-brooks Aug 29, 2023

Choose a reason for hiding this comment

gkumbhat commented Sep 1, 2023

alex-jw-brooks Aug 29, 2023 •

edited

Loading