✨ Text generation input inference data models #151
tharapalanivel wants to merge 2 commits into caikit:main
Conversation
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
Signed-off-by: Thara Palanivel <130496890+tharapalanivel@users.noreply.github.com>
top_k: int
top_p: int
typical_p: float
seed: Optional[int]
It might be a good idea to set some default! TGIS defaults are here.
Most of the time this doesn't matter, because a temperature of 0 (in the IBM fork) indicates greedy decoding, so top_k, top_p, typical_p, etc. won't be used; they apply to sampling only.
TGI doesn't use temperature 0 as a toggle though, so defaults would also be nice in case those APIs are ever more unified - currently there are some small divergences around things like prompt IDs. I'm not sure whether our raw generation modules are compatible with it or not.
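For illustration, setting defaults directly on the data model could look like the sketch below. This is a plain-dataclass approximation, not caikit's actual data model machinery, and the default values shown are the common "sampling disabled" conventions rather than confirmed TGIS values; top_p and typical_p are typed as float here since they are probabilities.

```python
# Illustrative sketch only: a plain dataclass standing in for the DM above.
# Defaults are conventional "disable this filter" values, not confirmed
# TGIS defaults.
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextGenParams:
    top_k: int = 0              # 0 conventionally disables top-k sampling
    top_p: float = 1.0          # 1.0 disables nucleus (top-p) sampling
    typical_p: float = 1.0      # 1.0 disables typical-decoding filtering
    seed: Optional[int] = None  # None -> non-reproducible sampling
```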
Haven't seen us setting defaults on the data models themselves, only in the inference methods. I don't have a strong opinion on this; just trying to understand whether that is the general direction caikit is moving in.
I think even if we set defaults on the DM, they won't propagate to the proto, so the defaults here would be guided by the .run functions themselves.
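For context, this is the pattern being described: defaults carried by each module's .run signature rather than by the data model. A minimal hypothetical sketch (class and parameter names are illustrative, not caikit's API):

```python
from typing import Optional


class MyTextGenModule:
    """Hypothetical module; parameter names mirror the DM fields above."""

    def run(
        self,
        text: str,
        top_k: int = 0,
        top_p: float = 1.0,
        typical_p: float = 1.0,
        seed: Optional[int] = None,
    ):
        # Each module repeats these keyword defaults, which is the
        # out-of-sync risk raised in the next comment.
        ...
```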
Good point - my main concern with leaving it up to .run is that it's easy for defaults to get out of sync if multiple modules rely on them.
I guess an alternative is either to have a builder for constructing these objects with sensible default values, or to have shared constants passed to the .run function 🤔 Is the intent with this type to have a parameter that is this DM object type, or to take primitives and build this object in the requests?
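A hedged sketch of the two alternatives mentioned above: shared constants as a single source of truth for defaults, plus a builder classmethod that applies them. All names here are illustrative, not actual caikit identifiers.

```python
from dataclasses import dataclass
from typing import Optional

# Single source of truth for defaults, importable by every module's .run
DEFAULT_TOP_K = 0
DEFAULT_TOP_P = 1.0
DEFAULT_TYPICAL_P = 1.0
DEFAULT_SEED = None


@dataclass
class TextGenParams:
    top_k: int
    top_p: float
    typical_p: float
    seed: Optional[int]

    @classmethod
    def with_defaults(cls, **overrides) -> "TextGenParams":
        # Builder: start from the shared defaults, then apply overrides,
        # so every module constructs the object the same way.
        values = {
            "top_k": DEFAULT_TOP_K,
            "top_p": DEFAULT_TOP_P,
            "typical_p": DEFAULT_TYPICAL_P,
            "seed": DEFAULT_SEED,
        }
        values.update(overrides)
        return cls(**values)


# Usage: only the overridden field diverges from the shared defaults
params = TextGenParams.with_defaults(top_k=50)
```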
We decided to go with a flattened API for now. This may be revisited, so we are keeping this PR open.
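For reference, a runnable sketch of the difference between the flattened shape chosen here and the nested DM-object shape that may be revisited. The module and method names are stubs invented for comparison, not caikit's API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextGenParams:  # same illustrative DM as in the sketches above
    top_k: int = 0
    top_p: float = 1.0
    typical_p: float = 1.0
    seed: Optional[int] = None


class FakeModule:
    """Stub standing in for a caikit module, for shape comparison only."""

    # Flattened API (chosen for now): primitives passed directly to .run
    def run_flat(self, text: str, top_k: int = 0, top_p: float = 1.0,
                 typical_p: float = 1.0, seed: Optional[int] = None) -> str:
        return f"generated for {text!r}"

    # Nested alternative (may be revisited): one DM-typed parameter
    def run_nested(self, text: str, params: TextGenParams) -> str:
        return f"generated for {text!r}"


module = FakeModule()
module.run_flat("some prompt", top_k=50)
module.run_nested("some prompt", TextGenParams(top_k=50))
```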
Supports #140