Make feedback more flexible and light #337

jklaise · 2018-12-10T10:56:36Z

Currently the send-feedback API takes a batch of samples (request + response) but only a single float for the reward, which means that design choices must be made about how to distribute this reward across the samples in the batch (e.g. #336). One potential fix would be to change the reward to be of type DefaultData so that each sample in a batch gets its own reward.
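To make the per-sample idea concrete, here is a rough sketch in plain Python. It is not the Seldon API: `BatchFeedback` and `mean_reward` are hypothetical names, and plain lists stand in for the DefaultData type. It only illustrates carrying one reward per sample and aggregating later if a consumer wants a single scalar.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class BatchFeedback:
    """Hypothetical feedback message with one reward per sample in the batch."""

    requests: List[Sequence[float]]
    responses: List[Sequence[float]]
    rewards: List[float]  # assumption: same length and order as the samples

    def __post_init__(self) -> None:
        # Reject messages where a sample would be left without its own reward.
        n = len(self.requests)
        if len(self.responses) != n or len(self.rewards) != n:
            raise ValueError("requests, responses and rewards must have equal length")


def mean_reward(feedback: BatchFeedback) -> float:
    """Collapse to a single scalar only when a consumer actually needs one."""
    if not feedback.rewards:
        raise ValueError("cannot aggregate an empty batch")
    return sum(feedback.rewards) / len(feedback.rewards)
```

With this shape, a router that wants the old behaviour can still call something like `mean_reward`, while other components keep access to the individual rewards.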

Additionally, we need to re-evaluate sending both the request and the response as part of the feedback, as this could be wasteful with large payloads.

janvdvegt · 2018-12-17T11:09:45Z

I agree with both statements. Aggregating up front seems like throwing away information unnecessarily and closing off options for more elaborate choices. Aggregating later, if needed, seems better.

With regard to the request and response, would it be possible to send a unique identifier instead of the full request if needed? That way we could still keep a full audit trail linking the reward to an earlier request without having to send the full input multiple times. I believe using an identifier instead of the full payload would be useful in other places as well.
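A minimal sketch of what identifier-based feedback could look like, assuming the serving side keeps some record of earlier predictions. Everything here is hypothetical (`PredictionLog`, `LightFeedback`, `resolve`, and the in-memory dict used as storage); it is only meant to show the feedback message shrinking to an ID plus a reward.

```python
import uuid
from dataclasses import dataclass
from typing import Any, Dict, Tuple


class PredictionLog:
    """Hypothetical in-memory audit log keyed by a generated request ID."""

    def __init__(self) -> None:
        self._entries: Dict[str, Tuple[Any, Any]] = {}

    def record(self, request: Any, response: Any) -> str:
        # Store the pair once at prediction time and hand back its identifier.
        request_id = str(uuid.uuid4())
        self._entries[request_id] = (request, response)
        return request_id

    def lookup(self, request_id: str) -> Tuple[Any, Any]:
        # Raises KeyError if the identifier was never recorded.
        return self._entries[request_id]


@dataclass
class LightFeedback:
    """Hypothetical feedback message: an identifier and a reward, no payload."""

    request_id: str
    reward: float


def resolve(log: PredictionLog, feedback: LightFeedback) -> Tuple[Any, Any, float]:
    """Rebuild the (request, response, reward) triple for auditing or training."""
    request, response = log.lookup(feedback.request_id)
    return request, response, feedback.reward
```

The trade-off is that the serving side now has to persist predictions somewhere for as long as feedback may arrive, which a real deployment would handle with a proper store rather than a dict.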

ukclivecox · 2019-08-20T07:48:58Z

Related to the new spec in https://github.com/SeldonIO/mlgraph

seldondev · 2020-04-17T09:15:54Z

Issues go stale after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close.
/lifecycle stale

ukclivecox · 2022-01-09T10:38:51Z

Iter8 integration allows for more flexible usage now. We will work further on this for v2.

ukclivecox added the External API, Reinforcement Learning, and area/routing labels Jan 27, 2019

ukclivecox added this to the 0.3.x milestone Jan 27, 2019

ukclivecox modified the milestones: 0.3.x, 2.0.x Aug 23, 2019

seldondev added the lifecycle/stale label Apr 17, 2020

axsaucedo changed the title ~~Make feedback more flexible and light~~ OSS-2: Make feedback more flexible and light Apr 26, 2021

axsaucedo changed the title ~~OSS-2: Make feedback more flexible and light~~ Make feedback more flexible and light Apr 28, 2021

ukclivecox closed this as completed Jan 9, 2022

agrski pushed a commit that referenced this issue Dec 2, 2022

v0.1.0 change log (#337)

372f7f7
