I was reading a really interesting piece on Reddit about samplers, and a particularly interesting exchange highlighted a discrepancy: llama.cpp applies temperature late in its sampling chain (to the probabilities, after the truncation samplers have run), while the research literature and other implementations apply it earlier in the chain (to the logits).
I thought it would be unfortunate for this discussion to die without visibility, so I've opened a GH issue.
I would normally search for related historical / closed issues first, but I'm on my phone and that's rather cumbersome.
The interesting Reddit discussion:
https://www.reddit.com/r/LocalLLaMA/s/WonSDiMCoD
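To make the discrepancy concrete, here is a minimal NumPy sketch (my own illustration, not llama.cpp code; the logits and the top-p cutoff are arbitrary) showing that a truncation sampler like top-p keeps a different set of tokens depending on whether temperature runs before or after it:

```python
import numpy as np

def softmax(logits):
    z = np.exp(logits - logits.max())
    return z / z.sum()

def top_p_keep(probs, p):
    """Indices of the smallest set of tokens whose cumulative probability reaches p."""
    order = np.argsort(probs)[::-1]  # tokens sorted by descending probability
    cut = int(np.searchsorted(np.cumsum(probs[order]), p)) + 1
    return order[:cut]

logits = np.array([4.0, 3.0, 2.0, 1.0])
T, p = 2.0, 0.85

# Ordering A (literature / other implementations): temperature scales the
# logits first, so top-p sees the flattened, temperature-adjusted distribution.
kept_a = top_p_keep(softmax(logits / T), p)

# Ordering B (temperature last): top-p truncates the un-tempered
# distribution, and temperature only reshapes whatever survives.
kept_b = top_p_keep(softmax(logits), p)

print(kept_a)  # [0 1 2]  T=2 flattens the distribution, so three tokens survive
print(kept_b)  # [0 1]    truncation at T=1 keeps only two
```

With a temperature above 1, applying it first flattens the distribution and lets more tokens survive the cutoff; applying it last means every truncation decision is effectively made at T = 1.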
There was a discussion of this topic in the min_p PR, but only after it had already been merged. I remember experimenting with temperature being applied first after that, but a customizable sampler order seems like a better idea anyway; see the sketch below.
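For what it's worth, a customizable order can be sketched as a list of logit transforms applied in sequence. Everything below (the names, the `Sampler` type, the helpers) is a hypothetical illustration, not llama.cpp's actual sampler API:

```python
import numpy as np
from typing import Callable

# A sampler is just a transform over the logits; -inf marks a pruned token.
Sampler = Callable[[np.ndarray], np.ndarray]

def temperature(t: float) -> Sampler:
    return lambda logits: logits / t

def top_k(k: int) -> Sampler:
    def apply(logits: np.ndarray) -> np.ndarray:
        cutoff = np.sort(logits)[-k]  # k-th largest logit
        return np.where(logits >= cutoff, logits, -np.inf)
    return apply

def run_chain(logits: np.ndarray, chain: list[Sampler]) -> np.ndarray:
    for sampler in chain:
        logits = sampler(logits)
    z = np.exp(logits - logits.max())  # softmax over the survivors
    return z / z.sum()

logits = np.array([4.0, 3.0, 2.0, 1.0])
# The order is explicit in the list, so "temperature first" vs. "temperature
# last" becomes a user choice instead of a hard-coded pipeline.
probs = run_chain(logits, [temperature(1.5), top_k(3)])
```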
Thanks for the pointer! Closing this issue as it's on the radar of those who matter :)