The way to calculate the log_probs #18

laurenlong · 2024-06-08T15:07:33Z

Hi,
Since the model's output at each token position represents the probabilities of the next token, shouldn't the calculation of log_probs be shifted by one position?
I mean "diff_logits[range(diff_logits.shape[0]-1), continue_ids[1:]].sum().item()"
instead of "log_probs = diff_logits[range(diff_logits.shape[0]), continue_ids].sum().item()".
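
To make the two indexing schemes concrete, here is a minimal sketch using NumPy rather than the repository's tensor library. It assumes `diff_logits` holds one row of log-probabilities per position and that row `t` is the model's prediction for the token at position `t + 1`. The issue does not show how `diff_logits` is built, so if the rows were already sliced to line up with `continue_ids`, the original indexing would be the correct one. The helper names `log_softmax`, `aligned_log_prob`, and `shifted_log_prob`, and all toy values, are hypothetical.

```python
import numpy as np


def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def aligned_log_prob(log_probs, token_ids):
    """Original indexing: row t is scored against token_ids[t]."""
    rows = np.arange(len(token_ids))
    return log_probs[rows, token_ids].sum().item()


def shifted_log_prob(log_probs, token_ids):
    """Proposed indexing: row t is scored against token_ids[t + 1]."""
    rows = np.arange(len(token_ids) - 1)
    return log_probs[rows, token_ids[1:]].sum().item()


# Toy setup (assumption): a 5-token vocabulary and a 4-token continuation.
vocab_size = 5
continue_ids = np.array([2, 0, 3, 1])

# Build logits in which row t strongly predicts the NEXT token,
# mimicking a causal language model that has learned this sequence.
logits = np.zeros((len(continue_ids), vocab_size))
for t in range(len(continue_ids) - 1):
    logits[t, continue_ids[t + 1]] = 20.0

diff_logits = log_softmax(logits)

print("aligned:", aligned_log_prob(diff_logits, continue_ids))
print("shifted:", shifted_log_prob(diff_logits, continue_ids))
```

Under this assumption the shifted sum is close to zero, because each row is scored against the token it actually predicts, while the unshifted sum is strongly negative, because each row is scored against a token it was never asked to predict.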
