Question about the threshold #3

initzhang · 2024-12-02T07:53:33Z

Hello team, thanks for this great work!

I am confused about the threshold value. As shown in Equation 3 of the v4 version of the paper, a negative value of $\Delta_{conf}^t$ signifies reduced confidence in reaching a correct answer after step $t$. An intuitive value for the process labeling threshold $\theta$ would therefore be > 0, so that a step counts as good only when it moves the prediction closer to the ground truth. However, as discussed in Section 4.2.2, the threshold is set to $-0.5$, and the values tested range from $-0.5$ to $-0.9$.
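
To make the intuition concrete, here is a minimal sketch of how I understand the labeling rule. The relative-change form of $\Delta_{conf}^t$, the rule "label a step as good if $\Delta_{conf}^t > \theta$", the function names, and the confidence values are all my own assumptions for illustration. None of them are taken from the paper's code.

```python
from typing import List


def confidence_deltas(confidences: List[float]) -> List[float]:
    """Relative change in confidence between consecutive steps.

    Assumed form: (conf[t+1] - conf[t]) / conf[t]. Requires nonzero confidences.
    """
    return [(nxt - cur) / cur for cur, nxt in zip(confidences, confidences[1:])]


def label_steps(deltas: List[float], theta: float) -> List[int]:
    """Label a step 1 (good) if its delta exceeds theta, else 0 (bad)."""
    return [1 if d > theta else 0 for d in deltas]


# Hypothetical verifier confidences after each step of one solution.
confidences = [0.8, 0.7, 0.2, 0.3]
deltas = confidence_deltas(confidences)

labels_neg = label_steps(deltas, theta=-0.5)   # threshold used in the paper
labels_zero = label_steps(deltas, theta=0.0)   # the threshold I expected

print(labels_neg)
print(labels_zero)
```

Under these assumptions, $\theta = -0.5$ flags only the step with a large confidence drop, while $\theta = 0$ also flags the step with a mild drop.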

Could you explain why only negative values of $\theta$ are tested in Table 5? Did you observe that most steps have a negative $\Delta_{conf}^t$ during process labeling? Thank you!
