Interleaving pos/neg prompts to match what the repe code expects #59

chanind · 2024-01-11T20:09:32Z

It appears that the original Repe reading vector code expects each positive example to sit directly next to its corresponding negative example in the single flat list passed to the find_directions() method. This isn't documented anywhere, but it means we can't simply pass an arbitrary list of prompts to find_directions() as-is. Also, the Repe PCA code doesn't properly reflect each difference vector across the origin, so it's important to pass a balanced mix of <neg, pos> and <pos, neg> pairs to find_directions().
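To make the expected layout concrete, here is a minimal sketch of the flat, pair-adjacent list described above. The helper names `interleave` and `consecutive_pairs` are hypothetical and are not part of the Repe code or of this PR; the sketch only assumes that adjacent items are read back as pairs.

```python
def interleave(positives, negatives):
    """Flatten paired examples into [pos_0, neg_0, pos_1, neg_1, ...]."""
    if len(positives) != len(negatives):
        raise ValueError("each positive example needs exactly one negative example")
    flat = []
    for pos, neg in zip(positives, negatives):
        flat.extend([pos, neg])
    return flat


def consecutive_pairs(flat):
    """Read a flat list back as adjacent pairs (assumed consumer behaviour)."""
    if len(flat) % 2 != 0:
        raise ValueError("flat list must contain an even number of items")
    return [(flat[i], flat[i + 1]) for i in range(0, len(flat), 2)]


flat = interleave(["p0", "p1"], ["n0", "n1"])
pairs = consecutive_pairs(flat)
```

If the list were instead passed as all positives followed by all negatives, the same pair-wise reading would match positives with other positives, which is the failure mode this PR avoids.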

This PR addresses these issues by adding a multi_answer_method param to the RepeReadingControl algorithm. By default, it pairs the correct answer with just the first incorrect answer, but it can also be set to random_incorrect to pick an incorrect answer at random, or to repeat_correct to duplicate the correct answer so that it is paired with every incorrect answer. This PR also alternates the order of each example's pair between <neg, pos> and <pos, neg> to try to ensure a good balance of directions for PCA.
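A rough sketch of the pairing and alternation logic described above, not the actual implementation in this PR. The function names, the `rng` argument, and the option name `first_incorrect` for the default are assumptions made for illustration; only `random_incorrect` and `repeat_correct` are named in the description.

```python
import random


def build_pairs(correct, incorrect, multi_answer_method="first_incorrect", rng=None):
    """Return (positive, negative) pairs for one example.

    "first_incorrect" is a hypothetical name for the default behaviour.
    """
    if not incorrect:
        raise ValueError("need at least one incorrect answer")
    rng = rng or random.Random(0)
    if multi_answer_method == "first_incorrect":
        return [(correct, incorrect[0])]
    if multi_answer_method == "random_incorrect":
        return [(correct, rng.choice(incorrect))]
    if multi_answer_method == "repeat_correct":
        # Duplicate the correct answer once per incorrect answer.
        return [(correct, wrong) for wrong in incorrect]
    raise ValueError(f"unknown multi_answer_method: {multi_answer_method}")


def interleave_alternating(pairs):
    """Flatten pairs, flipping the order on every other pair.

    Even-indexed pairs are emitted as <pos, neg>, odd-indexed as <neg, pos>.
    Also returns a flag per pair recording whether the positive came first.
    """
    flat, pos_first = [], []
    for i, (pos, neg) in enumerate(pairs):
        first = i % 2 == 0
        flat.extend([pos, neg] if first else [neg, pos])
        pos_first.append(first)
    return flat, pos_first


pairs = build_pairs("A", ["B", "C"], multi_answer_method="repeat_correct")
flat, pos_first = interleave_alternating(pairs)
```

Recording which pairs were flipped, as `pos_first` does here, is one way to recover the sign of each difference vector later; whether the real code tracks this the same way is not stated in the description.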


chanind · 2024-01-11T21:06:56Z

Merging this now for the sake of speed, so it can be tested on Colab. @dtch1997 Please feel free to code-review this still; any changes can be addressed in a follow-up PR.

chanind requested a review from dtch1997 January 11, 2024 20:09

interleaving positive and negative prompts to match what the original…

b77ef3e

… repe code expects

chanind force-pushed the interleaved-pos-neg-repe branch from 55a8432 to b77ef3e Compare January 11, 2024 20:55

chanind merged commit 8eb3b75 into main Jan 11, 2024
2 checks passed

chanind deleted the interleaved-pos-neg-repe branch January 11, 2024 21:07
