new simulation (and visualization) for finding funniest caption #44

cwagaman · 2022-06-14T18:42:31Z

This new simulation (and its corresponding visualization) attempts to show how quickly a "best caption" rises to the top of the rankings. We perform two visualizations.

The graph titled "# Captions within 95% CI of Current Funniest" provides a visualization for how soon a caption (not necessarily the true funniest caption) can plausibly be identified as the funniest. First, the average user-provided rating is computed for each caption. Then, a 95% CI is computed for each of these average user-provided ratings (basically using the central limit theorem). The corresponding graph displays the number of captions with a 95% CI intersecting the 95% CI around the caption with the highest average user-provided rating.
The graph titled "# Captions with Simulated Rating Higher than True Funniest" provides a visualization for how quickly the funniest caption can be correctly identified. the following. Recall that we have access to the ground truth for which caption is funniest. This graph displays how many captions, after a given number of queries, have recieved an average user-provided rating that is better than the average user-provided rating received by the true funniest caption.

Each visualization is performed for three different learning strategies.

"Random" randomly selects captions for users to rate.
"Active" adaptively chooses captions for users to rate according to the upper confidence bound strategy described in https://arxiv.org/abs/1312.7308.
"lil_KLUCB" adaptively chooses captions for users to rate according to the upper confidence bound strategy described in https://arxiv.org/abs/1709.03570.

The line on each graph is a plot of the mean, taken over 10 samples. The shaded region around each line is the standard deviation.

cwagaman added 2 commits June 14, 2022 14:38

new simulation (and visualization) for finding funniest caption

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23
Expired

Verified
Learn about vigilant mode

2c93b05

IQR-based CIs for graphs

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.

GPG key ID: 4AEE18F83AFDEB23
Expired

Verified
Learn about vigilant mode

76ed443

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

new simulation (and visualization) for finding funniest caption #44

new simulation (and visualization) for finding funniest caption #44

cwagaman commented Jun 14, 2022

Uh oh!

new simulation (and visualization) for finding funniest caption #44

Are you sure you want to change the base?

new simulation (and visualization) for finding funniest caption #44

Conversation

cwagaman commented Jun 14, 2022

Uh oh!