Benchmarking #10

gperdrizet · 2024-07-21T12:51:58Z

Closed and re-opened this pull request to make sure that notebooks were up to date before merging.

Fixed some minor benchmarking bugs, finished full run of benchmarks. Also, started work on cleaning up and finishing benchmarking analysis notebooks. Finished notebook demonstrating Kullback-Leibler divergence for perplexity ratio scores. Merging to avoid excessive branch divergence.
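For readers unfamiliar with the metric, here is a minimal sketch of how a Kullback-Leibler divergence between two sets of scores can be estimated from shared histogram bins. It is illustrative only and is not taken from the notebook: the function name `kl_divergence`, the bin count, and the smoothing constant `eps` are all assumptions.

```python
import numpy as np


def kl_divergence(p_samples, q_samples, bins=20, eps=1e-10):
    """Estimate D_KL(P || Q) from two samples using shared histogram bins.

    Hypothetical helper for illustration; not the notebook's implementation.
    """
    p_samples = np.asarray(p_samples, dtype=float)
    q_samples = np.asarray(q_samples, dtype=float)

    # Use one set of bin edges spanning both samples so the bins line up.
    lo = min(p_samples.min(), q_samples.min())
    hi = max(p_samples.max(), q_samples.max())
    edges = np.linspace(lo, hi, bins + 1)

    p_counts, _ = np.histogram(p_samples, bins=edges)
    q_counts, _ = np.histogram(q_samples, bins=edges)

    # Add a small constant so empty bins do not produce log(0) or division by zero,
    # then normalize the counts into probability distributions.
    p = (p_counts + eps) / (p_counts + eps).sum()
    q = (q_counts + eps) / (q_counts + eps).sum()

    # Discrete KL divergence: sum over bins of p * log(p / q).
    return float(np.sum(p * np.log(p / q)))
```

The divergence is zero when both samples give the same histogram and grows as the two score distributions separate. It is not symmetric, so `kl_divergence(a, b)` and `kl_divergence(b, a)` generally differ.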

gperdrizet added 18 commits July 11, 2024 09:41

Added hf_model_string to results of model loading benchmark, restarting runs was not working correctly without it.

6109105

Merge branch 'main' into benchmarking to get updated requirements.txt on branch.

d048a9a

Added fence to avoid asking for first element of run dictionaries list if it's empty because all runs are complete.

3b53197

Added note about HuggingFace login for gated models.

c5897b3

Finished dealing with the edge case where we are asked to resume an experiment which is already complete.

e115449

Moved CPU core count assignment to batch runner function, i.e. before the LLM(s) are loaded after getting strange results from model loading benchmark.

9fa9b14

Fixed bug where CPU thread count was not properly being set in LLM class during benchmarking runs.

188384a

Updated plots with new data, refactored factor level exclusion logic out of plotting functions. Added some other quality of life improvements.

983f0da

Started work on cleaning up and updating notebooks with new benchmarking data, added master outline of notebooks.

c9b8635

Added KL divergence example image.

0109f62

Cropped KL divergence figure.

481d14e

Set up Kullback-Leibler divergence for perplexity ratio scores.

ec389b4

Re-ran benchmarking notebooks with new data.

cfecba7

Finalized perplexity ratio benchmarks.

bc6042e

Added note about next steps.

03274cc

Updated>

fb502db

Re-ran with new data.

af90d0a

Merge branch 'main' into benchmarking

1ad776a

gperdrizet merged commit 9283967 into main Jul 21, 2024
