Skip to content

cychomatica/Inference-Scaling

Repository files navigation

ComputeScaling-Replication

replication of part of the huggingface blog https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute

Since the details of grading implementation in the blog is not enough to reproduce the results in the blog, i adapted the grading code in the https://github.com/openai/prm800k

Replication of math-psa (https://huggingface.co/openreasoner/Math-psa/tree/main)

Using "last" as the aggregation method: replication results of math-psa

Using "mean" as the aggregation method: replication results of math-psa

Using "min" as the aggregation method: replication results of math-psa

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published