Ahmad21Omar
Contributor

We are pleased to propose the addition of SLR-Bench as a new Community Task, authored in collaboration with @lukashelff.

SLR-Bench is a large-scale benchmark for scalable logical reasoning with language models, comprising 19,000 prompts organized into 20 curriculum levels. The tasks progressively increase in relational, arithmetic, and recursive complexity, requiring models to synthesize Prolog rules that classify train compositions.

Link to the Paper: https://arxiv.org/abs/2506.15787
Link to the Dataset: https://huggingface.co/datasets/AIML-TUDA/SLR-Bench
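
To make the task format described above concrete, here is a minimal, purely illustrative sketch of what "a rule that classifies train compositions" could look like when scored against labeled examples. Everything in it is an assumption for illustration: the `Car` fields, the `has_red_car` rule, the toy examples, and the `accuracy` helper are hypothetical and are not taken from the SLR-Bench dataset or its evaluation code. The benchmark itself asks for Prolog rules, so the Python predicate only stands in for one.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Car:
    """Hypothetical car description; the real dataset schema may differ."""
    color: str
    length: str  # "short" or "long"


Train = Tuple[Car, ...]

# A Prolog-style hypothesis this predicate stands in for (illustrative only):
#   eastbound(T) :- has_car(T, C), car_color(C, red).
def has_red_car(train: Train) -> bool:
    """Candidate rule: a train is positive if any of its cars is red."""
    return any(car.color == "red" for car in train)


def accuracy(rule: Callable[[Train], bool],
             examples: List[Tuple[Train, bool]]) -> float:
    """Fraction of labeled trains on which the rule agrees with the label."""
    correct = sum(1 for train, label in examples if rule(train) == label)
    return correct / len(examples)


# Toy labeled examples, invented for this sketch.
EXAMPLES: List[Tuple[Train, bool]] = [
    ((Car("red", "short"), Car("blue", "long")), True),
    ((Car("green", "long"),), False),
    ((Car("blue", "short"), Car("red", "long")), True),
    ((Car("yellow", "short"),), False),
]

score = accuracy(has_red_car, EXAMPLES)
print(f"rule accuracy on toy examples: {score:.2f}")
```

A rule that matches every label scores 1.0 here, while a trivial rule that accepts every train scores 0.5 on this balanced toy set, which is why a per-example check of this kind separates a genuinely induced rule from a guess.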

@HuggingFaceDocBuilderDev
Collaborator

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@NathanHB left a comment

hey ! thanks for the addition it looks great, only have a few nits :)

@Ahmad21Omar
Contributor Author

> hey ! thanks for the addition it looks great, only have a few nits :)

Hello there! :)

Thank you for your quick feedback and review. I have addressed the mentioned issues in my latest commit.

If you notice any issues with my recent changes or have any additional suggestions for improvement, please feel free to let us know.

Best regards!

@NathanHB
Member

looks good ! thanks for the fixes :)

@NathanHB merged commit c7a063a into huggingface:main on Sep 25, 2025
4 checks passed