About the reproduction. #4

Open
jumptoliujj opened this issue Nov 18, 2023 · 6 comments

@jumptoliujj

We ran experiments on PrOntoQA and FOLIO, but the accuracy is only about 51%~53%. We ran logic_program.py, self_refinement.py, and then logic_inference.py and evaluation.py. Is anything wrong with these steps? Please correct me if so.
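For reference, the sequence described above can be sketched as a small driver that runs the stages one after another and stops at the first failure, so a broken early stage does not silently feed bad input to the later ones. This is only a sketch: the script names come from the comment, the order mirrors the comment (not necessarily what the README prescribes), and `run_stages` and `default_commands` are hypothetical helpers. The command-line arguments each script needs are deliberately left out and have to be taken from the README.

```python
import subprocess
import sys

# Script names as listed in the comment above. Their command-line arguments
# are intentionally omitted and must be filled in from the README.
STAGE_SCRIPTS = [
    "logic_program.py",
    "self_refinement.py",
    "logic_inference.py",
    "evaluation.py",
]


def default_commands():
    """Build one argv list per stage, using the current Python interpreter."""
    return [[sys.executable, script] for script in STAGE_SCRIPTS]


def run_stages(commands):
    """Run each command in order and stop at the first non-zero exit code.

    Returns the list of commands that completed successfully.
    """
    completed = []
    for argv in commands:
        result = subprocess.run(argv)
        if result.returncode != 0:
            print(f"stage failed: {' '.join(argv)} (exit {result.returncode})")
            break
        completed.append(argv)
    return completed
```

Calling `run_stages(default_commands())` from the repository root would run the four scripts in the order listed, once each argv list has been extended with the dataset and model arguments.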

@RutaTang

Hi @jumptoliujj,

Did you figure it out? I have run into the same issue, and I am not sure whether or where I went wrong.

@jumptoliujj

Just use text-davinci-003 instead of gpt-3.5-turbo, and you will get results similar to those in the paper.

@RutaTang

RutaTang commented Feb 21, 2024

Thank you!

@zhuang-li

I have the same issue. The logic program only reached 51.4 on PrOntoQA with gpt-3.5-turbo, which is far from the result reported in the paper (61).

@abhinandan12345678

I tried to run the models using the commands in the README, but it fails with "Error in generating example".
The changes I made in requirements.txt are:
changed certifi @ file:///croot/certifi_1671487769961/work/certifi to certifi==2022.12.7, because the URL was causing an error.
removed sklearn, as it is deprecated.
Can anyone please help me with this issue?
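The two requirements.txt edits described above can be applied mechanically. The sketch below is one way to do it, not something taken from the repository: `clean_requirements` is a hypothetical helper that replaces any `name @ file://...` line (a reference to a local build path that only exists on the original machine) with a pinned version supplied by the caller, and drops the packages named in `drop`.

```python
import re

# Matches requirement lines that point at a local build path,
# e.g. "certifi @ file:///croot/.../work/certifi".
LOCAL_REF = re.compile(r"^([A-Za-z0-9_.\-]+)\s*@\s*file://")


def clean_requirements(lines, pins, drop=("sklearn",)):
    """Return requirement lines with local file references and dropped packages fixed.

    pins: mapping of package name -> version used to replace "@ file://" lines.
          Packages without a pin are kept as a bare, unpinned name.
    drop: package names to remove entirely (exact, case-insensitive match).
    """
    cleaned = []
    for line in lines:
        stripped = line.strip()
        match = LOCAL_REF.match(stripped)
        if match:
            name = match.group(1)
            version = pins.get(name)
            cleaned.append(f"{name}=={version}" if version else name)
            continue
        # Package name is everything before the first specifier character.
        package = re.split(r"[=<>!~\s;\[]", stripped, maxsplit=1)[0]
        if package.lower() in drop:
            continue
        cleaned.append(stripped)
    return cleaned


example = [
    "certifi @ file:///croot/certifi_1671487769961/work/certifi",
    "sklearn",
]
print(clean_requirements(example, pins={"certifi": "2022.12.7"}))
# prints ['certifi==2022.12.7']
```

If the code actually imports `sklearn`, the deprecated `sklearn` entry would need to be replaced with `scikit-learn` rather than simply removed; whether that applies here is not stated in the thread.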

@wernerolaf

wernerolaf commented Sep 8, 2024

I have problems with Pyke, as it is not always deterministic. There are also problems with the cache.
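The comment does not say which cache is meant. One possible reading, and it is only an assumption, is the `compiled_krb` directory that Pyke writes when it compiles rule files, which can go stale between runs. Under that assumption, a minimal sketch for clearing those directories before each run looks like this (`clear_pyke_cache` is a hypothetical helper, not part of Pyke or the repository). It does not address the non-determinism itself.

```python
import shutil
from pathlib import Path


def clear_pyke_cache(root, cache_dir_name="compiled_krb"):
    """Delete every directory named cache_dir_name under root.

    Returns the list of removed paths so the caller can log them.
    """
    removed = []
    # Materialize the matches first so deletion does not disturb the walk.
    for path in sorted(Path(root).rglob(cache_dir_name)):
        if path.is_dir():
            shutil.rmtree(path)
            removed.append(path)
    return removed
```

Calling `clear_pyke_cache(".")` from the project root before running inference forces Pyke to recompile its rule files from scratch.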
