Guidance on Using Spider2 for Local Text-to-SQL Benchmarking #28

Open
shafqatjamil opened this issue Nov 29, 2024 · 3 comments

Comments

@shafqatjamil

Hi,

I am new to Spider2 and looking for guidance on how to use it effectively for benchmarking our existing Text-to-SQL model. While I have successfully set up Spider2 locally, I am unsure about the next steps to proceed with benchmarking.

Here is what I have done so far:

  1. Followed the steps mentioned in the Spider2 Lite documentation.
  2. Copied the SQLite files and executed evaluate.py.

However, I am not sure how to proceed with running Spider2 for evaluating my own Text-to-SQL model. Could you please guide me on:

  1. The steps required to configure Spider2 to evaluate a custom Text-to-SQL model.
  2. How to integrate my model into the Spider2 benchmarking pipeline.
  3. Any resources, tutorials, or documentation that would help me better understand and use Spider2 for this purpose.
  4. Best practices for benchmarking Text-to-SQL models using Spider2.

I would greatly appreciate any pointers or advice to get started.

Thank you in advance for your time and help!

@antonio-veezoo

+1

@lfy79001
Collaborator

lfy79001 commented Dec 2, 2024

Hi,

For Spider2-Lite, we have reproduced three commonly used baselines: CodeS, DAIL-SQL, and DIN-SQL. The code is available here: [Spider2-Lite Baselines](https://github.com/xlang-ai/Spider2/tree/main/spider2-lite/baselines). If you wish to run your own Text-to-SQL model, please refer to the data processing methods used in these baselines.
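As a rough illustration of plugging in your own model, the sketch below loops over a file of questions, calls a model function, and writes one `.sql` file per example. It rests on assumptions that are not confirmed in this thread: that the questions are stored one JSON object per line with `instance_id`, `db`, and `question` fields, and that the evaluator accepts one SQL file per instance. `write_predictions` and `generate_sql` are hypothetical names; check the baselines for the actual input and output layout.

```python
import json
from pathlib import Path
from typing import Callable


def write_predictions(
    questions_path: Path,
    output_dir: Path,
    generate_sql: Callable[[str, str], str],
) -> int:
    """Run generate_sql on every example and save one .sql file per example.

    generate_sql is your own model wrapper: (question, database name) -> SQL.
    Returns the number of prediction files written.
    """
    output_dir.mkdir(parents=True, exist_ok=True)
    written = 0
    with questions_path.open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # skip blank lines in the JSONL file
            example = json.loads(line)
            # ASSUMPTION: these field names are illustrative; adjust them
            # to match the benchmark file you are actually reading.
            instance_id = example["instance_id"]
            sql = generate_sql(example["question"], example["db"])
            target = output_dir / f"{instance_id}.sql"
            target.write_text(sql.strip() + "\n", encoding="utf-8")
            written += 1
    return written
```

Keeping the model behind a plain function makes it easy to swap in any Text-to-SQL system without touching the file handling.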

For Spider2, you can refer to the [Spider-Agent](https://github.com/xlang-ai/Spider2/tree/main/methods/spider-agent) implementation and build your own method based on it.

If you encounter any bugs or issues, feel free to contact us promptly.

Thank you!

@shafqatjamil
Author

shafqatjamil commented Dec 4, 2024

Thank you for your prompt response; I really appreciate it. I have installed the dependencies in dinsql and tried to run the run.sh script, but I am getting a "token limit exceeded" error. It's more of a GPT issue, and I am now trying to process the prompt in chunks. I have a few more questions.

  1. Is there a minimum number of databases or SQL queries required for evaluation?
  2. Do we need to provide preprocessed JSON files, or will these be created from a SQL dump file if we provide a dump of our own database?
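For context on the second question, here is a minimal sketch of deriving schema JSON directly from a SQLite file using Python's standard `sqlite3` module. The output layout (a mapping from table name to a list of column names and types) is a hypothetical format chosen for illustration, not the format the Spider2 baselines expect, and `dump_schema` is an invented helper name.

```python
import json
import sqlite3


def dump_schema(db_path: str) -> dict:
    """Return {table: [{"name": ..., "type": ...}, ...]} for a SQLite file."""
    schema = {}
    conn = sqlite3.connect(db_path)
    try:
        # List user tables, leaving out SQLite's internal sqlite_* tables.
        tables = conn.execute(
            "SELECT name FROM sqlite_master "
            "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
            "ORDER BY name"
        ).fetchall()
        for (table,) in tables:
            # Quote the identifier so names with spaces or quotes still work.
            quoted = '"' + table.replace('"', '""') + '"'
            columns = conn.execute(f"PRAGMA table_info({quoted})").fetchall()
            # Each row is (cid, name, type, notnull, dflt_value, pk).
            schema[table] = [{"name": c[1], "type": c[2]} for c in columns]
    finally:
        conn.close()
    return schema


def schema_to_json(db_path: str) -> str:
    """Serialize the schema in the hypothetical layout described above."""
    return json.dumps(dump_schema(db_path), indent=2)
```

A SQL dump can be loaded into a SQLite file first (for example with `sqlite3.Connection.executescript`) and then passed to `dump_schema`.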

I am new to this field, so you might find my questions silly.
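For reference, the chunking approach mentioned above for the token-limit error could look roughly like this. It is a sketch under stated assumptions: table definitions are grouped greedily so that each group stays under a token budget, and tokens are estimated at about four characters each, which is only a crude heuristic and not the model's real tokenizer. `chunk_tables` is a hypothetical helper and is not part of the dinsql baseline.

```python
from typing import Dict, List


def chunk_tables(
    table_ddl: Dict[str, str],
    max_tokens: int,
    chars_per_token: int = 4,
) -> List[List[str]]:
    """Group table names so each group's DDL fits an approximate token budget.

    A table whose DDL alone exceeds the budget still gets its own group,
    so the caller must handle (for example, truncate) such tables.
    """
    chunks: List[List[str]] = []
    current: List[str] = []
    used = 0
    for name, ddl in table_ddl.items():
        # Crude estimate: roughly one token per chars_per_token characters.
        cost = len(ddl) // chars_per_token + 1
        if current and used + cost > max_tokens:
            chunks.append(current)
            current, used = [], 0
        current.append(name)
        used += cost
    if current:
        chunks.append(current)
    return chunks
```

Each group can then be sent in its own prompt, for instance to ask the model which tables are relevant before building the final SQL-generation prompt.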
