Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add spider example with 150 samples #165

Closed
wants to merge 12 commits into from
Closed

Conversation

arjunattam
Copy link
Member

No description provided.

Copy link

changeset-bot bot commented Apr 19, 2024

⚠️ No Changeset found

Latest commit: ca1aff9

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

Copy link
Contributor

github-actions bot commented Apr 19, 2024

Empirical Run Summary

Run #9880: gpt-3.5-turbo Run #58c6: gpt-4-turbo-preview
Outputs 100% 100%
Scores
json-syntax 100% 100%
Avg latency 690ms 1147ms

Total dataset samples: 2

@arjunattam arjunattam marked this pull request as draft April 20, 2024 04:33
## Usage

```
npx @empiricalrun/cli run
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

merge with spider example. known gaps

  • iteration speed will get slower with the full dataset: do we want to fix that? (e.g. add a way to sample the test dataset through rc.json)

"type": "model",
"name": "prompt-learner-gpt3.5",
"model": "gpt-3.5-turbo",
"prompt": "You are a helpful AI assistant.\nYou are helping a user with a SQLGeneration task.\nThe user gives you the following task description.\nPlease generate SQL query for the given texts to run on sqlite. I will use your output directly in sqlite so only give me the final executable SQL. Do not wrap it in backticks or quotes.\nHere are a few examples to help you understand the task better.\nschema: CREATE TABLE singer (\n singer_id NUMERIC PRIMARY KEY,\n name TEXT,\n country TEXT,\n song_name TEXT,\n song_release_year TEXT,\n age NUMERIC,\n is_male TIMESTAMP\n);\ntext: What is the average, minimum, and maximum age for all French singers?\nSQL: SELECT AVG(age), MIN(age), MAX(age) FROM singer WHERE country='France';\nThink step by step.\nGiven the text, you have to now generate a SQL query.Only output the SQL and nothing else.\nschema: {{schema}}\ntext: {{question}}\nSQL:",
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

move these to a script (blocked on how we run py-script; need prompt generation to be out of the generation loop)

os.remove(sqlite_file_name)
con = sqlite3.connect(sqlite_file_name)
cur = con.cursor()
schema_script = load_json("/Users/arjun/notebooks/schema.json")[db_name]
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • add preparation steps (from notebooks) file to this repo
  • fix paths here

Base automatically changed from sum-latency to main April 22, 2024 10:59
@arjunattam
Copy link
Member Author

closing this since we've covered in #230

@arjunattam arjunattam closed this May 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant