Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

updating to match current repo #1

Merged
merged 19 commits into from
Nov 20, 2023

Conversation

smadarapu3
Copy link
Owner

No description provided.

LordDarkula and others added 19 commits November 2, 2023 20:37
The environment created in the setup instructions in the documentation
calls the environment `test_eva_db`
Update the DockerFile in order to resolve dependency issues along with
fixing the invalid CMD that was previously passed in.
This PR fixes an issue in CREATE INDEX IF NOT EXISTS command wherein if
'IF NOT EXISTS' is passed, we had an unreferenced variable issue. Added
Unit Tests to check the correctness of both the cases.

Also reverted the index changes while merging dataframes after vector
scan, as it's failing for some cases where indexes can be undefined.
Support semi-colons in string literals for queries of the form:
```
"""SELECT ChatGPT("Here's a; question", "This is the context") FROM TAIPAI;"""
```

Also support string escape to run ChatGPT queries more easily:
```
"""SELECT ChatGPT('Here\\'s a question', 'This is the context') FROM TAIPAI;"""
```
This PR introduces a generic interface to support 3rd party apps in
EVADB. As an example. the template for integrating slack has been added.
In a subsequent PR the integration with slack will be completed.

---------

Co-authored-by: Gaurav Tarlok Kakkar <gaurav21776@gmail.com>
Co-authored-by: Joy Arulraj <jjamesaj@gmail.com>
Co-authored-by: Joy Arulraj <arulraj@gatech.edu>
Co-authored-by: Kaushik Ravichandran <kravicha3@ada-01.cc.gatech.edu>
Added separate error handling for ModuleNotFoundError and
FileNotFoundError
	modified:   evadb/utils/generic_utils.py
Issue - [721](#721)

Currently, we abort the entire process when the load executor encounters
a corrupted file.
…d pdf functionality. (#1343)

Issue #1067 about not being able to load pdf files, was verified to be
working with evadb documentation pdf and a new page for loading pdf is
added to the documentation.
<img width="1310" alt="Screenshot 2023-11-07 at 1 33 01 AM"
src="https://github.com/georgia-tech-db/evadb/assets/32676813/af2fa40b-c8c1-4f3d-b93f-98d0bf278a5b">

Co-authored-by: Lohith K S <lohithks@Lohiths-MacBook-Pro-2.local>
In the previous commit, we added the changes for displaying the train
scores and train times for XGBoost. We plan to add similar changes to
Ludwig integration as well.

---------

Co-authored-by: Jineet Desai <jdesai61@gatech.edu>
Co-authored-by: Andy Xu <xzdandy@gmail.com>
- Fix the following queries:

```
SELECT * FROM postgres_data.home_rentals where neighborhood='downtown' and number_of_rooms=2;
```

- Improve the error message: Instead of throwing arbitrary mask error,
now we raise `Unsupported Logical Operator: ...`.
This PR adds support for creating and dropping jobs in evadb based on
this [task](#1248).

1.  Jobs can be created using the create job query:

   
> CREATE JOB {job_name} AS {
>             {job_queries; ...}
>     }
>     START {start_time}
>     END {end_time}
>     EVERY {repeat_period} {repeat_unit}

2. Created jobs can be dropped using:

> DROP JOB {job_name}

3. The scheduled jobs will only be triggered if the job scheduler
process is started explicitly using:

> EvaDBConnection.start_jobs()

4. The job scheduler process can be stopped using:

> EvaDBConnection.stop_jobs()

---------

Co-authored-by: Gaurav Tarlok Kakkar <gaurav21776@gmail.com>
Provide feedback when `Forecasting` UDF is called in the following ways:

- [x] Reporting confidence intervals
- [x] Returning a metric for the forecasting performance.
- [x] Providing suggestions in simple special cases, such as during Flat
predictions.

Eg:
```sql
SELECT HomeForecast();
```

```
SUGGESTION: Predictions are flat. Consider using LIBRARY 'neuralforecast' for more accrate predictions.
```

Partially fixes #1257 and #1243.

---------

Co-authored-by: Andy Xu <xzdandy@gmail.com>
Flaml provides support for Sklearn models like Random Forests, KNN,
Extra Trees Regressor, and Logistic Regression with regularization. We
plan to integrate these ML models into EVADB.
Link for Flaml documentation:
https://microsoft.github.io/FLAML/docs/Use-Cases/Task-Oriented-AutoML

---------

Co-authored-by: Jineet Desai <jdesai61@gatech.edu>
Migrate ChatGPT function to openai v1.0.

The test is skipped in circleCI because we must supply the
`OPENAI_API_KEY`. The test passes on local machine.

- [x] Upgrade ChatGPT function.
- [x] Upgrade Dall-e function.
- [x] Update unit test cases.
- [x] Verify that notebooks work correctly.
Bump Version to v0.3.10+dev

---------

Co-authored-by: Jiashen Cao <caojiashen24@gmail.com>
Co-authored-by: Joy Arulraj <arulraj@gatech.edu>
This PR supports searching Hackernews stories in EVADB:

- We treat Hackernews as a data source
- Algolia search API for hackernews is used for searching
- Get requests are used
- We support searching within: `story`, `comment`, `URL`, `story`,
`poll` or a combination of them
- Syntax:
```
params = {
        "query": "EVADB",
        "tags": "(story,poll)",
    }
query = f"""CREATE DATABASE hackernews_data
    WITH ENGINE = "hackernews",
    PARAMETERS = {params};"""
```

---------

Co-authored-by: Kaushik Ravichandran <kravicha3@ada-01.cc.gatech.edu>
@smadarapu3 smadarapu3 merged commit fc806bd into smadarapu3:staging Nov 20, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.