-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* "wip on endpoint listing" * "wip on list_endpoints typing" * "refactoring to endpoint object" * "wip filtering" * "endpoints queried!" * "refactoring" * "wip on cleaning up types" * "type cleanup complete" * "moving files" * "use a dummy embedding model for deletes" * fix some bits from merge, get evals working again with CAII, tests passing * formatting * clean up ruff stuff * use the chat llm for evals * fix mypy for reformatting * "wip on java reconciler" * "reconciler don't do no model; start python work" * "python - updating for summarization model" * "comment out batch embeddings to get it working again" * add handling for no summarization in the files table * finish up ui and python for summarization * make sure to update the time-updated fields on data sources and chat sessions * use no-op models when we don't need real ones for summary functionality * Update release version to dev-testing * use the summarization llm when summarizing summaries --------- Co-authored-by: Elijah Williams <ewilliams@cloudera.com> Co-authored-by: actions-user <actions@github.com>
- Loading branch information
1 parent
2dac585
commit bdfabb7
Showing
53 changed files
with
995 additions
and
338 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
45 changes: 45 additions & 0 deletions
45
backend/src/main/resources/migrations/h2/15_add_summarization_model.down.sql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
/* | ||
* CLOUDERA APPLIED MACHINE LEARNING PROTOTYPE (AMP) | ||
* (C) Cloudera, Inc. 2024 | ||
* All rights reserved. | ||
* | ||
* Applicable Open Source License: Apache 2.0 | ||
* | ||
* NOTE: Cloudera open source products are modular software products | ||
* made up of hundreds of individual components, each of which was | ||
* individually copyrighted. Each Cloudera open source product is a | ||
* collective work under U.S. Copyright Law. Your license to use the | ||
* collective work is as provided in your written agreement with | ||
* Cloudera. Used apart from the collective work, this file is | ||
* licensed for your use pursuant to the open source license | ||
* identified above. | ||
* | ||
* This code is provided to you pursuant a written agreement with | ||
* (i) Cloudera, Inc. or (ii) a third-party authorized to distribute | ||
* this code. If you do not have a written agreement with Cloudera nor | ||
* with an authorized and properly licensed third party, you do not | ||
* have any rights to access nor to use this code. | ||
* | ||
* Absent a written agreement with Cloudera, Inc. (“Cloudera”) to the | ||
* contrary, A) CLOUDERA PROVIDES THIS CODE TO YOU WITHOUT WARRANTIES OF ANY | ||
* KIND; (B) CLOUDERA DISCLAIMS ANY AND ALL EXPRESS AND IMPLIED | ||
* WARRANTIES WITH RESPECT TO THIS CODE, INCLUDING BUT NOT LIMITED TO | ||
* IMPLIED WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY AND | ||
* FITNESS FOR A PARTICULAR PURPOSE; (C) CLOUDERA IS NOT LIABLE TO YOU, | ||
* AND WILL NOT DEFEND, INDEMNIFY, NOR HOLD YOU HARMLESS FOR ANY CLAIMS | ||
* ARISING FROM OR RELATED TO THE CODE; AND (D)WITH RESPECT TO YOUR EXERCISE | ||
* OF ANY RIGHTS GRANTED TO YOU FOR THE CODE, CLOUDERA IS NOT LIABLE FOR ANY | ||
* DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR | ||
* CONSEQUENTIAL DAMAGES INCLUDING, BUT NOT LIMITED TO, DAMAGES | ||
* RELATED TO LOST REVENUE, LOST PROFITS, LOSS OF INCOME, LOSS OF | ||
* BUSINESS ADVANTAGE OR UNAVAILABILITY, OR LOSS OR CORRUPTION OF | ||
* DATA. | ||
*/ | ||
|
||
SET MODE MYSQL; | ||
|
||
BEGIN; | ||
|
||
ALTER TABLE rag_data_source DROP COLUMN summarization_model; | ||
|
||
COMMIT; |
45 changes: 45 additions & 0 deletions
45
backend/src/main/resources/migrations/h2/15_add_summarization_model.up.sql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
/* | ||
* CLOUDERA APPLIED MACHINE LEARNING PROTOTYPE (AMP) | ||
* (C) Cloudera, Inc. 2024 | ||
* All rights reserved. | ||
* | ||
* Applicable Open Source License: Apache 2.0 | ||
* | ||
* NOTE: Cloudera open source products are modular software products | ||
* made up of hundreds of individual components, each of which was | ||
* individually copyrighted. Each Cloudera open source product is a | ||
* collective work under U.S. Copyright Law. Your license to use the | ||
* collective work is as provided in your written agreement with | ||
* Cloudera. Used apart from the collective work, this file is | ||
* licensed for your use pursuant to the open source license | ||
* identified above. | ||
* | ||
* This code is provided to you pursuant a written agreement with | ||
* (i) Cloudera, Inc. or (ii) a third-party authorized to distribute | ||
* this code. If you do not have a written agreement with Cloudera nor | ||
* with an authorized and properly licensed third party, you do not | ||
* have any rights to access nor to use this code. | ||
* | ||
* Absent a written agreement with Cloudera, Inc. (“Cloudera”) to the | ||
* contrary, A) CLOUDERA PROVIDES THIS CODE TO YOU WITHOUT WARRANTIES OF ANY | ||
* KIND; (B) CLOUDERA DISCLAIMS ANY AND ALL EXPRESS AND IMPLIED | ||
* WARRANTIES WITH RESPECT TO THIS CODE, INCLUDING BUT NOT LIMITED TO | ||
* IMPLIED WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY AND | ||
* FITNESS FOR A PARTICULAR PURPOSE; (C) CLOUDERA IS NOT LIABLE TO YOU, | ||
* AND WILL NOT DEFEND, INDEMNIFY, NOR HOLD YOU HARMLESS FOR ANY CLAIMS | ||
* ARISING FROM OR RELATED TO THE CODE; AND (D)WITH RESPECT TO YOUR EXERCISE | ||
* OF ANY RIGHTS GRANTED TO YOU FOR THE CODE, CLOUDERA IS NOT LIABLE FOR ANY | ||
* DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR | ||
* CONSEQUENTIAL DAMAGES INCLUDING, BUT NOT LIMITED TO, DAMAGES | ||
* RELATED TO LOST REVENUE, LOST PROFITS, LOSS OF INCOME, LOSS OF | ||
* BUSINESS ADVANTAGE OR UNAVAILABILITY, OR LOSS OR CORRUPTION OF | ||
* DATA. | ||
*/ | ||
|
||
SET MODE MYSQL; | ||
|
||
BEGIN; | ||
|
||
ALTER TABLE rag_data_source ADD COLUMN summarization_model varchar(255); | ||
|
||
COMMIT; |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
45 changes: 45 additions & 0 deletions
45
backend/src/main/resources/migrations/postgres/15_add_summarization_model.down.sql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
/* | ||
* CLOUDERA APPLIED MACHINE LEARNING PROTOTYPE (AMP) | ||
* (C) Cloudera, Inc. 2024 | ||
* All rights reserved. | ||
* | ||
* Applicable Open Source License: Apache 2.0 | ||
* | ||
* NOTE: Cloudera open source products are modular software products | ||
* made up of hundreds of individual components, each of which was | ||
* individually copyrighted. Each Cloudera open source product is a | ||
* collective work under U.S. Copyright Law. Your license to use the | ||
* collective work is as provided in your written agreement with | ||
* Cloudera. Used apart from the collective work, this file is | ||
* licensed for your use pursuant to the open source license | ||
* identified above. | ||
* | ||
* This code is provided to you pursuant a written agreement with | ||
* (i) Cloudera, Inc. or (ii) a third-party authorized to distribute | ||
* this code. If you do not have a written agreement with Cloudera nor | ||
* with an authorized and properly licensed third party, you do not | ||
* have any rights to access nor to use this code. | ||
* | ||
* Absent a written agreement with Cloudera, Inc. (“Cloudera”) to the | ||
* contrary, A) CLOUDERA PROVIDES THIS CODE TO YOU WITHOUT WARRANTIES OF ANY | ||
* KIND; (B) CLOUDERA DISCLAIMS ANY AND ALL EXPRESS AND IMPLIED | ||
* WARRANTIES WITH RESPECT TO THIS CODE, INCLUDING BUT NOT LIMITED TO | ||
* IMPLIED WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY AND | ||
* FITNESS FOR A PARTICULAR PURPOSE; (C) CLOUDERA IS NOT LIABLE TO YOU, | ||
* AND WILL NOT DEFEND, INDEMNIFY, NOR HOLD YOU HARMLESS FOR ANY CLAIMS | ||
* ARISING FROM OR RELATED TO THE CODE; AND (D)WITH RESPECT TO YOUR EXERCISE | ||
* OF ANY RIGHTS GRANTED TO YOU FOR THE CODE, CLOUDERA IS NOT LIABLE FOR ANY | ||
* DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR | ||
* CONSEQUENTIAL DAMAGES INCLUDING, BUT NOT LIMITED TO, DAMAGES | ||
* RELATED TO LOST REVENUE, LOST PROFITS, LOSS OF INCOME, LOSS OF | ||
* BUSINESS ADVANTAGE OR UNAVAILABILITY, OR LOSS OR CORRUPTION OF | ||
* DATA. | ||
*/ | ||
|
||
SET MODE MYSQL; | ||
|
||
BEGIN; | ||
|
||
ALTER TABLE rag_data_source DROP COLUMN embedding_model; | ||
|
||
COMMIT; |
45 changes: 45 additions & 0 deletions
45
backend/src/main/resources/migrations/postgres/15_add_summarization_model.up.sql
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,45 @@ | ||
/* | ||
* CLOUDERA APPLIED MACHINE LEARNING PROTOTYPE (AMP) | ||
* (C) Cloudera, Inc. 2024 | ||
* All rights reserved. | ||
* | ||
* Applicable Open Source License: Apache 2.0 | ||
* | ||
* NOTE: Cloudera open source products are modular software products | ||
* made up of hundreds of individual components, each of which was | ||
* individually copyrighted. Each Cloudera open source product is a | ||
* collective work under U.S. Copyright Law. Your license to use the | ||
* collective work is as provided in your written agreement with | ||
* Cloudera. Used apart from the collective work, this file is | ||
* licensed for your use pursuant to the open source license | ||
* identified above. | ||
* | ||
* This code is provided to you pursuant a written agreement with | ||
* (i) Cloudera, Inc. or (ii) a third-party authorized to distribute | ||
* this code. If you do not have a written agreement with Cloudera nor | ||
* with an authorized and properly licensed third party, you do not | ||
* have any rights to access nor to use this code. | ||
* | ||
* Absent a written agreement with Cloudera, Inc. (“Cloudera”) to the | ||
* contrary, A) CLOUDERA PROVIDES THIS CODE TO YOU WITHOUT WARRANTIES OF ANY | ||
* KIND; (B) CLOUDERA DISCLAIMS ANY AND ALL EXPRESS AND IMPLIED | ||
* WARRANTIES WITH RESPECT TO THIS CODE, INCLUDING BUT NOT LIMITED TO | ||
* IMPLIED WARRANTIES OF TITLE, NON-INFRINGEMENT, MERCHANTABILITY AND | ||
* FITNESS FOR A PARTICULAR PURPOSE; (C) CLOUDERA IS NOT LIABLE TO YOU, | ||
* AND WILL NOT DEFEND, INDEMNIFY, NOR HOLD YOU HARMLESS FOR ANY CLAIMS | ||
* ARISING FROM OR RELATED TO THE CODE; AND (D)WITH RESPECT TO YOUR EXERCISE | ||
* OF ANY RIGHTS GRANTED TO YOU FOR THE CODE, CLOUDERA IS NOT LIABLE FOR ANY | ||
* DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, PUNITIVE OR | ||
* CONSEQUENTIAL DAMAGES INCLUDING, BUT NOT LIMITED TO, DAMAGES | ||
* RELATED TO LOST REVENUE, LOST PROFITS, LOSS OF INCOME, LOSS OF | ||
* BUSINESS ADVANTAGE OR UNAVAILABILITY, OR LOSS OR CORRUPTION OF | ||
* DATA. | ||
*/ | ||
|
||
SET MODE MYSQL; | ||
|
||
BEGIN; | ||
|
||
ALTER TABLE rag_data_source ADD COLUMN embedding_model varchar(255); | ||
|
||
COMMIT; |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.