-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[PROPOSAL] Merge with opensearch-py? #372
Comments
I've reviewed the repository code, and I feel merging opensearch-py-ml into opensearch-py is feasible. I estimate it would take approximately 3 weeks to 1 month to merge all the code and tests, ensuring everything runs smoothly. |
I think it might be worthwhile to conduct a high-level review of the ML API before merging it in. There are consistency and orthogonality issues that caused me a lot of confusion. Here would be some things to discuss:
my personal preference is for CRUD without repeating the object name, eg |
Merging opensearch-py-ml to opensearch-py will include below steps:
Example PRs from opensearch-dsl-py to opensearch-py merging :
|
|
Thanks for the input @saimedhi. The reason to keep other heavy libraries in py-ml is to maintain a lightweight opensearch-py package, since most of the py-ml libraries are for data analysis. Merging all these libraries into opensearch-py would make it significantly larger and potentially less efficient for users who don't need these specific functionalities. |
How much larger would it make it and how much less efficient? |
I will need some time to analyze the exact size implications and weigh all the pros and cons of merging all libraries. I believe I can work parallel with this and get started on merging the ML APIs first, since everyone is on board with that idea. This way, we can make progress while I'm conducting a thorough evaluation of the full merger's impact. |
CC: @minalsha |
I would suggest keep the scope clear :
|
Is your feature request related to a problem?
Users are confused with the many OpenSearch clients. Is opensearch-py-ml one too many?
The dsl client has a high-level and a low-level interface. For AWS users there’s also boto3 (see https://docs.aws.amazon.com/opensearch-service/latest/developerguide/serverless-sdk.html#serverless-sdk-python), used to create collections. We have opensearch-dsl-py client that has been deprecated since 2.1 and will be archived.
What solution would you like?
Merge opensearch-py-ml into opensearch-py.
What alternatives have you considered?
Leave things as is.
The text was updated successfully, but these errors were encountered: