Model Cache Management Features #239

wanliAlex · 2022-12-16T06:31:04Z

What kind of change does this PR introduce? (Bug fix, feature, docs update, ...)
features
What is the current behavior? (You can also link to an open issue here)
users can not view the loaded models and eject a model from the client
What is the new behavior (if this is a feature change)?
users can now view loaded models, eject models, view cuda information from the client
Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)
no
Have unit tests been run against this PR? (Has there also been any additional testing?)
not yet
Related Python client changes (link commit/PR here)
will link later
Related documentation changes (link commit/PR here)
will link later
Other information:
this PR is related to the issue
Please check if the PR fulfills these requirements

The commit message follows our guidelines
Tests for the changes have been added (for bug fixes/features)
Docs have been added / updated (for bug fixes / features)

pandu-k · 2022-12-19T01:49:43Z

Does this PR introduce a breaking change? (What changes might users need to make in their application due to this PR?)

FYI
Adding new API endpoints is not a breaking change. A breaking change is updating an existing endpoint such that it is no longer backwards compatible (so existing users have to update their code to use the new version).

An example of a breaking change is when the add_documents batch_size parameter was replaced by server_batch_size and client_batch_size parameters. Users wanting to use the new py-marqo version would have to update their add_documents calls

wanliAlex · 2022-12-19T01:52:27Z

Thanks for the comment.

pandu-k

Thanks for this PR! Left actionable feedback in the comments.

Further to dos:

Action feedback
Create a series of unit tests. One that I propose is to load and unload a model ~10 times, to see if the cache is actually being unloaded (else the computer runs out of memory). Please include testing for unusual and edge cases.
Create corresponding Py-Marqo methods and a few unit tests
Given this is a new endpoint, create a few API tests. These don't have to be as exhaustive as the unit tests, but should test every endpoint.
- An CUDA API test is proposed: load 2 CUDA models. Unload one. Test to ensure the remaining model is still cached. Test to ensure the unloaded model is indeed unloaded. Create API tests here. CUDA test classes are marked by this decorator. This ensures that the API test only runs on the CUDA test environment.

src/marqo/tensor_search/api.py

src/marqo/errors.py

src/marqo/s2_inference/errors.py

src/marqo/s2_inference/s2_inference.py

src/marqo/tensor_search/tensor_search.py

pandu-k

There are still a few pending questions. Tests are also to be made.

I have updated my review comment

Resolved most conversations.

src/marqo/s2_inference/s2_inference.py

pandu-k · 2023-01-04T00:25:10Z

src/marqo/tensor_search/tensor_search.py

+
+    else:
+        raise errors.HardwareCompatabilityError(message=str(
+            "ERROR: cuda is not supported in your machine!!"


Can you make the error message stylistically similar to this existing one:

marqo/src/marqo/tensor_search/web/api_validation.py

Line 67 in 590965d

raise HardwareCompatabilityError(message="Requested device is not available to this Marqo instance."

add APIs in marqo

da1fbbe

wanliAlex changed the title ~~Add Model Cache Management Features~~ [draft] Add Model Cache Management Features Dec 16, 2022

test new model cache key

03ca4dd

pandu-k requested changes Dec 19, 2022

View reviewed changes

wanliAlex added 2 commits December 20, 2022 22:07

cleaning

e7e7b09

Add todo

8ad198b

pandu-k reviewed Dec 20, 2022

View reviewed changes

src/marqo/s2_inference/s2_inference.py Show resolved Hide resolved

wanliAlex added 21 commits December 21, 2022 12:08

add multi-gpu support

d8a4fdd

add multi-gpu support

db30bcb

space adding

f0a801b

adding cpu usage, RAM usage api

e0f14a2

adding cpu usage, RAM usage api

186180a

adding cpu usage, RAM usage api

2c77fa7

adding cpu usage, RAM usage api

d88b590

revert back model cache key

a95d030

revert back model cache key

ba97a44

add test_eject_model test

2457920

add test_eject_model test

856b5f9

add test_eject_model test

3df97f0

add test_eject_model test

e3d3bec

add test_eject_model test

b6122aa

add test_eject_model test

1132cc5

add test_eject_model test

fd69a42

add test_eject_model test

fe2f5fa

add test_eject_model test

4b515c2

add test_eject_model test

7aa1a06

add test_eject_model test

53a6a13

add test_eject_model test

0d5140b

wanliAlex added 4 commits December 30, 2022 09:07

reduce a model for testing stability

9aa15a9

update

bb530f2

update

eab2a75

update

a4f0a42

wanliAlex requested a review from pandu-k January 2, 2023 23:07

pandu-k temporarily deployed to marqo-test-suite January 4, 2023 00:32 — with GitHub Actions Inactive

pandu-k temporarily deployed to marqo-test-suite January 4, 2023 00:33 — with GitHub Actions Inactive

wanliAlex added 9 commits January 5, 2023 16:45

add test for generic model

670fcee

add test for generic model

2f8c6f0

add test for generic model

4026435

add test for generic model

45c5891

add test for generic model

a782e52

add test for generic model

621df43

add test for generic model

0b10d26

revision

0cf173a

revision

ba6fb8d

wanliAlex temporarily deployed to marqo-test-suite January 5, 2023 06:23 — with GitHub Actions Inactive

wanliAlex temporarily deployed to marqo-test-suite January 5, 2023 06:24 — with GitHub Actions Inactive

pandu-k approved these changes Jan 6, 2023

View reviewed changes

pandu-k merged commit dd37904 into mainline Jan 6, 2023

pandu-k deleted the model-cache-management branch January 6, 2023 00:56

pandu-k mentioned this pull request Jan 11, 2023

[ENHANCEMENT] Model Cache Management #204

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model Cache Management Features #239

Model Cache Management Features #239

wanliAlex commented Dec 16, 2022 •

edited

Loading

pandu-k commented Dec 19, 2022 •

edited

Loading

wanliAlex commented Dec 19, 2022

pandu-k left a comment •

edited

Loading

pandu-k left a comment

pandu-k Jan 4, 2023

Model Cache Management Features #239

Model Cache Management Features #239

Conversation

wanliAlex commented Dec 16, 2022 • edited Loading

pandu-k commented Dec 19, 2022 • edited Loading

wanliAlex commented Dec 19, 2022

pandu-k left a comment • edited Loading

Choose a reason for hiding this comment

pandu-k left a comment

Choose a reason for hiding this comment

pandu-k Jan 4, 2023

Choose a reason for hiding this comment

wanliAlex commented Dec 16, 2022 •

edited

Loading

pandu-k commented Dec 19, 2022 •

edited

Loading

pandu-k left a comment •

edited

Loading