Updated to allow the selection of GPU for embedding where there is more than one available #1734
base: main
Conversation
Updated to allow the selection of GPU for embedding where there is more than one available. Defaults to cuda[0] or cpu if cuda is not available. Commented reference in settings.yaml under embedding.
I'm not so sure about this one. May be too specific to Nvidia setups.
@@ -7,7 +7,7 @@
from private_gpt.settings.settings import Settings

logger = logging.getLogger(__name__)

import torch
I'd move this to the `try` block within the "huggingface" case. There is no "torch" general dependency declared in pyproject.toml, so this could break the whole execution for people not using huggingface. Actually, we may need to add torch to `embeddings-huggingface = ["llama-index-embeddings-huggingface"]` as
# Optional Huggingface related dependency
torch = {version = "^2.2.1", optional = true}
embeddings-huggingface = ["torch", "llama-index-embeddings-huggingface"]
in pyproject.toml.
I think the huggingface package from llamaindex already depends on torch, but given we are now importing it explicitly we should also depend on it.
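For concreteness, a minimal sketch of that suggestion (the try/except and error message are taken from the existing diff context below; the exact placement in the file is an assumption):

```python
# Sketch: import torch only inside the huggingface branch, so users who did not
# install the optional extra are unaffected.
try:
    import torch  # optional, shipped with the embeddings-huggingface extra
    from llama_index.embeddings.huggingface import HuggingFaceEmbedding
except ImportError as e:
    raise ImportError(
        "Local dependencies not found, install with `poetry install --extras embeddings-huggingface`"
    ) from e
```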
    device = torch.device("cuda:0")
else:
    # If CUDA is not available, use CPU
    device = torch.device("cpu")
What happens with laptops using a GPU that is not Nvidia based? For example, a MacBook running a Metal GPU? Will this make embedding slower by forcing them to go to the CPU?
This logic looks similar to llama_index.core.utils.infer_torch_device, which handles Metal (mps).
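For reference, device inference in that spirit (a sketch, not the actual llama-index implementation) looks roughly like this:

```python
import torch

def infer_device() -> str:
    """Pick the best available torch device: CUDA, then Apple Metal (mps), then CPU."""
    if torch.cuda.is_available():
        return "cuda"
    if torch.backends.mps.is_available():  # Apple Silicon / Metal GPUs
        return "mps"
    return "cpu"
```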
settings.yaml (Outdated)
@@ -54,6 +54,7 @@ embedding:
  # Should be matching the value above in most cases
  mode: huggingface
  ingest_mode: simple
  # gpu: cuda[0] # if you have more than one GPU and you want to select another. defaults to cuda[0], or cpu if cuda not available
You'd need to include this new setting in settings.py.
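Something along these lines would be needed in the embedding settings model (the class and field names here are assumptions mirroring the commented settings.yaml entry above):

```python
# settings.py (sketch) -- expose the new yaml key as a typed, optional field.
from pydantic import BaseModel, Field

class EmbeddingSettings(BaseModel):
    mode: str
    ingest_mode: str = "simple"
    gpu: str | None = Field(
        None,
        description="Torch device to embed on, e.g. 'cuda:1'. Defaults to cuda:0, or cpu if CUDA is unavailable.",
    )
```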
@@ -28,9 +28,33 @@ def __init__(self, settings: Settings) -> None:
        "Local dependencies not found, install with `poetry install --extras embeddings-huggingface`"
    ) from e

    # Get the number of available GPUs
    num_gpus = torch.cuda.device_count()
Adding code to the codebase just to print information is not good practice. I'd remove this whole block of prints.
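A leaner version without the diagnostic prints might look like the sketch below; it assumes the gpu settings field discussed above, that HuggingFaceEmbedding accepts a device argument, and that the model-name setting keeps its existing name (all assumptions, not the actual patch):

```python
# Sketch: choose the device once, no informational prints.
device = settings.embedding.gpu or (  # hypothetical new setting
    "cuda:0" if torch.cuda.is_available() else "cpu"
)
self.embedding_model = HuggingFaceEmbedding(
    model_name=settings.huggingface.embedding_hf_model_name,  # existing setting (assumed)
    device=device,
)
```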