Memory-mapped Utility for PECOS XLinear Model #166

Closed

Conversation

weiliw-amz (Contributor) commented Aug 13, 2022

Issue #, if available:
N/A

Description of changes:

This pull request consists of the following 3 commits:

  • Refactor the chunked matrix for PECOS XLinear model inference
    • Concatenated the original chunked matrix's fragmented memory allocations
      • This accommodates the subsequent memory-mapped utility module.
      • This change increases the time cost of building the chunked matrix by 10%–15% for large models (>50 GB), but it is necessary and cannot be avoided.
    • Reduced the memory footprint of building the chunked matrix
  • Memory-mapped utility module
    • An easy-to-use, well-encapsulated tool for dumping/loading arbitrary PECOS models
  • Memory-mapped PECOS XLinear model
    • Greatly reduces loading time.
    • Ideal when a user wants to quickly try a few inferences with a large model without waiting for the full model to load into memory.
    • Also enables inference with large models that cannot fit in memory.

Usage:
The user needs an XLinear model saved on disk (in the original .npz format) and must manually compile it into mmap format by calling compile_mmap_model:

from pecos.xmc.xlinear.model import XLinearModel

# Paths to the existing .npz model and the mmap output location
npz_model_path = "/path/to/xlinear/pecos-models/"
mmap_model_path = "/path/to/xlinear/mmap-models/"

print(f"Compiling mmap model from: {npz_model_path}, will save to: {mmap_model_path}...")
XLinearModel.compile_mmap_model(npz_model_path, mmap_model_path)
print("mmap model saved.")

Then the user can load the memory-mapped model and run inference:

import sys

from pecos.utils import smat_util
from pecos.xmc.xlinear.model import XLinearModel


mmap_model_path = "/path/to/xlinear/mmap-models/"

# Load model
if sys.argv[2] == "--cmmap":
    print("Loading C/C++ mem map model...")
    xlm = XLinearModel.load(mmap_model_path, is_predict_only=True, is_mmap=True)
elif sys.argv[2] == "--cmmap-preload":
    print("Loading C/C++ mem map model pre-loaded...")
    xlm = XLinearModel.load(mmap_model_path, is_predict_only=True, is_mmap=True, pre_load=True)
else:
    # Exit early so xlm is never used uninitialized
    sys.exit(f"Wrong option: {sys.argv[2]}")

# Load test data
Xt = XLinearModel.load_feature_matrix("/test/data/validation/X.npz")
Yt = XLinearModel.load_label_matrix("/test/data/validation/Y.npz")

# Predict and evaluate top-10 metrics
Yt_pred = xlm.predict(Xt)
Yt_pred = Yt_pred.tocsr()
metric = smat_util.Metrics.generate(Yt, Yt_pred, topk=10)
print(metric)

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

weiliw-amz requested a review from rofuyu on August 13, 2022 03:24
weiliw-amz changed the title from "Memory-mapped Utility and PECOS XLinear Model" to "Memory-mapped Utility for PECOS XLinear Model" on Aug 16, 2022
bool b_has_explicit_bias; // Whether or not this chunk has an explicit bias term
index_type nnz_rows; // The number of non-zero rows in this chunk
// Using index_type for struct padding
index_type b_has_explicit_bias; // Whether or not this chunk has an explicit bias term, 0=false
Contributor
Given that this variable is no longer a boolean, we might want to consider removing the "b_" prefix.


shutil.copyfile(path.join(npz_folder, "param.json"), path.join(mmap_folder, "param.json"))

HierarchicalMLModel.compile_mmap_model(
Contributor
Is compile_mmap_model implemented in HierarchicalMLModel? Also, the name npz_folder is kind of misleading.

def xlinear_load_predict_only(
    self,
    folder,
    weight_matrix_type="BINARY_SEARCH_CHUNKED",
    is_mmap=False,
Contributor
Add docstrings for the new kwargs here and in HierarchicalMLModel.load and XLinearModel.load.
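One possible shape for such a docstring (a sketch only; the pre_load parameter and the descriptions are inferred from this PR's usage example above, not taken from the final implementation):

def xlinear_load_predict_only(
    self,
    folder,
    weight_matrix_type="BINARY_SEARCH_CHUNKED",
    is_mmap=False,
    pre_load=False,
):
    """Load a predict-only XLinear model from `folder`.

    Args:
        folder (str): Path to the saved model.
        weight_matrix_type (str): Chunked weight-matrix layout used for inference.
        is_mmap (bool): If True, treat `folder` as a model compiled by
            compile_mmap_model and memory-map its weights instead of reading
            the original .npz files fully into memory. (Assumed semantics,
            based on this PR's description.)
        pre_load (bool): If True, load the memory-mapped data up front rather
            than on demand during prediction. (Assumed semantics, based on the
            "--cmmap-preload" option in the usage example.)
    """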

Contributor
Also, can this be inferred from the saved configs rather than given by the user?

"""
import shutil

shutil.copyfile(path.join(npz_folder, "param.json"), path.join(mmap_folder, "param.json"))
Contributor
Should we add "compiled_format": "memory_map" to param.json so that users need not know the format before calling load?
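A rough sketch of how that could work, which would also address the earlier question about inferring the format from the saved configs (hypothetical helper functions; the key name "compiled_format" is the reviewer's suggestion, not part of this PR):

import json
from os import path

def mark_compiled_format(mmap_folder):
    # Hypothetical: record the compiled format in the copied param.json so
    # that load() can detect it without a user-supplied is_mmap flag.
    param_path = path.join(mmap_folder, "param.json")
    with open(param_path, "r") as fin:
        param = json.load(fin)
    param["compiled_format"] = "memory_map"
    with open(param_path, "w") as fout:
        json.dump(param, fout, indent=2)

def detect_compiled_format(model_folder):
    # Hypothetical: read the marker back at load time; fall back to the
    # original .npz format for models saved before this change.
    with open(path.join(model_folder, "param.json"), "r") as fin:
        param = json.load(fin)
    return param.get("compiled_format", "npz")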

weiliw-amz closed this on Dec 9, 2022
weiliw-amz (Contributor, Author) commented Dec 9, 2022

Will break this PR into a series of PRs to merge:
#192
#189

weiliw-amz deleted the refactor-inference-chunked-matrix branch on September 21, 2023 21:33