[Feature] Added performance testing tool based on the PyTest testing framework #295

you-seesee-you · 2025-10-20T11:54:14Z

Purpose

Performance testing tool based on the PyTest testing framework

Modifications

1. Added tests for UC-related performance metrics, including full throughput and incremental throughput.
2. Added support for a custom PC hit rate.
3. Added support for a custom tokenizer.

Test

Performance test

yuanzhg078 · 2025-10-21T06:46:13Z

test/test_uc_performance

+        case_hit_rate_map: mapping of {case_idx: hit_rate}
+    """
+    print(f"[INFO] {len(test_cases)} test cases in total to be executed")
+    failed_case = []


failed_case is not used in this function.

ygwpz · 2025-10-23T01:49:02Z

Putting this file in the benchmark directory seems better.

yuanzhg078 · 2025-10-24T02:26:15Z

Per the UCM code-repository guidelines, all code comments must be written in English.

Potterluo

log, config, single

Potterluo · 2025-11-03T01:18:50Z

test/requirements.txt

Please add the pip packages you use, along with their versions, to this file so that others who don't have them installed (e.g., pandas, pydantic) can run the tests.
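One way to collect the pinned entries is sketched below. It is only an illustration: the helper name `pinned_requirements` and the package list are assumptions, and the versions come from whatever is installed in the current environment, not from this PR.

```python
from importlib import metadata


def pinned_requirements(packages):
    """Return 'name==version' lines for the given package names."""
    lines = []
    for name in packages:
        try:
            # Look up the version installed in the current environment.
            lines.append(f"{name}=={metadata.version(name)}")
        except metadata.PackageNotFoundError:
            # Not installed here, so the version has to be pinned by hand.
            lines.append(f"# {name}: not installed, pin manually")
    return lines


if __name__ == "__main__":
    # Hypothetical package list; replace with what the tests actually import.
    print("\n".join(pinned_requirements(["pandas", "pydantic", "pytest"])))
```

The printed lines can be pasted into test/requirements.txt after review.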

Potterluo · 2025-11-03T01:22:30Z

test/config.yaml

+  server_url: "http://141.111.32.70:9382"
+  tokenizer_path: "/home/models/QwQ-32B"
+# Performance Test Configuration
+llmperf_test_cases:


Configuration items could be added for this, following the existing implementation of logs and reports, with results stored under timestamped names. (Results can all be placed in the reports directory to avoid too many subdirectories, and they should carry an llmperf flag.)

Parameter names such as max_num_completed_requests and num_concurrent_requests are not descriptive enough; additional descriptions should be added.
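A minimal sketch of both suggestions follows, assuming a dataclass-based config. The class `LlmperfCase`, the helpers `describe` and `report_dir`, the `llmperf_` prefix, and the description strings are all hypothetical; only the two parameter names come from the comment above.

```python
from dataclasses import dataclass, field, fields
from datetime import datetime
from pathlib import Path


@dataclass
class LlmperfCase:
    # Descriptions are assumed wording, attached as field metadata.
    max_num_completed_requests: int = field(
        metadata={"help": "Total number of requests to complete before the case ends"}
    )
    num_concurrent_requests: int = field(
        metadata={"help": "Number of requests kept in flight at the same time"}
    )


def describe(cls):
    """Map each config field name to its description."""
    return {f.name: f.metadata.get("help", "") for f in fields(cls)}


def report_dir(root, now=None):
    """Build a timestamped, llmperf-flagged path under the reports root."""
    now = now or datetime.now()
    return Path(root) / f"llmperf_{now:%Y%m%d_%H%M%S}"
```

For example, `report_dir("reports")` yields a path such as `reports/llmperf_<timestamp>`, which keeps all runs in one directory while still marking them as llmperf results.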

Potterluo · 2025-11-03T01:26:08Z

test/common/llmperf/run_inference.py

+from common.llmperf.utils.utils import reset_prefill_cache
+
+
+def run_test_cases(test_cases, timestamp_dir, model, server_url, tokenizer_path):


The Singleton pattern could be used here so that only one instance is created per program run and the test is executed only once. This prevents repeated assertions from triggering multiple test runs. (Test results can be stored in the instance; see config_utils for reference.)
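The idea could look roughly like the sketch below. `PerfRunCache` and `get_results` are hypothetical names, and this is not how config_utils is implemented; it only shows a singleton that runs a supplied callable once and caches its result.

```python
import threading


class PerfRunCache:
    """Process-wide holder that runs the performance cases at most once."""

    _instance = None
    _lock = threading.Lock()

    def __new__(cls):
        with cls._lock:
            if cls._instance is None:
                # First call: create the single instance and its state.
                cls._instance = super().__new__(cls)
                cls._instance._has_run = False
                cls._instance._results = None
        return cls._instance

    def get_results(self, runner):
        """Call runner() on first use, then return the cached results."""
        with self._lock:
            if not self._has_run:
                self._results = runner()
                self._has_run = True
        return self._results
```

Each assertion would then call `PerfRunCache().get_results(...)` with a callable wrapping `run_test_cases`, so the expensive run happens only on the first call. A session-scoped pytest fixture would likely achieve a similar effect.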

you-seesee-you requested review from Wwwzff, mag1c-h and ygwpz as code owners October 20, 2025 11:54

Performance test

af584ff

Performance test

you-seesee-you force-pushed the develop branch from 858b406 to af584ff Compare October 20, 2025 12:11

yuanzhg078 reviewed Oct 21, 2025

Merge branch 'ModelEngine-Group:develop' into develop

e79a34b

Potterluo reviewed Nov 3, 2025

you-seesee-you changed the title ~~Added performance test~~ [Feature] Added performance testing tool based on the PyTest testing framework Nov 5, 2025

NaganooMei and others added 3 commits November 5, 2025 14:35

[BugFix]fix mtp in ucm (ModelEngine-Group#321)

dc454e0

* fix mtp in ucm

[bugfix] preserve DRAM buffer lifetime to restore inference accuracy (M…

06442f0

…odelEngine-Group#322) * linear buffer for device * check data consistency after embedding

New performance testing tools

4b8b8de

you-seesee-you force-pushed the develop branch from 5a3bc0c to 4b8b8de Compare November 5, 2025 06:37

you-seesee-you requested review from harrisonyhq, hek14 and qyh111 as code owners November 5, 2025 06:37
