[MetaSchedule] Adding post optimization in MetaSchedule to Improve Scheduling #17104

canesche · 2024-06-17T21:37:54Z

Description

This pull request aims to enhance model optimization by adding post optimization in MetaSchedule. The proposed approach involves the following steps:

Execution of MetaSchedule over an end-to-end model that requires optimization.
Selection of the best implementation identified by MetaSchedule for the given model.
Utilization of Droplet Search to exploit the selected candidate.

By using Droplet Search as a post optimization (Droplet paper), we have been able to reduce the number of trials explored by MetaSchedule while still achieving faster kernel performance. We have observed this improvement on the following architectures: Nvidia A100, Nvidia 3080, AMD x86, and ARM A64FX. The results can be found in this report: bennu paper

Proposed Changes

Integration of Droplet Search as post optimization methodology.
Utilization of Droplet Search to exploit the best candidates identified by MetaSchedule.

Motivation

This pull request introduces an exploitation phase leveraging the coordinate descent algorithm to MetaSchedule. By iteratively refining the best kernel identified by MetaSchedule, we achieve two key benefits:

Reduced Sample Requirements: Coordinate descent search minimizes the number of samples MetaSchedule needs to discover high-performing schedules.
Faster Kernels: The refined kernels exhibit improved execution speed compared to those found by MetaSchedule alone, even when it uses more samples.

Thus, this PR optimizes MetaSchedule along two crucial dimensions: search efficiency and kernel performance.

Testing and Validation

Extensive testing has been conducted to validate the efficacy and performance improvements achieved through the integration of MetaSchedule and Droplet Search. Benchmarking tests have been performed across Nvidia A100, AMD x86, and ARM A64FX architectures to assess the impact on kernel speed and search time reduction compared with 10,000 trials from MetaSchedule execution. These results are available in Section 3 of this manuscript: paper

Additional Notes

This pull request builds upon prior research and experimentation in model optimization. The proposed approach improves end-to-end models across diverse hardware platforms while still reducing MetaSchedule's search time. We welcome the community’s feedback, suggestions, and contributions to further refine and enhance these methodologies.

Thank you.

Sincerely,

Michael Canesche, Gaurav Verma, and Fernando Pereira

…heduling

tqchen · 2025-02-08T15:10:21Z

Sorry for getting late in this. I think this is something that could be potentially valuable and also the change is modularized, so going to merge it in. Thanks @canesche for the PR!

[MetaSchedule] Adding post optimization in MetaSchedule to Improve Sc…

a6c7071

…heduling

canesche force-pushed the main branch from d27c0ea to a6c7071 Compare June 17, 2024 22:08

tqchen approved these changes Feb 8, 2025

View reviewed changes

tqchen merged commit 167795b into apache:main Feb 8, 2025

ysh329 mentioned this pull request Apr 19, 2025

[Release] v0.20.0 Release Candidate Notes #17860

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[MetaSchedule] Adding post optimization in MetaSchedule to Improve Scheduling #17104

[MetaSchedule] Adding post optimization in MetaSchedule to Improve Scheduling #17104

Uh oh!

canesche commented Jun 17, 2024

Uh oh!

tqchen commented Feb 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[MetaSchedule] Adding post optimization in MetaSchedule to Improve Scheduling #17104

[MetaSchedule] Adding post optimization in MetaSchedule to Improve Scheduling #17104

Uh oh!

Conversation

canesche commented Jun 17, 2024

Uh oh!

tqchen commented Feb 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants