[Driver] Make compilation more compatible with multi-processing #350 #351

xinli-git · 2023-08-21T02:58:08Z

This change adds a filelock to task compilation so that workflows such as distributed inference only builds the task once and avoids any potential data (file) races.

Currently, only task building is included in the filelock because in general, compiled graphs will be different and compiled modules is already protected by task building.

…del support (hidet-org#347) 1. Enhance support for `__setitem__` and` __getitem__` of Tensor; Add SetStridedSlice Op, Roll Op. 2. Add/Update torch mapping for adaptive_avg_pool3d, eq, pad, roll, matmul, new_zeros, batch_norm, MultiHeadAttention. 3. Update torch Linear mapping to optionally accept transposed weights. 4. Fix a bug where a empty graph will output a zero tensor instead of the input/weight.

…hidet-org#345) Encountered a few minor issues when compiling a transformer-based model using torch.compile with very large batch sizes, submitting the fix here.

This is a continuation of hidet-org#347. 1. Add LP normalization task (ToDo: schedule template) 2. Add torch mappings for normalize, clone, zero_, exp, chunk 3. Add ceil_mode=True support for pool2d 4. Fix dtype issue in resize 5. Fix other bugs in pad, conv2d_pattern

Add an ad-hoc implementation of einsum based on pattern matching. Only supports batched matmul.

…uild can work with spawned processes

xinli-git · 2023-08-23T20:41:10Z

Hi @soodoshll maybe a quick review ?

soodoshll · 2023-08-23T23:32:51Z

@xinli-git LGTM!

… for conv-bert-base model (#351) Added support for `torch.multiply` and `torch.nn.functional.unfold` These ops are needed in `conv-bert-base` models --------- Co-authored-by: Zhumakhan <nazirzhumakhan@gmail,.com>

yaoyaoding and others added 6 commits August 5, 2023 00:07

[Operator] Add clamp/isinf/any/all op, enhance where op (hidet-org#343)

8d755f7

[Dynamo] minor enhancements to attention and register a few functions (…

edb6503

…hidet-org#345) Encountered a few minor issues when compiling a transformer-based model using torch.compile with very large batch sizes, submitting the fix here.

[Operator] Add einsum (hidet-org#349)

d4eadcc

Add an ad-hoc implementation of einsum based on pattern matching. Only supports batched matmul.

add compile lock to build_task, use context to manage imap so hidet b…

c110fc3

…uild can work with spawned processes

xinli-git changed the base branch from main to auto-parallel August 24, 2023 18:17

xinli-git merged commit ab5b738 into hidet-org:auto-parallel Aug 24, 2023

xinli-git deleted the concurrent_task_build branch August 24, 2023 18:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Driver] Make compilation more compatible with multi-processing #350 #351

[Driver] Make compilation more compatible with multi-processing #350 #351

xinli-git commented Aug 21, 2023 •

edited

Loading

xinli-git commented Aug 23, 2023

soodoshll commented Aug 23, 2023

[Driver] Make compilation more compatible with multi-processing #350 #351

[Driver] Make compilation more compatible with multi-processing #350 #351

Conversation

xinli-git commented Aug 21, 2023 • edited Loading

xinli-git commented Aug 23, 2023

soodoshll commented Aug 23, 2023

xinli-git commented Aug 21, 2023 •

edited

Loading