
[AutoSchedule] Sparse dense tuning support with custom sketch rule #7313

Merged (36 commits) Mar 6, 2021

Conversation

jcf94 (Contributor) commented Jan 20, 2021

No description provided.

ANSHUMAN87 (Contributor) commented:

Thanks @jcf94 for the PR!
Maybe once the PR is ready, it would be really great if you could share the performance stats of the sparse_dense op with and without Ansor.
Really excited to see those.

```python
density *= i
density /= (K * N)
density = density.value
sparse_prefix = "%s_%d_%d_%d_%d_%d_%.2f_" % (
```
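As an illustration of what this snippet is doing (not the tutorial's actual code; all names and the prefix format below are hypothetical), the density of a sparse workload and a cache-key prefix derived from it could be computed like this:

```python
import numpy as np

# Hypothetical illustration: derive a density value and a cache-key
# prefix from a random sparsity mask (names and format are illustrative).
rng = np.random.default_rng(0)
K, N = 128, 256
mask = rng.random((K, N)) < 0.15      # ~15% of entries are nonzero
nnz = int(mask.sum())

density = nnz / (K * N)               # fraction of nonzero entries
sparse_prefix = "sparse_dense_%d_%d_%d_%.2f_" % (K, N, nnz, density)
```

Note that, as the review thread below points out, a prefix built only from shapes and density cannot distinguish matrices with the same sparsity but different nonzero structure.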
A reviewer (Contributor) commented:
We could run into the case that two matrices have the same sparse_prefix, but different non-zero structure. Will this cause issues? What if one of the matrices has one nonzero per row and the other has one dense row (while maintaining the same sparsity)?

jcf94 (Author) replied:
Though in my tests a schedule seems to have similar performance across different random sparse data, I agree this may still be a potential problem. Unfortunately, I have not figured out a better solution.

tkonolige (Contributor) commented Jan 29, 2021:
You could hash the indptr and indices arrays as these determine the structure. Alternatively you could hash the number of nonzeros per row.

It would be interesting to study if tuning performs the same independent of structure (but for the same sparsity).
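A minimal sketch of this suggestion (a hypothetical helper, not a TVM API): hash the CSR `indptr` and `indices` arrays so that two matrices with equal density but different nonzero placement get different keys.

```python
import hashlib
import numpy as np

def structure_hash(indptr: np.ndarray, indices: np.ndarray) -> str:
    """Hash the CSR structure arrays; matrices with equal density but
    different nonzero placement hash differently. Hypothetical helper."""
    h = hashlib.sha256()
    h.update(np.ascontiguousarray(indptr).tobytes())
    h.update(np.ascontiguousarray(indices).tobytes())
    return h.hexdigest()[:16]

# One nonzero per row vs. one dense row: same nnz and density, different keys.
indptr_a = np.arange(5)                 # rows 0..3 each hold one nonzero
indices_a = np.array([0, 1, 2, 3])
indptr_b = np.array([0, 4, 4, 4, 4])    # row 0 holds all four nonzeros
indices_b = np.array([0, 1, 2, 3])
```

Hashing only the nonzeros-per-row counts, as the alternative suggests, would be cheaper but coarser.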

merrymercy self-assigned this Feb 5, 2021

merrymercy (Member) left a review:

minor style comments

merrymercy (Member) commented Mar 3, 2021:

According to our offline discussion,

  1. Update the type of SearchTaskNode::task_inputs from Map<String, runtime::NDArray> to Array<String>, so NDArrays are stored in only one place; they can be queried from the global table in measure.py.
  2. Remove the SearchTask.AddTaskInput interface to make SearchTask immutable. We never need to update task inputs dynamically, so all arguments can be provided to the constructor.
  3. Make sure the same interface also supports the use case where we want to match the special buffers by name.

jcf94 (Author) commented Mar 3, 2021:

> According to our offline discussion,
>
>   1. Update the type of SearchTaskNode::task_inputs. Change it from Map<String, runtime::NDArray> to Array<String>, so we only need to store nd arrays in one place. We can query it from the global table in measure.py
>   2. Remove SearchTask.AddTaskInput interface to make SearchTask immutable. We do not have the need to dynamically update task inputs, so we can provide all arguments to the constructors.
>   3. Make sure we can use the same interface to support the use case where we want to match the special buffers by name

@comaniac @merrymercy Comments all addressed:
1/2: Removed the add_task_input API; task inputs are now provided only through the constructor, and SearchTask keeps only the name of each special buffer.
3: Added an extra case in measure.py:_prepare_input_map to check the placeholder name, as well as a unit test in test_auto_scheduler_measure.py:test_measure_special_inputs_map_by_name.
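The resulting design can be sketched as follows (illustrative Python, not TVM's actual implementation; all names are simplified stand-ins): the task stores only buffer names, and the array payloads live in a single global table keyed by workload key and input name.

```python
# Illustrative sketch: SearchTask keeps only buffer *names*; the array
# payloads live in one global table keyed by workload key and input name.
TASK_INPUT_BUFFER_TABLE = {}

def register_task_input_buffer(workload_key, input_name, buffer):
    # Store an input buffer in the single global table.
    TASK_INPUT_BUFFER_TABLE.setdefault(workload_key, {})[input_name] = buffer

def get_task_input_buffer(workload_key, input_name):
    # Look a buffer up by the name the task carries.
    return TASK_INPUT_BUFFER_TABLE[workload_key][input_name]

class SearchTask:
    def __init__(self, workload_key, task_input_names=()):
        # Immutable: every special input is named at construction time;
        # there is no add_task_input-style mutator.
        self.workload_key = workload_key
        self.task_inputs = tuple(task_input_names)

# Usage: register the special buffers once, then create the task.
register_task_input_buffer("wkl0", "W_data", [1.0, 2.0])
task = SearchTask("wkl0", task_input_names=["W_data"])
```

Keeping names in the task and payloads in one table avoids storing NDArrays in two places and keeps the task object cheap to serialize.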

tkonolige (Contributor) left a review:
I just want to echo my earlier concerns about special-casing sparse inputs instead of having a generic mechanism for detecting special task inputs.

```python
tensor_input_map[arg] = arg.op.name

# Case 1: Check sparse op
sparse_input_map = topi.nn.sparse.try_get_sparse_input(args)
```
A reviewer (Contributor) commented:
I think I asked this before, but can we have a more general mechanism than checking only for sparse ops? There are other use cases that require specific inputs (sorting, scatter).

jcf94 (Author) commented Mar 4, 2021:

Yeah, I've also had some discussions in our weekly sync but didn't figure out a better solution.
There are several reasons:

  1. Different ops have different requirements for their specific inputs;
  2. When the problem appears in a subgraph generated by the Relay integration, the placeholders all look the same: we cannot differentiate them by tag, name, or any other attribute, and even the order of inputs is not guaranteed.

The current approach merges all specific-input checking into this function, so at least they share a single entry point. Other ops will have to add their own check functions below.

jcf94 (Author) added:
By the way, my colleague is going to add Ansor support for sparse_conv2d. We'll add an extra check to this entry first and see if there's a better way to merge them.

A reviewer (Contributor) commented:
Could we associate the lookup mechanism with @register_workload? It would at least be extensible then.

jcf94 (Author) replied:

> Could we associate the lookup mechanism with @register_workload? It would at least be extensible then.

Thanks! This is a pretty good idea; I'll give it a try.

jcf94 requested a review from comaniac on March 4, 2021.
jcf94 (Author) commented Mar 5, 2021:

@merrymercy @comaniac @tkonolige Thanks! Comments have all been addressed.
Additionally, I added @auto_scheduler.register_task_input_check_func, so extra input check functions can now be added more easily.
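The registration mechanism described above can be sketched like this (illustrative Python mirroring the decorator pattern, not TVM's exact code; the name-prefix checker is hypothetical):

```python
# Illustrative registry of task-input check functions: each op kind
# registers a checker, and one entry point runs them all.
TASK_INPUT_CHECK_FUNCS = []

def register_task_input_check_func(func):
    """Decorator: register a checker that maps input tensors to
    special-buffer names."""
    TASK_INPUT_CHECK_FUNCS.append(func)
    return func

@register_task_input_check_func
def try_get_sparse_input(args):
    # Hypothetical sparse checker: here we simply match by name prefix.
    return {a: a for a in args if a.startswith("sparse_")}

def prepare_input_map(args):
    # Single entry point: every registered op-specific checker runs here,
    # so new ops (e.g. sparse_conv2d) only need to register a function.
    tensor_input_map = {}
    for check_func in TASK_INPUT_CHECK_FUNCS:
        tensor_input_map.update(check_func(args))
    return tensor_input_map
```

This keeps the per-op knowledge out of the core, addressing the extensibility concern raised earlier in the thread.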

merrymercy dismissed comaniac's stale review on March 6, 2021 ("comments are addressed").

merrymercy merged commit 0b4f669 into apache:main on Mar 6, 2021.
trevor-m pushed a commit to trevor-m/tvm that referenced this pull request May 6, 2021
…pache#7313)

* Add sparse dense tuning tutorial
* Add sparse input fusion
* Update the dag to support output fusion
* Add task input to search_task
* Add search_inputs to measure
* Add file save load support
* Remove add_task_inputs API
* Add example ci_log
* retrigger ci
* (plus assorted "Update" and "Lint fix" commits)
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request May 11, 2021
7 participants