[Torchscript] Parallelized Text/Sequence Preprocessing #2206
This PR parallelizes tokenization for sequence and text features. This gives us inference throughput that is better than or equal to vanilla Ludwig model preprocessing.
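As a rough illustration only (not the code in this PR), here is a minimal sketch of how per-feature tokenization can be parallelized in TorchScript with `torch.jit.fork` / `torch.jit.wait`. The `_tokenize` helper and the `ParallelPreprocessor` module are hypothetical stand-ins for the real feature preprocessors:

```python
from typing import Dict, List

import torch


@torch.jit.script
def _tokenize(texts: List[str]) -> List[List[str]]:
    # Placeholder whitespace tokenizer standing in for a real sequence/text tokenizer.
    return [t.split(" ") for t in texts]


class ParallelPreprocessor(torch.nn.Module):
    def forward(self, inputs: Dict[str, List[str]]) -> Dict[str, List[List[str]]]:
        # Launch one asynchronous tokenization task per text/sequence feature.
        names: List[str] = []
        futures: List[torch.jit.Future[List[List[str]]]] = []
        for name, texts in inputs.items():
            names.append(name)
            futures.append(torch.jit.fork(_tokenize, texts))

        # Collect results; TorchScript may run the forked tasks on its inter-op thread pool.
        results: Dict[str, List[List[str]]] = {}
        for i in range(len(names)):
            results[names[i]] = torch.jit.wait(futures[i])
        return results


module = torch.jit.script(ParallelPreprocessor())
print(module({"review": ["great movie", "terrible plot"], "summary": ["watch it"]}))
```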
One thing to note: there were/are strange interactions between `torch.no_grad` and the added parallelism. It seems that `torch.no_grad` affects a global flag that activates/deactivates gradient computation (link), and that this flag is not properly reset after some scripted, parallelized operations.

The workaround rests on the insight that gradients are not computed during preprocessing, so the extraneous `torch.no_grad` statements could be removed (particularly around preprocessing and postprocessing). Applying the `torch.no_grad` context exclusively at the predictor stage of inference is enough to ensure that the module output tensors carry no gradients. Added tests confirm this. That said, we should keep this issue in mind if we decide to introduce parallelism in our ECD architecture.