
keep parameter names from PyTorch #5887

Merged 1 commit into apache:master on Jun 22, 2020
Conversation

t-vi (Contributor) commented Jun 22, 2020

This patch uses the PyTorch parameter names as the name hints of the converted variables.
This means that one can load a stored PyTorch state dict and map the required inputs from it even after conversion.
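
For illustration, a minimal sketch of what this enables, using a hypothetical toy module (the exact key names are illustrative, not taken from the patch):

    # Sketch: with this patch, the params returned by relay.frontend.from_pytorch
    # are keyed by the PyTorch parameter names (e.g. "fc.weight" / "fc.bias"),
    # so entries from a saved state dict line up by name. Toy model is hypothetical.
    import torch
    import tvm
    from tvm import relay

    class Toy(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.fc = torch.nn.Linear(4, 2)

        def forward(self, x):
            return self.fc(x)

    scripted = torch.jit.trace(Toy().eval(), torch.randn(1, 4))
    mod, params = relay.frontend.from_pytorch(scripted, [("input", (1, 4))])
    print(sorted(params.keys()))  # expected to match the torch state-dict names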

masahi merged commit 3637164 into apache:master on Jun 22, 2020
masahi (Member) commented Jun 22, 2020

Thanks @t-vi

siju-samuel (Member) commented:

@t-vi Thanks for the PR.
I think this PR has a slight impact in some scenarios.
You can confirm by running the script below before and after your change.

pytorch_pretrained_bert_uncased.py

I think that after this PR, param_tensor and param are not getting assigned properly in some cases.
Please look into this.
Thanks in advance.
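
For context, a hedged sketch of the convert-and-compare pattern such a script runs (it does not reproduce pytorch_pretrained_bert_uncased.py; the model, input names, and dtypes here are assumptions):

    # Hypothetical convert-run-compare check; the final assertion is the
    # one quoted in the traceback further down in this thread.
    import torch
    import tvm
    from tvm import relay
    from tvm.contrib import graph_executor  # named graph_runtime in 2020-era TVM

    def check_against_pytorch(model, example_input):
        scripted = torch.jit.trace(model.eval(), example_input)
        torch_preds = scripted(example_input).detach().numpy()

        mod, params = relay.frontend.from_pytorch(
            scripted, [("input", list(example_input.shape))])
        with tvm.transform.PassContext(opt_level=3):
            lib = relay.build(mod, target="llvm", params=params)

        rt = graph_executor.GraphModule(lib["default"](tvm.cpu()))
        rt.set_input("input", example_input.numpy())
        rt.run()
        compiled_output = rt.get_output(0).numpy()

        tvm.testing.assert_allclose(torch_preds, compiled_output, rtol=1e-3, atol=1e-3)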

t-vi (Contributor, Author) commented Jun 23, 2020

What is the error you are seeing?

siju-samuel (Member) commented:

Output mismatch between PyTorch and TVM.

After this PR:

  File "pytorch_pretrained_bert_uncased.py", line 112, in <module>
    tvm.testing.assert_allclose(torch_preds, compiled_output, rtol=1e-3, atol=1e-3)
  File "/home/siju/workspace/tvm/python/tvm/testing.py", line 36, in assert_allclose
    np.testing.assert_allclose(actual, desired, rtol=rtol, atol=atol, verbose=True)
  File "/home/siju/.local/lib/python3.8/site-packages/numpy/testing/_private/utils.py", line 1532, in assert_allclose
    assert_array_compare(compare, actual, desired, err_msg=str(err_msg),
  File "/home/siju/.local/lib/python3.8/site-packages/numpy/testing/_private/utils.py", line 846, in assert_array_compare
    raise AssertionError(msg)
AssertionError: 
Not equal to tolerance rtol=0.001, atol=0.001

Mismatched elements: 427297 / 427308 (100%)
Max absolute difference: 24.692068
Max relative difference: 196537.89
 x: array([[[ -7.879808,  -7.787371,  -7.786093, ...,  -7.043789,
          -6.745376,  -4.60134 ],
        [-13.363304, -13.769426, -13.781861, ..., -11.81282 ,...
 y: array([[[-0.419126, -0.420205, -0.41907 , ..., -0.789973, -0.782199,
         -0.496477],
        [-0.419126, -0.420205, -0.41907 , ..., -0.789973, -0.782199,...

t-vi (Contributor, Author) commented Jun 23, 2020

I found the problem and will send a fix.
The old code included the embedding weight twice (which arguably is a bug in itself and needs fixing), and the new code deduplicated the param but not the var (which is even worse).
We underappreciate the structure of the PyTorch input...
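
For illustration, a toy sketch of the kind of shared parameter involved (weight tying, as between BERT's embedding and its output projection; this is not the BERT model itself):

    # One tensor reachable under two state-dict keys: a converter must
    # deduplicate the parameter *and* the variable consistently, or the
    # graph ends up with a dangling input.
    import torch

    class Tied(torch.nn.Module):
        def __init__(self):
            super().__init__()
            self.emb = torch.nn.Embedding(10, 4)
            self.out = torch.nn.Linear(4, 10, bias=False)
            self.out.weight = self.emb.weight  # share the storage

        def forward(self, x):
            return self.out(self.emb(x))

    sd = Tied().state_dict()
    print(list(sd.keys()))  # ['emb.weight', 'out.weight']
    print(sd["emb.weight"].data_ptr() == sd["out.weight"].data_ptr())  # True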

t-vi (Contributor, Author) commented Jun 23, 2020

#5897 has the fix.

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Jun 30, 2020
zhiics pushed a commit to neo-ai/tvm that referenced this pull request Jul 2, 2020