Gluon export changing params #19354

samskalicky · 2020-10-15T01:36:04Z

Description

When exporting a model from Gluon, some param values can be modified:
https://github.com/apache/incubator-mxnet/blob/ce1e68260eb3cf9219ae4d59df8bdde7361802ec/python/mxnet/gluon/block.py#L1354
The _reduce function here actually performs a sum/division:
https://github.com/apache/incubator-mxnet/blob/ce1e68260eb3cf9219ae4d59df8bdde7361802ec/python/mxnet/gluon/parameter.py#L406

If the data type is floating point, then the resulting IEEE FP standard calls for a rounding operation. Even though summing by 1 and dividing by 1 should result in the same value, it may be slightly altered.

If specific binary data is encoded within a param, then this operation will modify that data and cause corruption.

Possible solution

I propose checking the length of the block, and if it is 1 then skipping the sum/divide operation.

The text was updated successfully, but these errors were encountered:

samskalicky · 2020-10-15T02:08:51Z

This problem is compounded by the bug in the lambda function used to allocate new tensors for graph passes:
https://github.com/apache/incubator-mxnet/blob/6729cf3bf4edd6837b0feb6417691ce2e00dbcee/src/c_api/c_api.cc#L1417-L1419
where its calling the NDArray constructor:
https://github.com/apache/incubator-mxnet/blob/6729cf3bf4edd6837b0feb6417691ce2e00dbcee/include/mxnet/ndarray.h#L95-L96
and its passing in the dtype to the 3rd argument which is delay_alloc. the 4th argument is actually the dtype and it should be:

NDArray* arr = new NDArray(shape, ctx, false, dtype);

samskalicky · 2020-10-15T02:11:52Z

It further compounds in Gluon where creating the new Parameter doesnt set the dtype explicitly:
https://github.com/apache/incubator-mxnet/blob/6729cf3bf4edd6837b0feb6417691ce2e00dbcee/python/mxnet/gluon/block.py#L1002-L1003
This leads to the default dtype mx_real_t which is float32:
https://github.com/apache/incubator-mxnet/blob/ce1e68260eb3cf9219ae4d59df8bdde7361802ec/python/mxnet/gluon/parameter.py#L106
with this error:

Traceback (most recent call last):
  File "test_subgraph.py", line 155, in <module>
    test("myProp")
  File "test_subgraph.py", line 113, in test
    clear=False, backend_opts={'dedup_subgraph':True})
  File "/home/ubuntu/incubator-mxnet/python/mxnet/gluon/block.py", line 1119, in optimize_for
    self._build_cache(x, *args)
  File "/home/ubuntu/incubator-mxnet/python/mxnet/gluon/block.py", line 1003, in _build_cache
    param._load_init(param_data, args[0].context)
  File "/home/ubuntu/incubator-mxnet/python/mxnet/gluon/parameter.py", line 302, in _load_init
    self.name, str(self.dtype), str(data.dtype))
AssertionError: Failed loading Parameter '_op0_input' from saved params: dtype incompatible expected <class 'numpy.float32'> vs saved <class 'numpy.float16'>. Set cast_dtype=True to cast the dtype of saved params.

Instead we should be creating the Parameter like:

param = Parameter(name, dtype=param_data.dtype)

samskalicky · 2020-10-15T02:25:37Z

FYI @leezu @MoisesHer @Kh4L @mseth10 @rondogency

szha · 2021-02-08T20:14:10Z

@samskalicky thanks for the fix!

samskalicky added Bug needs triage labels Oct 15, 2020

samskalicky mentioned this issue Oct 20, 2020

More extensions fixes #19393

Merged

6 tasks

szha removed the needs triage label Feb 8, 2021

szha closed this as completed Feb 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Gluon export changing params #19354

Gluon export changing params #19354

samskalicky commented Oct 15, 2020

samskalicky commented Oct 15, 2020 •

edited

Loading

samskalicky commented Oct 15, 2020 •

edited

Loading

samskalicky commented Oct 15, 2020 •

edited

Loading

szha commented Feb 8, 2021

Gluon export changing params #19354

Gluon export changing params #19354

Comments

samskalicky commented Oct 15, 2020

Description

Possible solution

samskalicky commented Oct 15, 2020 • edited Loading

samskalicky commented Oct 15, 2020 • edited Loading

samskalicky commented Oct 15, 2020 • edited Loading

szha commented Feb 8, 2021

samskalicky commented Oct 15, 2020 •

edited

Loading

samskalicky commented Oct 15, 2020 •

edited

Loading

samskalicky commented Oct 15, 2020 •

edited

Loading