Improve to_torch/to_numpy converters #147
Conversation
Maybe a more clever mechanism could be implemented here: if all elements have the same type and shape (except for the first dim), then they can be converted as a whole into a torch.Tensor/np.ndarray; otherwise each element is handled separately. What do you guys think? @youkaichao @Trinkle23897
I'm ok with this. If they finally get into a Batch, they will be converted to a whole array anyway. If not, it makes sense to handle them separately.
Agree with Kaichao.
I think the point is that the previous code returns one merged tensor for …
Exactly, it is a step forward toward supporting the action space.
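To make the proposal above concrete, here is a minimal sketch of the idea under discussion; the helper name and the exact branching are hypothetical illustrations, not the code merged in this PR. The sequence is first viewed as a regular array when all elements share a type and shape; otherwise each element is converted separately.

```python
from numbers import Number

import numpy as np
import torch


def to_torch_sketch(x, dtype=None, device='cpu'):
    """Hypothetical converter: whole-sequence conversion when possible."""
    if isinstance(x, (list, tuple)):
        try:
            arr = np.asanyarray(x)
        except ValueError:
            arr = None  # ragged shapes: newer numpy raises here
        if arr is not None and arr.dtype != np.object_:
            # all elements share type and shape -> one tensor
            return to_torch_sketch(arr, dtype, device)
        # otherwise handle each element separately
        return [to_torch_sketch(e, dtype, device) for e in x]
    if isinstance(x, (Number, np.number, np.bool_)):
        return to_torch_sketch(np.asanyarray(x), dtype, device)
    if isinstance(x, np.ndarray):
        t = torch.from_numpy(x).to(device)
        return t if dtype is None else t.type(dtype)
    return x
```

With this sketch, `to_torch_sketch([np.zeros(3), np.zeros(3)])` would yield a single 2x3 tensor, while `to_torch_sketch([np.zeros(3), np.zeros(4)])` would fall back to a list of two tensors.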
A bug:

```
In [3]: d = np.zeros([3, 6, 6])

In [4]: to_torch(d[[]])

~/github/tianshou-new/tianshou/data/utils.py in to_torch(x, dtype, device)
     41         x = to_torch(np.asanyarray(x), dtype, device)
     42     elif isinstance(x, np.ndarray) and \
---> 43             isinstance(x.item(0), (np.number, np.bool_, Number)):
     44         x = torch.from_numpy(x).to(device)
     45     if dtype is not None:

IndexError: index 0 is out of bounds for size 0
```

However:

```
In [6]: to_torch(Batch(d=d)[[]])
Out[6]:
Batch(
    d: tensor([], size=(0, 6, 6), dtype=torch.float64),
)
```

So, do not use `x.item(0)` here.
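For reference, one way to avoid touching the first element at all is to inspect the array's dtype, which is well defined even for empty arrays. This is only a sketch of the idea (with a hypothetical helper name), not necessarily the exact check used in the merged code:

```python
import numpy as np
import torch


def has_numeric_dtype(x: np.ndarray) -> bool:
    # dtype-based test: safe for empty arrays, unlike x.item(0)
    return issubclass(x.dtype.type, (np.number, np.bool_))


d = np.zeros([3, 6, 6])
empty = d[[]]                      # shape (0, 6, 6)
if has_numeric_dtype(empty):
    t = torch.from_numpy(empty)    # tensor([], size=(0, 6, 6), dtype=torch.float64)
```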
Yes, I noticed this. Thank you!
So could you please add this to the test cases?
Sure! On it.
Ready for review.
Personally, I don't think this is the best way to support gym.spaces.Tuple in either the observation space or the action space. A nice workaround is to wrap the original environment to return a dict state and accept a dict action, which is natively supported by tianshou.data.Batch (a minimal wrapper sketch is given after this comment). Take the observation space as an example: suppose an environment returns a tuple observation with two items, one an image observation with shape [224, 224, 3], the other a vector observation with shape [10]. By writing an environment wrapper, the observation becomes a dict:

```python
for i in range(100):
    buffer.add(obs={'img': np.zeros((224, 224, 3)), 'vec': np.zeros((10,))})
buffer.obs.img  # tensor of shape [100, 224, 224, 3], ready to feed into the neural network policy
buffer.obs.vec  # tensor of shape [100, 10], ready to feed into the neural network policy
```

But if we support tuple observations as this PR does, the observation in the buffer is a 2-D object array. The policy has to unpack the observation array and construct separate tensors for the image and vector observations.

```python
for i in range(100):
    buffer.add(obs=[np.zeros((224, 224, 3)), np.zeros((10,))])
buffer.obs  # array of shape [100, 2], with data type np.object
img = np.stack(buffer.obs[:, 0])  # user has to do this explicitly!
vec = np.stack(buffer.obs[:, 1])  # user has to do this explicitly!
```

In addition, slicing only creates a view that shares memory, while stacking copies the data:
```
a = np.zeros((3, 4))

a
Out[104]:
array([[0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])

b = a[:, 0]  # sliced objects share memory
b[0] = 1

In [107]: a
Out[107]:
array([[1., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])

d = np.stack([a, a])  # stacking objects requires additional memory cost
d
Out[109]:
array([[[1., 0., 0., 0.],
        [0., 0., 0., 0.],
        [0., 0., 0., 0.]],

       [[1., 0., 0., 0.],
        [0., 0., 0., 0.],
        [0., 0., 0., 0.]]])

d[:] = 0
d
Out[111]:
array([[[0., 0., 0., 0.],
        [0., 0., 0., 0.],
        [0., 0., 0., 0.]],

       [[0., 0., 0., 0.],
        [0., 0., 0., 0.],
        [0., 0., 0., 0.]]])

a
Out[112]:
array([[1., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.]])
```
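As a sketch of the wrapper-based workaround described above (class name, dict keys, and the two-element tuple layout are assumptions for illustration, not part of this PR):

```python
import gym
import numpy as np


class TupleToDictObs(gym.ObservationWrapper):
    """Hypothetical wrapper: expose an (image, vector) tuple observation as a dict."""

    def __init__(self, env):
        super().__init__(env)
        img_space, vec_space = env.observation_space.spaces
        self.observation_space = gym.spaces.Dict({'img': img_space, 'vec': vec_space})

    def observation(self, obs):
        img, vec = obs
        return {'img': np.asarray(img), 'vec': np.asarray(vec)}
```

A dict observation produced this way can go straight into Batch and the replay buffer as shown above, without creating object arrays.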
@youkaichao yes, I know that. That's why I'm not planning to implement anything else regarding tuple support; Batch is obviously not designed for that at this point. As I said, it is just a step forward. Currently, it only consists of increasing the versatility of the generic helpers, which are not intended to be used only internally in conjunction with Batch.
Nice work👍
This case can bypass the check:

```
In [10]: d = Batch(a=[1, np.zeros([3, 3]), np.zeros([3, 3]), torch.zeros(3, 3)])

In [11]: d
Out[11]:
Batch(
    a: array([1, array([[0., 0., 0.],
                        [0., 0., 0.],
                        [0., 0., 0.]]),
              array([[0., 0., 0.],
                     [0., 0., 0.],
                     [0., 0., 0.]]),
              tensor([[0., 0., 0.],
                      [0., 0., 0.],
                      [0., 0., 0.]])], dtype=object),
)

In [12]: Batch.cat([d, d])
Out[12]:
Batch(
    a: array([1, array([[0., 0., 0.],
                        [0., 0., 0.],
                        [0., 0., 0.]]),
              array([[0., 0., 0.],
                     [0., 0., 0.],
                     [0., 0., 0.]]),
              tensor([[0., 0., 0.],
                      [0., 0., 0.],
                      [0., 0., 0.]]),
              1, array([[0., 0., 0.],
                        [0., 0., 0.],
                        [0., 0., 0.]]),
              array([[0., 0., 0.],
                     [0., 0., 0.],
                     [0., 0., 0.]]),
              tensor([[0., 0., 0.],
                      [0., 0., 0.],
                      [0., 0., 0.]])], dtype=object),
)
```

I'm not sure whether this case matters enough, though; I think it is too much of a corner case.
Indeed, I thought it was forbidden. I will fix it and add a test case.
It is fine now.
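For context, the commit history below mentions that numpy arrays of torch tensors end up being forbidden. A rough sketch of that kind of guard (hypothetical helper name; the merged implementation lives in Batch's value parser, not in a standalone function):

```python
import numpy as np
import torch


def reject_tensors_in_object_array(arr: np.ndarray) -> np.ndarray:
    # Hypothetical guard: object arrays mixing torch tensors with other values
    # cannot be merged into one consistent array, so refuse them early.
    if arr.dtype == np.object_ and \
            any(isinstance(v, torch.Tensor) for v in arr.reshape(-1)):
        raise ValueError(
            "Numpy arrays of torch tensors are not supported; "
            "convert all elements to numpy or to torch first.")
    return arr
```

With such a guard in the value parser, the mixed example above would raise immediately instead of silently producing an object array.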
I think `list` and `tuple` should also be handled by `.to_numpy`.
Indeed. Fixed. Added unit test.
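A hedged sketch of the corresponding to_numpy behaviour, mirroring the to_torch sketch earlier (hypothetical helper name, not the exact merged code):

```python
import numpy as np
import torch


def to_numpy_sketch(x):
    """Hypothetical converter: lists/tuples become one ndarray when homogeneous."""
    if isinstance(x, torch.Tensor):
        return x.detach().cpu().numpy()
    if isinstance(x, (list, tuple)):
        converted = [to_numpy_sketch(e) for e in x]
        try:
            return np.stack(converted)   # homogeneous -> one ndarray
        except ValueError:
            return converted             # ragged -> keep per-element results
    return np.asanyarray(x)
```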
* Enable converting list/tuple back and forth from/to numpy/torch.
* Add fallbacks.
* Fix PEP8.
* Update unit tests.
* Type annotation. Robust dtype check.
* Lists of objects are converted individually, as a single tensor otherwise.
* Improve robustness of _to_array_with_correct_type.
* Add unit tests.
* Do not catch exceptions at the _to_array_with_correct_type level.
* Use _parse_value.
* Fix PEP8.
* Fix _parse_value list output type fallback.
* Catch torch exception.
* Do not convert torch tensor during fallback.
* Improve unit tests.
* Add unit tests.
* Fix missing import.
* Remove support of numpy arrays of tensors in the Batch value parser.
* Forbid numpy arrays of tensors.
* Fix PEP8.
* Fix comment.
* Reduce _parse_value branch number.
* Fix None value.
* Forward error message for debugging purposes.
* Fix _is_scalar.
* More specific try/catch blocks.
* Fix exception chaining.
* Fix PEP8.
* Fix _is_scalar.
* Fix missing corner case.
* Fix PEP8.
* Allow Batch empty key.
* Fix multi-dim array datatype check.

Co-authored-by: Alexis Duburcq <alexis.duburcq@wandercraft.eu>