
Make Keras models Pickle-able #10483

Closed · wants to merge 15 commits

Conversation

farizrahman4u
Contributor

@farizrahman4u commented Jun 20, 2018

@maxpumperla
Collaborator

@farizrahman4u cool, I was going to do the same thing. any chance we can do this without introducing about 95% code duplication for get_json_type etc.?

@farizrahman4u
Contributor Author

@fchollet unrelated test seems to be failing?

@Dref360
Contributor

Dref360 commented Jun 20, 2018

The CNTK test failure is a fluke, but the pep8 one is real:

/home/travis/build/keras-team/keras/keras/engine/saving.py:80:13: E126 continuation line over-indented for hanging indent
            'class_name': model.__class__.__name__,
            ^
/home/travis/build/keras-team/keras/keras/engine/saving.py:82:9: E121 continuation line under-indented for hanging indent
        }, default=_get_json_type).encode('utf8')

raise TypeError('Not JSON Serializable:', obj)


def get_model_state(model):
Collaborator

This function would exist exclusively for pickling, so I think it should have "pickle" or some such in the name (e.g. get_model_state_for_pickling).

Additionally, the code seems highly redundant with save_model / load_model. How can we refactor to minimize code duplication?

Contributor Author

@fchollet What about this: we have a common get_model_state()/load_model_from_state() pair, which would be called by pickle_model()/unpickle_model() and by load_model()/save_model()?
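The proposed split might look roughly like this. The function names come from the comment above; the bodies are illustrative assumptions, not actual Keras code:

```python
import pickle

# Sketch only: the bodies of get_model_state/pickle_model are
# assumptions about the proposal, not the real implementation.
def get_model_state(model):
    # Collect everything needed to rebuild the model into plain
    # Python objects (class name, config, weight arrays).
    return {
        'class_name': model.__class__.__name__,
        'config': model.get_config(),
        'weights': model.get_weights(),
    }

def pickle_model(model):
    # __getstate__/__reduce__ would delegate here; save_model could
    # reuse get_model_state too, at the cost of an in-memory copy.
    return pickle.dumps(get_model_state(model))
```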

Collaborator

That sounds good!

Contributor Author

After further thought, that would make the h5py write sub-optimal: we would be creating a full copy of the model weights in memory and writing it all at once to disk. So probably leave save_model as is?

Contributor Author

@fchollet ping ping

@csbrown

csbrown commented Jun 26, 2018

I think it may be possible to use the HDF5 serialization and pickle at the same time. The trick is to serialize the model to HDF5 in memory (which can be done; see the h5py.File options driver and backing_store). Note that the save_model method can accept either a string filename or an h5py.File object. Having initialized the appropriate h5py.File object, pickle is able to serialize this file object. This can be combined with __getstate__ and __setstate__ methods on a base class to perform the appropriate conversions in preparation for pickling.

As a side-benefit of this process, there need not be another materially different serialization method.

Anybody see any pitfalls to this design?

@Dref360 mentioned this pull request Jul 3, 2018
@fchollet
Collaborator

Hi, I think this is a useful feature and we should merge it. What's the status on this PR? Thanks.

@farizrahman4u
Contributor Author

farizrahman4u commented Aug 28, 2018

What's the status on this PR?

It is functional.

You had asked me to refactor it such that we have a single serialization method, which would be called by both model.save() and model.__getstate__(). But the h5py file is written on the fly, and having a common serialization method would make it sub-optimal (a copy of the model would first be created in memory and then written to disk). So I have left it as it is.


@fchollet
Collaborator

# if obj is any numpy type
if type(obj).__module__ == np.__name__:
    if isinstance(obj, np.ndarray):
        return {'type': type(obj),
Collaborator

We've changed this behavior recently; this will need to be updated.

@fchollet
Collaborator

any chance we can do this without introducing about 95% code duplication for get_json_type etc.?

Is there any possibility that using a shared function with a callback structure would work? In one case the callback would write to an HDF5 file; in the other case it would just build a dict.
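One way the callback structure could look. This is a sketch under assumed attribute and method names (weight_names, get_weights, get_config), not actual Keras internals:

```python
# Sketch of the callback idea: a single traversal of model state, with
# the destination abstracted behind a write(key, value) callback.
def serialize_model(model, write):
    write('model_config', model.get_config())
    for layer in model.layers:
        for name, value in zip(layer.weight_names, layer.get_weights()):
            write('weights/' + name, value)

def model_to_dict(model):
    # Pickling path: collect state into a plain dict.
    state = {}
    serialize_model(model, state.__setitem__)
    return state

def model_to_hdf5(model, f):
    # Saving path: stream each array straight into an open h5py.File,
    # avoiding a full in-memory copy of the weights.
    serialize_model(model, lambda key, value: f.create_dataset(key, data=value))
```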

@fchollet
Collaborator

Or we create a dict-like HDF5 file class.
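The dict-like wrapper might be sketched as follows; the class and method shapes here are illustrative assumptions, not the eventual implementation in #11030:

```python
class DictLikeH5:
    """Sketch: one dict-style interface over two backends, so the same
    save/load code can target either a plain dict (for pickling) or an
    h5py File/Group (for writing to disk)."""

    def __init__(self, backend):
        # backend is either a plain dict or an h5py File/Group
        self._backend = backend
        self._is_dict = isinstance(backend, dict)

    def __setitem__(self, key, value):
        if self._is_dict:
            self._backend[key] = value
        else:
            self._backend.create_dataset(key, data=value)

    def __getitem__(self, key):
        value = self._backend[key]
        # h5py datasets need [...] to materialize as in-memory arrays
        return value if self._is_dict else value[...]
```

With this in place, a single save routine can be written against the dict interface and reused by both model.save() and model.__getstate__().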

@farizrahman4u
Contributor Author

@fchollet See #11030 with the dict-like HDF5 file class.

@fchollet
Collaborator

Let's close this PR and move this to #11030.

@namish800

namish800 commented Oct 9, 2018

I am having the same problem. When I try to run model.to_json() (my model contains custom layers), I get this error:

TypeError: can't pickle _thread.RLock objects

Is there any workaround for this problem?

6 participants