Refactor `keras.dtype_policies` #19711

james77777778 · 2024-05-13T02:46:30Z

EDITED:
Please refer to #19711 (comment) for the new updates.

I think it would be beneficial to give QuantizedDTypePolicy some flexibility with respect to the global dtype policy, keras.config.dtype_policy().

Additionally, DTypePolicy gains a new property, is_quantized, that should be useful for quantization-related methods.

With this PR, we can do the following:

import keras
from keras import dtype_policies
from keras import layers
from keras import models


@keras.saving.register_keras_serializable("MyPackage")
class MySubclass(layers.Layer):
    def __init__(self, **kwargs):
        dtypes = kwargs.pop("dtypes", {})
        super().__init__(**kwargs)
        self.layer = layers.Dense(8, dtype=dtypes.pop("layer", None))

    def call(self, inputs, training=None):
        return self.layer(inputs)

    def get_config(self):
        config = super().get_config()
        config.pop("dtype")
        if self.layer.dtype_policy.is_quantized:
            _config = dtype_policies.serialize(self.layer.dtype_policy)
            _config["config"]["source_name"] = None
            config.update({"dtypes": {"layer": _config}})
        return config


inputs = layers.Input(shape=[None, 4])
outputs = MySubclass()(inputs)
model = models.Model(inputs, outputs)

"""global dtype policy (float32)"""

model.quantize("int8")
for layer in model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)
model.save("model.keras")

"""global dtype policy (bfloat16)"""

keras.config.set_dtype_policy("bfloat16")
new_model = models.load_model("model.keras")
for layer in new_model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)

Outputs:

# During saving (global dtype policy: float32)
input_layer <FloatDTypePolicy "float32">
my_subclass <FloatDTypePolicy "float32">
dense <QuantizedDTypePolicy "int8_from_float32">

# During loading (global dtype policy: bfloat16)
input_layer <FloatDTypePolicy "bfloat16">
my_subclass <FloatDTypePolicy "bfloat16">
dense_1 <QuantizedDTypePolicy "int8_from_bfloat16">

@mattdangerw has pointed out that the dtype policies stored in quantized saves are currently fixed and do not follow the global dtype policy at load time. keras-team/keras-hub#1612 (comment)
With this PR, a slight modification in get_config is enough to support that feature.

codecov-commenter · 2024-05-13T02:51:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 78.53%. Comparing base (310c275) to head (ecf2523).
Report is 1 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master   #19711   +/-   ##
=======================================
  Coverage   78.52%   78.53%           
=======================================
  Files         498      498           
  Lines       45769    45756   -13     
  Branches     8456     8454    -2     
=======================================
- Hits        35942    35936    -6     
+ Misses       8091     8087    -4     
+ Partials     1736     1733    -3

Flag	Coverage Δ
keras	`78.38% <100.00%> (+<0.01%)`	⬆️
keras-jax	`61.95% <100.00%> (+<0.01%)`	⬆️
keras-numpy	`56.29% <87.93%> (-0.01%)`	⬇️
keras-tensorflow	`63.41% <100.00%> (-0.01%)`	⬇️
keras-torch	`61.99% <100.00%> (-0.01%)`	⬇️

fchollet

Thanks for the PR!

keras/src/dtype_policies/dtype_policy.py

fchollet · 2024-05-13T18:19:46Z

        return f'<FloatDTypePolicy "{self._name}">'


+GLOBAL_DEFAULT_PLACEHOLDER = "global_default"


Please use a more explicit name, e.g. "DEFAULT_DTYPE_POLICY". Why use this string as the initial value, instead of e.g. None?

james77777778 · 2024-05-14

> Why use this string as the initial value, instead of e.g. None?

Currently, DTypePolicy and its subclasses rely on a string value for parsing.
It is not clear to me how we can pass None in combination with the quantization mode.

Should we refactor QuantizedDTypePolicy to support a signature that takes both the quantization mode and the source dtype policy?

Ex:

policy = QuantizedDTypePolicy(mode="int8", source_dtype_policy="mixed_bfloat16")

fchollet · 2024-05-14

> Currently, DTypePolicy and its subclasses rely on a string value for parsing.
> It is not clear to me how we can pass None in combination with the quantization mode.

We could just modify DTypePolicy to support None, meaning "default".

> Should we refactor QuantizedDTypePolicy to support a signature that takes both the quantization mode and the source dtype policy?

Yes, that's a great idea!
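
The signature discussed in this thread can be illustrated with a small, self-contained sketch. It does not use Keras: `ToyQuantizedPolicy`, `set_global_policy_name`, and the `source_name` parameter are hypothetical stand-ins, and the sketch assumes that `None` is resolved to the global default once, when the policy is constructed.

```python
# Toy model of "None means the global default"; not the Keras implementation.
_GLOBAL_POLICY_NAME = "float32"  # stand-in for keras.config.dtype_policy()


def set_global_policy_name(name):
    """Hypothetical helper that changes the toy global dtype policy."""
    global _GLOBAL_POLICY_NAME
    _GLOBAL_POLICY_NAME = name


class ToyQuantizedPolicy:
    """Takes the quantization mode and the source policy as separate arguments."""

    def __init__(self, mode, source_name=None):
        # Assumption: None is resolved at construction time, not lazily.
        if source_name is None:
            source_name = _GLOBAL_POLICY_NAME
        self.mode = mode
        self.source_name = source_name

    @property
    def name(self):
        return f"{self.mode}_from_{self.source_name}"


explicit = ToyQuantizedPolicy("int8", source_name="mixed_bfloat16")
follows_float32 = ToyQuantizedPolicy("int8")

set_global_policy_name("bfloat16")
follows_bfloat16 = ToyQuantizedPolicy("int8")

print(explicit.name)
print(follows_float32.name)
print(follows_bfloat16.name)
```

Keeping the mode and the source policy as separate arguments means no sentinel string is needed to express "use whatever the global policy is".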

james77777778 · 2024-05-15T03:24:22Z

I've significantly refactored keras.dtype_policies.

Some notes:

Replicate all methods from FloatDTypePolicy to DTypePolicy so that FloatDTypePolicy becomes an alias for DTypePolicy. The reason is that the overridden __new__ in DTypePolicy caused numerous issues, and addressing them would introduce unnecessary complexity.
Introduce a new signature for QuantizedDTypePolicy and QuantizedFloat8DTypePolicy.
Utilize dtype_policies.serialize in get_config of keras.layers.Layer. This is required because we now use different signatures for different dtype policies.
Update the tests.
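
The third note can be illustrated with a self-contained sketch of why a bare name string stops being enough once policies have different constructor signatures. The classes and the `serialize`/`deserialize` helpers below are hypothetical toys, and the `class_name`/`config` layout is an assumption modeled on the `_config["config"]["source_name"]` access in the example above, not the exact Keras format.

```python
class ToyPolicy:
    """Toy float policy: constructed from a single name."""

    def __init__(self, name):
        self.name = name

    def get_config(self):
        return {"name": self.name}


class ToyQuantizedPolicy:
    """Toy quantized policy: constructed from a mode and a source name."""

    def __init__(self, mode, source_name):
        self.mode = mode
        self.source_name = source_name

    def get_config(self):
        return {"mode": self.mode, "source_name": self.source_name}


# Maps a class name back to the class so that the right signature is used.
_REGISTRY = {cls.__name__: cls for cls in (ToyPolicy, ToyQuantizedPolicy)}


def serialize(policy):
    """Record both the class and its constructor arguments."""
    return {"class_name": type(policy).__name__, "config": policy.get_config()}


def deserialize(data):
    """Rebuild a policy by calling the recorded class with its own arguments."""
    cls = _REGISTRY[data["class_name"]]
    return cls(**data["config"])


saved = serialize(ToyQuantizedPolicy("int8", "float32"))
restored = deserialize(saved)
print(saved)
print(type(restored).__name__, restored.mode, restored.source_name)
```

Because each class contributes its own `config`, the loader never has to guess which arguments a given policy expects.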

Incompatibility warning:

We can still use something like "int8_from_float32" in keras.dtype_policies.get, but it can no longer be passed to QuantizedDTypePolicy and QuantizedFloat8DTypePolicy.
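
As an illustration of the legacy string form, the sketch below splits a name such as "int8_from_float32" into a mode and a source policy name. `split_legacy_name` is a hypothetical helper written for this example; it is not the parsing code used by keras.dtype_policies.get.

```python
def split_legacy_name(name):
    """Split "<mode>_from_<source>" into a (mode, source_name) pair."""
    mode, separator, source_name = name.partition("_from_")
    if not separator or not mode or not source_name:
        raise ValueError(f"Not a '<mode>_from_<source>' name: {name!r}")
    return mode, source_name


print(split_legacy_name("int8_from_float32"))
print(split_legacy_name("int8_from_mixed_bfloat16"))
```

The resulting pair corresponds to the two-argument form discussed in the review: a quantization mode plus a source policy.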

To add flexibility to quantized dtype policy:

import keras
from keras import dtype_policies
from keras import layers
from keras import models


@keras.saving.register_keras_serializable("MyPackage")
class MySubclass(layers.Layer):
    def __init__(self, **kwargs):
        dtypes = kwargs.pop("dtypes", {})
        super().__init__(**kwargs)
        self.layer = layers.Dense(8, dtype=dtypes.pop("layer", None))

    def call(self, inputs, training=None):
        return self.layer(inputs)

    def get_config(self):
        config = super().get_config()
        config.pop("dtype")
        if self.layer.dtype_policy.is_quantized:
            _config = dtype_policies.serialize(self.layer.dtype_policy)
            _config["config"]["source_name"] = None
            config.update({"dtypes": {"layer": _config}})
        return config


inputs = layers.Input(shape=[None, 4])
outputs = MySubclass()(inputs)
model = models.Model(inputs, outputs)

"""global dtype policy (float32)"""

model.quantize("int8")
for layer in model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)
model.save("model.keras")

"""global dtype policy (bfloat16)"""

keras.config.set_dtype_policy("bfloat16")
new_model = models.load_model("model.keras")
for layer in new_model._flatten_layers(include_self=False, recursive=True):
    print(layer.name, layer.dtype_policy)

The outputs:

# global dtype policy: float32
input_layer <FloatDTypePolicy "float32">
my_subclass <FloatDTypePolicy "float32">
dense <QuantizedDTypePolicy "int8_from_float32">

# global dtype policy: bfloat16
input_layer <FloatDTypePolicy "bfloat16">
my_subclass <FloatDTypePolicy "bfloat16">
dense_1 <QuantizedDTypePolicy "int8_from_bfloat16">

fchollet

Nice work -- it's definitely cleaner this way! LGTM

Keras' output format was slightly changed in keras-team/keras#19711; in some cases dtypes will now be exported as a config map instead of just a string. This fixes test breakages when using ToT keras.

Keras' output format was slightly changed in keras-team/keras#19711; for non-input layers dtypes will now be exported as a config map instead of just a string. This fixes test breakages when using ToT keras.

Keras' output format was slightly changed in keras-team/keras#19711; for non-input layers dtypes will now be exported as a config map instead of just a string. This fixes test breakages when using ToT keras. Alternative to #6855
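
A downstream reader of saved layer configs may therefore need to accept both shapes. The following is a minimal sketch, not the actual TensorBoard fix: `dtype_name_from_config` is a hypothetical helper, and the `config`, `name`, `mode`, and `source_name` keys of the map are assumptions about the exported layout.

```python
def dtype_name_from_config(dtype_entry):
    """Return a dtype policy name from either a string or a config map."""
    # Older exports: the dtype is already a plain string (or missing).
    if dtype_entry is None or isinstance(dtype_entry, str):
        return dtype_entry
    # Newer exports (assumed layout): a map with a nested "config" map.
    config = dtype_entry.get("config", {})
    if "name" in config:
        return config["name"]
    if "mode" in config:
        source = config.get("source_name") or "<global default>"
        return f"{config['mode']}_from_{source}"
    raise ValueError(f"Unrecognized dtype entry: {dtype_entry!r}")


print(dtype_name_from_config("float32"))
print(dtype_name_from_config({"class_name": "DTypePolicy", "config": {"name": "float32"}}))
print(
    dtype_name_from_config(
        {"class_name": "QuantizedDTypePolicy", "config": {"mode": "int8", "source_name": None}}
    )
)
```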

Original PR #19711 by james77777778 Original: keras-team/keras#19711

Merged from original PR #19711 Original: keras-team/keras#19711

james77777778 added 3 commits May 13, 2024 09:40

Add flexibility to QuantizedDTypePolicy

8dd90ff

Add is_quantized_dtype_policy

9208b57

Update layers

fd3a75b

google-ml-butler bot added the size:M label May 13, 2024

google-ml-butler bot assigned gbaned May 13, 2024

fchollet reviewed May 13, 2024

james77777778 added 2 commits May 14, 2024 10:20

Address comments

59300ed

Refactor keras.dtype_policies

e5d1320

james77777778 changed the title ~~Add flexibility to QuantizedDTypePolicy~~ Refactor keras.dtype_policies May 15, 2024

Update unit tests

ae08b69

james77777778 added 2 commits May 15, 2024 11:25

Update comments

5e5bdad

Update tests

d73c54d

james77777778 requested a review from fchollet May 15, 2024 04:44

google-ml-butler bot added the awaiting review label May 15, 2024

Update tests

ecf2523

fchollet approved these changes May 15, 2024

google-ml-butler bot added kokoro:force-run ready to pull Ready to be merged into the codebase labels May 15, 2024

fchollet merged commit 3105247 into keras-team:master May 15, 2024

kokoro-team removed the kokoro:force-run label May 15, 2024

google-ml-butler bot removed awaiting review ready to pull Ready to be merged into the codebase labels May 15, 2024

james77777778 deleted the flexible-quantized-dtype branch May 16, 2024 00:33

mloc mentioned this pull request May 21, 2024

Fix keras dtype importing and unpin for CI tensorflow/tensorboard#6857

Merged

ryantqiu pushed a commit to snorkel-marlin-repos/keras-team_keras_pr_19711_866835b0-27a7-4ce2-beb3-0c6277437083 that referenced this pull request Oct 1, 2025

Refactor keras.dtype_policies

2d343ce

Original PR #19711 by james77777778 Original: keras-team/keras#19711

ryantqiu mentioned this pull request Oct 1, 2025

Refactor keras.dtype_policies snorkel-marlin-repos/keras-team_keras_pr_19711_866835b0-27a7-4ce2-beb3-0c6277437083#1

Merged

ryantqiu added a commit to snorkel-marlin-repos/keras-team_keras_pr_19711_866835b0-27a7-4ce2-beb3-0c6277437083 that referenced this pull request Oct 1, 2025

Merge pull request #1: Refactor keras.dtype_policies

841a298

Merged from original PR #19711 Original: keras-team/keras#19711

ryantqiu pushed a commit to snorkel-marlin-repos/keras-team_keras_pr_19711_971b5588-a6fe-44a6-a122-91c039644f00 that referenced this pull request Oct 2, 2025

Refactor keras.dtype_policies

0a07b58

Original PR #19711 by james77777778 Original: keras-team/keras#19711

ryantqiu mentioned this pull request Oct 2, 2025

Refactor keras.dtype_policies snorkel-marlin-repos/keras-team_keras_pr_19711_971b5588-a6fe-44a6-a122-91c039644f00#1

Merged

ryantqiu added a commit to snorkel-marlin-repos/keras-team_keras_pr_19711_971b5588-a6fe-44a6-a122-91c039644f00 that referenced this pull request Oct 2, 2025

Merge pull request #1: Refactor keras.dtype_policies

7dc9217

Merged from original PR #19711 Original: keras-team/keras#19711

5 participants
