[TFLite] TFLite FP16 Post Quantization Support #5823

Closed
FrozenGene opened this issue Jun 16, 2020 · 2 comments

Comments

@FrozenGene (Member) commented Jun 16, 2020

TensorFlow Lite now supports converting weights to 16-bit floating point values during model conversion from TensorFlow to TensorFlow Lite's flat buffer format. This results in a 2x reduction in model size.
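
For reference, such a model is typically produced on the TensorFlow side with the converter's fp16 option; a minimal sketch (the `saved_model_dir` path and output filename are placeholders):

```python
import tensorflow as tf

# Post-training fp16 quantization: weights are stored as float16 in the
# flatbuffer, while the graph still computes in float32 at runtime.
converter = tf.lite.TFLiteConverter.from_saved_model("saved_model_dir")
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.target_spec.supported_types = [tf.float16]
tflite_fp16_model = converter.convert()

with open("model_fp16.tflite", "wb") as f:
    f.write(tflite_fp16_model)
```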

However, the conversion inserts new Dequantize ops in front of ops such as Conv2D to convert the fp16 weights back to fp32, like this:

[screenshot: TFLite graph with a Dequantize node turning the fp16 weights into fp32 inputs for Conv2D]

TVM doesn't support this behavior yet. The main things we need to do:

  • Support float16 type inside tflite parser
  • Extend dequantize to support fp16 to fp32
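
A minimal sketch of what the second item amounts to in Relay terms, assuming a float16 DEQUANTIZE is lowered to a plain cast of the weight constant (the names and the exact lowering are illustrative; the actual TFLite frontend change may differ):

```python
import numpy as np
import tvm
from tvm import relay

# Hypothetical illustration: an fp16 weight constant is "dequantized" to fp32
# with a cast before being fed to conv2d, mirroring what TFLite's DEQUANTIZE
# op does for fp16 post-quantized models.
weight_fp16 = relay.const(np.zeros((64, 3, 3, 3), dtype="float16"))
weight_fp32 = relay.cast(weight_fp16, "float32")

data = relay.var("data", shape=(1, 3, 224, 224), dtype="float32")
out = relay.nn.conv2d(data, weight_fp32, kernel_size=(3, 3), channels=64)
mod = tvm.IRModule.from_expr(relay.Function([data], out))
print(mod)
```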

Related issue: #5774

@onkar-sima-ai (Contributor) commented Nov 18, 2021

@FrozenGene Is this issue still open?

@FrozenGene (Member, Author) commented

> @FrozenGene Is this issue still open?

I think #7093 is enough and we can close this now.
