
Bug fix: Saving problem with TF2 SavedModel fmt in TensorflowTransform class. #5797

Merged (3 commits) on Jun 4, 2021

Conversation

@darth-vader-lg (Contributor) commented:

The SaveModel function of the TensorflowTransform class did not save the TensorFlow saved_model directory into the model zip archive. Saving was implemented only for frozen graphs and was missing for the SavedModel format.

To fix it, I followed the same scheme used in the DnnRetrainTransform class:

```csharp
internal static class DefaultModelFileNames
{
    public const string VariablesFolder = "variables";
    public const string Index = "variables.index";
    public const string Data = "variables.data-00000-of-00001";
    public const string Graph = "saved_model.pb";
    public const string TmpMlnetModel = "mlnet_model";
}
```

and
```csharp
ctx.SaveBinaryStream("TFSavedModel", w =>
{
    // Only these files need to be saved.
    string[] modelFilePaths =
    {
        Path.Combine(_modelLocation, DefaultModelFileNames.Graph),
        Path.Combine(_modelLocation, DefaultModelFileNames.VariablesFolder, DefaultModelFileNames.Data),
        Path.Combine(_modelLocation, DefaultModelFileNames.VariablesFolder, DefaultModelFileNames.Index),
    };
    w.Write(modelFilePaths.Length);
    foreach (var fullPath in modelFilePaths)
    {
        var relativePath = fullPath.Substring(_modelLocation.Length + 1);
        w.Write(relativePath);
        using (var fs = new FileStream(fullPath, FileMode.Open))
        {
            long fileLength = fs.Length;
            w.Write(fileLength);
            long actualWritten = fs.CopyRange(w.BaseStream, fileLength);
            Host.Assert(actualWritten == fileLength);
        }
    }
});
```
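For context, the loading counterpart that consumes this "TFSavedModel" stream (following the same DnnRetrainTransform scheme) reads each entry back and rebuilds the SavedModel directory on disk. A minimal sketch, assuming `ctx` is a `ModelLoadContext` and `tempDirPath` is a scratch directory chosen by the caller (both names are illustrative, not verbatim from this PR):

```csharp
// Sketch of the matching load path; mirrors the save format above:
// int (file count), then per file: string (relative path), long (length), raw bytes.
ctx.TryLoadBinaryStream("TFSavedModel", br =>
{
    int count = br.ReadInt32();
    for (int i = 0; i < count; i++)
    {
        string relativePath = br.ReadString();   // e.g. "variables/variables.index"
        long fileLength = br.ReadInt64();
        string fullPath = Path.Combine(tempDirPath, relativePath);
        Directory.CreateDirectory(Path.GetDirectoryName(fullPath));
        using (var fs = new FileStream(fullPath, FileMode.Create, FileAccess.Write))
            br.BaseStream.CopyRange(fs, fileLength);
    }
});
```

Because `BinaryWriter.Write(string)` is length-prefixed, `ReadString` pairs with it exactly; the stored per-file length then tells the reader how many raw bytes to copy out of the stream.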

This logic was not present in the TensorflowTransform class; only the frozen-graph saving was implemented:

```csharp
private protected override void SaveModel(ModelSaveContext ctx)
{
    Host.AssertValue(ctx);
    ctx.CheckAtModel();
    ctx.SetVersionInfo(GetVersionInfo());

    // *** Binary format ***
    // byte: indicator for frozen models
    // byte: indicator for adding batch dimension in input
    // byte: indicator for treating output as batched
    // stream: TensorFlow model
    // int: number of input columns
    // for each input column
    //   int: id of input column name
    // int: number of output columns
    // for each output column
    //   int: id of output column name
    var isFrozen = string.IsNullOrEmpty(_savedModelPath);
    ctx.Writer.WriteBoolByte(isFrozen);
    ctx.Writer.WriteBoolByte(_addBatchDimensionInput);
    ctx.Writer.WriteBoolByte(_treatOutputAsBatched);
    if (isFrozen)
    {
        using (var status = new Status())
        using (var buffer = Session.graph.ToGraphDef(status))
        {
            ctx.SaveBinaryStream("TFModel", w =>
            {
                w.WriteByteArray(buffer.DangerousMemoryBlock.ToArray());
            });
        }
    }

    Host.AssertNonEmpty(Inputs);
    ctx.Writer.Write(Inputs.Length);
    foreach (var colName in Inputs)
        ctx.SaveNonEmptyString(colName);

    Host.AssertNonEmpty(Outputs);
    ctx.Writer.Write(Outputs.Length);
    foreach (var colName in Outputs)
        ctx.SaveNonEmptyString(colName);
}
```

This produced an incomplete zip archive that could not be reloaded afterwards.

After this fix the zip archive can be saved and reloaded for inference (example attachment: saved_model.pb.zip).

@dnfadmin commented May 17, 2021:

CLA assistant check — all CLA requirements met.

@codecov bot commented May 17, 2021:

Codecov Report

Merging #5797 (ce683fc) into main (43c49f6) will increase coverage by 0.00%.
The diff coverage is 100.00%.

```
@@           Coverage Diff            @@
##             main    #5797    +/-   ##
========================================
  Coverage   68.35%   68.36%
========================================
  Files        1131     1131
  Lines      241210   241372   +162
  Branches    25039    25055    +16
========================================
+ Hits       164887   165011   +124
- Misses      69819    69857    +38
  Partials     6504     6504
```
| Flag | Coverage Δ |
|------|------------|
| Debug | 68.36% <100.00%> (+<0.01%) ⬆️ |
| production | 62.97% <100.00%> (+<0.01%) ⬆️ |
| test | 89.25% <100.00%> (+0.01%) ⬆️ |

Flags with carried forward coverage won't be shown.

| Impacted Files | Coverage Δ |
|----------------|------------|
| src/Microsoft.ML.TensorFlow/TensorflowTransform.cs | 84.70% <100.00%> (+5.22%) ⬆️ |
| ...cenariosWithDirectInstantiation/TensorflowTests.cs | 92.35% <100.00%> (+0.29%) ⬆️ |
| ...c/Microsoft.ML.FastTree/Utils/ThreadTaskManager.cs | 79.48% <0.00%> (-20.52%) ⬇️ |
| src/Microsoft.ML.Core/Data/ProgressReporter.cs | 70.95% <0.00%> (-6.99%) ⬇️ |
| src/Microsoft.ML.FastTree/FastTreeRanking.cs | 50.79% <0.00%> (-4.28%) ⬇️ |
| src/Microsoft.ML.Data/MLContext.cs | 90.47% <0.00%> (-2.03%) ⬇️ |
| src/Microsoft.ML.FastTree/Dataset/IntArray.cs | 12.10% <0.00%> (-0.11%) ⬇️ |
| ...oft.ML.Tests/OnnxSequenceTypeWithAttributesTest.cs | 94.33% <0.00%> (-0.11%) ⬇️ |
| src/Microsoft.ML.OnnxTransformer/OnnxUtils.cs | 86.63% <0.00%> (-0.06%) ⬇️ |
| src/Microsoft.ML.FastTree/FastTree.cs | 80.16% <0.00%> (-0.06%) ⬇️ |
| ... and 17 more | |

@michaelgsharp (Member) commented:

@darth-vader-lg this looks great! Thanks for taking the time to submit this.

Can you add a unit test around this new saving/loading?

@darth-vader-lg (Contributor, PR author) commented:

Hello @michaelgsharp,
I will add it ASAP.
Sorry I didn't do it before; I tried, but I had some trouble running the unit test system on my PC. As usual I had to quickly deliver to my customer, who was pushing me, the robotic vision application I'm working on, and... bye bye 🙆‍♂️.
In any case, the fix is already working well in the field. 😉
I will prepare the test while working on the integration of TensorFlow 2.5.0 with ML.NET.

@darth-vader-lg (Contributor, PR author) commented:

Hello @michaelgsharp,

The unit test is added: TensorFlowSaveAndLoadSavedModel.

It performs the following steps:

  • Loads the standard test model cifar_saved_model and creates the transformer.
  • Runs some predictions.
  • Saves the transformer as an ML.NET zip archive.
  • Reloads the transformer from the saved zip archive.
  • Runs the same predictions again.
  • Compares the results for equality.
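In outline, a test following those steps looks roughly like this sketch (the column names, data helper, and file path are illustrative assumptions, not the exact test code from the PR):

```csharp
[TensorFlowFact]
public void TensorFlowSaveAndLoadSavedModel()
{
    var mlContext = new MLContext(seed: 1);
    IDataView data = GetCifarTestData(mlContext);   // hypothetical helper producing test images

    // Build and fit the TensorFlow scoring transformer from the SavedModel directory.
    var transformer = mlContext.Model.LoadTensorFlowModel("cifar_saved_model")
        .ScoreTensorFlowModel("Output", "Input")
        .Fit(data);

    // Predictions before saving.
    var before = transformer.Transform(data).GetColumn<float[]>("Output").ToArray();

    // Save as an ML.NET zip archive, then reload it.
    mlContext.Model.Save(transformer, data.Schema, "tf_savedmodel_test.zip");
    var reloaded = mlContext.Model.Load("tf_savedmodel_test.zip", out _);

    // The reloaded transformer must produce identical predictions.
    var after = reloaded.Transform(data).GetColumn<float[]>("Output").ToArray();
    Assert.Equal(before, after);
}
```

The save/load round trip is the part that exercises the fix: without the "TFSavedModel" stream in the archive, `mlContext.Model.Load` would fail to rebuild the SavedModel directory.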

Please check that everything is OK for you, and have a good merge.

@michaelgsharp (Member) left a review comment:
LGTM. Thanks!

@michaelgsharp michaelgsharp merged commit 992d989 into dotnet:main Jun 4, 2021
@darth-vader-lg darth-vader-lg deleted the bugfix-tf-saved-model branch June 26, 2021 08:28
@ghost ghost locked as resolved and limited conversation to collaborators Mar 17, 2022