One thing we do know for certain is that onnxruntime now has BERT support via its transformer optimizer; it'd be great to pull the BERT part out and treat it with either mixed precision or quantization.
The rest of the network can then become a self-contained optimisation problem, or remain in torch eager execution under an AMP context.
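
A rough sketch of what that split could look like, not a tested recipe for this repo. It assumes the BERT part has already been exported to an ONNX file, and that it is a BERT-base-sized model (12 heads, hidden size 768). The file paths, `RestOfNetwork`, and the helper function names are hypothetical placeholders. The onnxruntime calls are `optimize_model` from `onnxruntime.transformers.optimizer` and `quantize_dynamic` from `onnxruntime.quantization`.

```python
import torch
from torch import nn


def optimize_bert_fp16(onnx_in: str, onnx_out: str,
                       num_heads: int = 12, hidden_size: int = 768) -> None:
    """Fuse BERT subgraphs with onnxruntime's transformer optimizer, then cast to fp16."""
    # Imported lazily so the AMP half below still runs without onnxruntime installed.
    from onnxruntime.transformers import optimizer

    model = optimizer.optimize_model(
        onnx_in, model_type="bert", num_heads=num_heads, hidden_size=hidden_size
    )
    model.convert_float_to_float16()  # the mixed-precision route
    model.save_model_to_file(onnx_out)


def quantize_bert_int8(onnx_in: str, onnx_out: str) -> None:
    """Alternative route: dynamic int8 weight quantization of the exported BERT graph."""
    from onnxruntime.quantization import QuantType, quantize_dynamic

    quantize_dynamic(onnx_in, onnx_out, weight_type=QuantType.QInt8)


class RestOfNetwork(nn.Module):
    """Hypothetical stand-in for whatever sits on top of the BERT encoder."""

    def __init__(self, hidden_size: int = 768, num_labels: int = 2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(hidden_size, 256), nn.ReLU(), nn.Linear(256, num_labels)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


def run_rest_with_amp(module: nn.Module, features: torch.Tensor) -> torch.Tensor:
    """Run the remaining layers in torch eager mode under an autocast (AMP) context."""
    device_type = features.device.type
    # float16 is the usual choice on CUDA; bfloat16 is the one supported on CPU.
    amp_dtype = torch.float16 if device_type == "cuda" else torch.bfloat16
    with torch.no_grad(), torch.autocast(device_type=device_type, dtype=amp_dtype):
        return module(features)


rest = RestOfNetwork().eval()
features = torch.randn(4, 768)  # placeholder for the encoder output
logits = run_rest_with_amp(rest, features)
```

In practice `features` would come from an `onnxruntime.InferenceSession` running the optimized BERT graph, so the two halves can be tuned independently.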