You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Triton's documentation now contains a description of a state management feature which should help to improve the efficiency of the stateful models in hermes.quiver by removing the need for updating the snapshot weight in the model itself. This in turn makes it unnecessary to implement the stateful models in TensorFlow. A simple implementation of this for the snapshotter might look like:
Triton's documentation now contains a description of a state management feature which should help to improve the efficiency of the stateful models in
hermes.quiver
by removing the need for updating the snapshot weight in the model itself. This in turn makes it unnecessary to implement the stateful models in TensorFlow. A simple implementation of this for the snapshotter might look like:And the model config would include a section that looks something like
With the actual state naming mechanism needing to be worked out.
This will require adding functionality to the
Exporter
classes to be able to mark certain inputs as states.The text was updated successfully, but these errors were encountered: