Allow mixed rest/grpc graphs in new golang based executor #1820

thjwhite · 2020-05-11T19:07:34Z

In the newly released version (1.1), one of the breaking changes is that mixed REST/gRPC graphs are disallowed.

The reason we would like to have this setup is that we have a model deployed with v1.0 which has a mixed REST/gRPC graph. This model is an image-processing model and is sensitive to data size and latency. In our testing we were not able to get full end-to-end gRPC working within our existing infrastructure components, and full REST had too high a latency. With gRPC we got the execution under 100ms, which met our requirements. There was some latency difference between the mixed-protocol graph and pure gRPC, but overall it was still in the acceptable range.

The pod itself with full gRPC transport worked correctly; however, when integrating with Ambassador and other parts of our infrastructure (AWS ELB, TLS termination, etc.) we were not able to make successful connections.
There was a configuration where I was able to use "grpcurl" to make the connection, but it did not work with the Python client. I think this was due to AWS ELBs not fully supporting gRPC, or TLS termination just not being compatible with gRPC and HTTP/2, particularly the protocol negotiation/upgrade part (ALPN); see grpc/grpc#18710, "1.20.0.pre3 can't connect to server behind ELB (ALPN check)". I understand this is not particularly a Seldon issue, but given that you guys support Ambassador as a recommended API gateway, I think it is useful information for your users. I will probably open an issue in the Ambassador GitHub repository relating to this as well if one doesn't already exist.
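One way to narrow down the ALPN suspicion is to ask the public endpoint which protocol it negotiates during the TLS handshake. The following is only a rough sketch using Python's standard `ssl` module; the function names are made up for illustration, and the hostname you pass in would be whatever DNS entry points at the load balancer. gRPC runs over HTTP/2, so an endpoint that does not select `h2` is unlikely to work with a client that enforces the ALPN check.

```python
import socket
import ssl


def build_alpn_context(protocols=("h2", "http/1.1")):
    """Return a client TLS context that offers the given ALPN protocols."""
    ctx = ssl.create_default_context()
    ctx.set_alpn_protocols(list(protocols))
    return ctx


def negotiated_alpn(host, port=443, timeout=5.0):
    """Connect over TLS and return the ALPN protocol the endpoint selected.

    Returns None when the endpoint did not select any ALPN protocol.
    """
    ctx = build_alpn_context()
    with socket.create_connection((host, port), timeout=timeout) as raw:
        with ctx.wrap_socket(raw, server_hostname=host) as tls:
            return tls.selected_alpn_protocol()


def can_carry_grpc(alpn_protocol):
    """gRPC needs HTTP/2, so the endpoint has to negotiate "h2"."""
    return alpn_protocol == "h2"
```

Calling `can_carry_grpc(negotiated_alpn("<your ELB hostname>"))` from the client's network would show whether the TLS-terminating hop offers HTTP/2 at all; it says nothing about the hops behind it.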
Converting the graph to REST was fairly straightforward, but we encountered latency of up to 1 second. I think this is probably due to the conversion between TFTensor/ndarray/JSON when it gets sent to TFServing: https://github.com/SeldonIO/seldon-core/blob/v1.0.2/integrations/tfserving/TfServingProxy.py#L102
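To give a feel for why a JSON hop could hurt with image-sized inputs, here is a small standard-library sketch that compares a packed float32 buffer with the same values serialised as JSON. It does not reproduce what `TfServingProxy.py` does; the 224x224x3 shape, the helper names, and the `instances` wrapper are assumptions chosen only for illustration.

```python
import json
import time
from array import array


def make_fake_image(height=224, width=224, channels=3):
    """Build a flat float32 buffer standing in for one image tensor (assumed shape)."""
    n = height * width * channels
    return array("f", ((i % 256) / 255.0 for i in range(n)))


def compare_encodings(pixels):
    """Return (binary_bytes, json_bytes, json_round_trip_seconds) for one payload."""
    binary = pixels.tobytes()  # packed representation, 4 bytes per value
    start = time.perf_counter()
    text = json.dumps({"instances": [pixels.tolist()]})  # every value becomes text
    decoded = json.loads(text)  # and has to be parsed back on the other side
    elapsed = time.perf_counter() - start
    assert len(decoded["instances"][0]) == len(pixels)
    return len(binary), len(text.encode("utf-8")), elapsed


if __name__ == "__main__":
    raw, as_json, seconds = compare_encodings(make_fake_image())
    print(f"binary: {raw} bytes, JSON: {as_json} bytes, JSON round trip: {seconds:.4f}s")
```

The JSON form is several times larger than the packed form, and the encode/decode cost is paid at each REST hop in the graph, which is consistent with (though not proof of) the latency we saw.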

For reference, our graph looks like this:

      graph:
        children:
        - children:
          - endpoint:
              service_host: localhost
              service_port: 9003
              type: REST
            implementation: TENSORFLOW_SERVER
            modelUri: s3://redacted
            name: redacted
            parameters:
            - name: signature_name
              type: STRING
              value: serving_default
            - name: model_name
              type: STRING
              value: redacted
            - name: model_input
              type: STRING
              value: in
            - name: model_output
              type: STRING
              value: out
            type: MODEL
          endpoint:
            service_host: localhost
            service_port: 9002
            type: REST
          implementation: UNKNOWN_IMPLEMENTATION
          name: output-transformer
          type: OUTPUT_TRANSFORMER
        endpoint:
          service_host: localhost
          service_port: 9001
          type: REST
        implementation: UNKNOWN_IMPLEMENTATION
        name: input-transformer
        type: TRANSFORMER
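As a rough illustration of what "mixed" means for a graph like this, the sketch below walks a nested graph and collects the endpoint type declared on each node. The helper names are hypothetical and not part of Seldon; the dictionary mirrors only the `name`, `endpoint.type` and `children` fields from the graph above, and flipping one node to `GRPC` is purely for demonstration.

```python
import copy


def endpoint_types(node):
    """Yield (name, endpoint type) for a graph node and all of its descendants."""
    yield node.get("name", "<unnamed>"), node.get("endpoint", {}).get("type")
    for child in node.get("children", []):
        yield from endpoint_types(child)


def is_mixed(graph):
    """True when the nodes of the graph declare more than one endpoint type."""
    return len({t for _, t in endpoint_types(graph) if t}) > 1


# Trimmed-down copy of the graph above (only the fields the walker reads).
graph = {
    "name": "input-transformer",
    "endpoint": {"type": "REST"},
    "children": [{
        "name": "output-transformer",
        "endpoint": {"type": "REST"},
        "children": [{
            "name": "redacted",
            "endpoint": {"type": "REST"},
        }],
    }],
}

# Hypothetical variant in which the model node is switched to gRPC.
mixed_graph = copy.deepcopy(graph)
mixed_graph["children"][0]["children"][0]["endpoint"]["type"] = "GRPC"
```

Here `is_mixed(graph)` is false and `is_mixed(mixed_graph)` is true; a check along these lines is presumably what a single-protocol executor has to enforce.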

I'm open to any suggestions; it is entirely possible I am missing something, as I am fairly new to using Seldon. In the meantime I think we'll be OK with 1.0.2; however, I would like to make the switch to the new golang executor soon, provided we can successfully migrate our models.

thjwhite · 2020-05-11T19:14:50Z

To clarify, the topology of the infrastructure components looks like this:
(AWS ELB) -> (Ambassador) -> (linkerd service mesh) -> (Seldon Model)

The ELB, Ambassador, and linkerd are configured for TLS transport. The DNS entry given to my client application points to the ELB.

ukclivecox · 2020-05-11T19:39:51Z

The solution we are trying to move towards is to multiplex REST and gRPC. The initial work can be tracked in #1762.

thjwhite · 2020-05-11T20:18:43Z

Thanks, that's great to hear. Let me know if there is anything you need to know with respect to our use case.

ukclivecox · 2021-01-07T11:28:13Z

We allow both REST and gRPC now for all deployments. We might look to have a switch to allow users to use REST externally and gRPC internally, but mixed is not on the roadmap.

thjwhite added the triage Needs to be triaged and prioritised accordingly label May 11, 2020

ukclivecox assigned glindsell May 14, 2020

ukclivecox removed the triage Needs to be triaged and prioritised accordingly label May 14, 2020

ukclivecox closed this as completed Jan 7, 2021
