You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The reason we would like to have this setup is that we have a model deployed with v1.0 which has a mixed rest/grpc graph. This model is an image processing based model and is data size and latency sensitive. In our testing we were not able to get the full end to end GRPC working within our existing infrastructure components, and full REST had too high of a latency. With GRPC we got the execution under 100ms which met our requirements. There was some latency difference between the mixed protocol graph and pure GRPC, but overall it was still in the acceptable range.
The pod itself with full GRPC transport worked correctly, however when integrating with Ambassador and other parts of our infrastructure (AWS ELB, TLS termination etc..) we were not able to make successful connections.
There was a configuration where I was able to use "grpcurl" to make the connection but it did not work with the python client. I think this was due to AWS ELB's not fully supporting GRPC, or TLS termination just not being compatible with GRPC and HTTP/2. Particularly the protocol negotiation/upgrade part (ALPN). 1.20.0.pre3 can't connect to server behind ELB (ALPN check) grpc/grpc#18710 . I understand this is not particularly a Seldon issue, but given that you guys support ambassador as a recommended API gateway, I think it is useful information for your users. I will probably make an issue in the Ambassador github relating to this as well if it doesn't already exist.
I'm open to any suggestions, it is entirely possible I am missing something. I am fairly new to using Seldon. In the meantime I think we'll be ok with 1.0.2, however I would like to make the switch to the new golang executor soon, provided we can successfully migrate our models.
The text was updated successfully, but these errors were encountered:
We allow both REST and gRPC now for all deployments. We might look to have a switch to allow users to externally use REST and internal gRPC bit mixed is not on roadmap
In the newly released version (1.1) one of the breaking changes is to disallow mixed rest/grpc graphs
The reason we would like to have this setup is that we have a model deployed with v1.0 which has a mixed rest/grpc graph. This model is an image processing based model and is data size and latency sensitive. In our testing we were not able to get the full end to end GRPC working within our existing infrastructure components, and full REST had too high of a latency. With GRPC we got the execution under 100ms which met our requirements. There was some latency difference between the mixed protocol graph and pure GRPC, but overall it was still in the acceptable range.
For reference our graph looks like this
I'm open to any suggestions, it is entirely possible I am missing something. I am fairly new to using Seldon. In the meantime I think we'll be ok with 1.0.2, however I would like to make the switch to the new golang executor soon, provided we can successfully migrate our models.
The text was updated successfully, but these errors were encountered: