
Failover requests would not work for partially used streams #106

Open
rahulreddy opened this issue Jan 19, 2017 · 8 comments
@rahulreddy (Collaborator)

Identified by @alexandre-merle: streams that are partially consumed and then failed over to another node would not work, since only the remaining partial data could be piped to the new node.

rahulreddy added the bug label Jan 19, 2017
ghost commented Jan 19, 2017

This would explain scality/MetaData#940 then.

@rahulreddy (Collaborator, Author)

I guess, though I can't think of why the source stream would end up with an error; I would have assumed that any TCP errors connecting to the nodes would occur from the start.

ghost commented Jan 19, 2017

On the MD issue, @msegura traced some connection closures to the way keepalive is configured on our sproxyd's Tengine. Does it look to you like this could be the explanation?

@rahulreddy (Collaborator, Author)

I think it fits the picture. The log line with "PUT chunk to sproxyd"

{"name":"SproxydClient","error":{"code":"ECONNRESET"},"time":1484643601662,"req_id":"a7ea4c3151407e76b10e:7a6691a0746cb6785797","level":"error","message":"PUT chunk to sproxyd","hostname":"asvppdxobjs301.gecis.io","pid":59}

could only happen if a connection was established to the remote host, a socket was assigned, and the remote host later destroyed it for some reason. In that case, the stream may have been partially consumed.
The socket object has bytesRead / bytesWritten properties, which we can use to fail over only when no bytes have been consumed.

@ThibaultRiviere (Contributor)

That can explain the retry error, but not the first one, right?

@rahulreddy (Collaborator, Author)

Yeah I can't explain the first one.

@rachedbenmustapha (Contributor)

Does this really happen though, given that production deployments only have 1 sproxyd endpoint configured and thus can not fail over?

@rahulreddy (Collaborator, Author)

From the situation today, I realized that we have only one sproxyd endpoint, so my speculation goes out the window. It remains a potential problem when there are multiple nodes in the bootstrap list.
