-
-
Notifications
You must be signed in to change notification settings - Fork 156
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sink the stream, this avoids highWaterMark being hit - add test #126
Conversation
|
||
function complete(err, file) { | ||
cb(err, file); | ||
if (!err) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
seems very similar to my proposition in #120 but can you shed some light for me how the error propagates or not in this situation?
is dealing with the stream being done
or rather flushed as I proposed not ideal ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the return value from push isn't a done flag (or that's what I think I found in all my digging). I tried implementing your suggestion in #120 but it didn't seem to work. The cb
is being called on the line above with the err
and file
objects. If an error occurs and is passed to that, the error
event should be emitted on the stream and the pipeline torn down. I check to see if there wasn't an error before reading because I don't know if reading from a torn down stream blows up or not.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hmm based on my understanding, on http://bit.ly/1kyt00f it says that readable will return a boolean which will say if more data should be pushed into the buffer which is controlled by the highWaterMark http://bit.ly/1kytDGX then as it hits false in the case of theres no piping happening we can then stream.read
to pull chunks again from the buffer http://bit.ly/1kytTpp. When piping
from a destination stream.read
never gets called because the .pipe
handles the draining of the readable part http://bit.ly/1kyvnjr
I've tried my way with a pipe from the dest
and not piping essentially finish
ing the write part and it works..
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@davidchase you are correct that the flag means more data can be pushed; however, that has no bearing on if we can read more. As soon as we are process the file object and pass it on (calling the callback with err & file), we are able to read another and process that. As far as I am aware, backpressure is handled by the reading stream and because we weren't reading or resuming, the beginning of our stream was clogging the pipeline. As you stated, the piping of dest to somewhere else will drain it, so the read will essentially nothing in that case (I think).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
; however, that has no bearing on if we can read more
Ahh i guess I misunderstood the following:
_read should start pushing that data into the read queue by calling this.push(dataChunk). _read should continue reading from the resource and pushing data until push returns false, at which point it should stop reading from the resource...
https://nodejs.org/api/stream.html#stream_readable_read_size_1
so the read will essentially nothing in that case (I think).
yeah looks like it returns null when pipe
from dest
thanks for clarifying
edit
based on the above and since stream.push(chunk) + cb()
=== cb(null, chunk)
would you say this is fair?
function complete(err, file) {
var keepReading = stream.push(file);
if (keepReading) {
stream.read();
}
cb(err);
}
since keepingReading
will be true
we can continue reading from the resource
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@davidchase After consulting the stream docs that seems like it should work. We should make sure our stream implementation is in line with the latest docs
@phated ping me when you need a look |
@contra can you review the comments between @davidchase and I and help us figure out the correct solution here? My stream-fu seems to be weak. |
Just commented. Also: can we avoid the cruft of having a ton of fixture files and make those dummy files in code? This will also make it easier to test different #s later on down the road |
any luck with this one? |
I reached out to @chrisdickinson and @nrn on IRC but I don't think either was available to give any guidance. |
Sorry, checked it out but I haven't dealt with the subtleties around this enough to have any insight. Good luck! |
b87d645
to
8435bbe
Compare
@davidchase it seems your suggestion passes the test I added so I switched to that. Also updated as per @contra's suggestions and rebased. Let me know what you think |
@davidchase can you think of a test for when |
@phated i think the only time But if we are using Im also wondering if bypassing the buffering is ideal?? |
@davidchase I thought |
k, went through this with chris and he explained what is happening in the current implementation and the best way to fix this. I will update this PR and try to add another test soon. |
Sounds good 👍 Im curious to see how it looks after you update the PR |
9ab7372
to
6ea0b79
Compare
6ea0b79
to
df97b14
Compare
@contra @davidchase I've updated with @chrisdickinson's suggestion and added another test. Can you take a look again. |
@phated interesting can you explain a little bit of whats happening... are you taking the this seems very similar to the stream-exhaust approach, is that a correct assumption ? |
Review, LGTM - thanks @phated 🙏 |
@davidchase I believe a pipeline must end in a writeable stream to function correctly, so this was the suggestion from @chrisdickinson. I asked how it was different than just calling
and then further on in the discussion, he said the following, which I believe sums up the change nicely:
You are correct that it is similar to the stream-exhaust approach (I even asked chris about that) and I considered using it as a dependency, but since we are in control of the streams, we shouldn't need the fallback to the |
read from the stream once we finish processing a file, this avoids highWaterMark being hit - add test
Any idea when this fix will be in a new version of vinyl-fs? |
Soon. I'm dealing with some personal stuff right now but hoping to dive back in soon. |
read from the stream once we finish processing a file, this avoids highWaterMark being hit - add test
@contra I think I solved #120 with this. Can you take a look? I made the test and it was failing and then I made it pass. @davidchase could you also take a look at this?
Closes #120