Demonstrate race condition to discuss how to fix #91 #94
base: master
Conversation
…InputFormat multiple times
Thanks for the report @themodernlife. We'll look into it.
Hey gang, just wondering if you guys have any feedback on this?
We haven't been able to spend much time on this. At the high level we recognize the issue, but it might be tricky/expensive to fix as users may have code written to work around this and we shouldn't break them...
public class LzopDecompressor extends LzoDecompressor {
    private static final Log LOG = LogFactory.getLog(LzopDecompressor.class);
not used?
Sorry, can you point to the fix? I must be missing something obvious.
@rangadi, this particular pull request was just to demonstrate the bug itself. I think @themodernlife's suggestion is to not pool decompressors (which he added and commented out in this PR).
If we simply remove the line that returns the decompressor in LzopInputStream.close(), it would satisfy the use cases mentioned here (via LineRecordReader, or any use case that returns the decompressor). However, I'm pretty certain there are a lot of use cases (correct or not) that are not returning the decompressor, implicitly relying on LzopInputStream.close(). For them, the decompressors would start leaking.

As you suggested, annotating the decompressor as DoNotPool is one option, but we would forgo the benefit of pooling. Decompressors carry native buffers, and not pooling them would have a fairly major performance implication depending on the use case, so I'm not sure we want to go there. BTW, it looks like the compressor (LzopOutputStream) has the same issue.
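The leak scenario described above can be sketched with a toy pool. This is not hadoop-lzo code; ToyDecompressor, borrow, and the pool itself are illustrative stand-ins. It shows what happens to callers that rely on the stream's close() to return the decompressor: if that return is removed and the caller never returns it either, the pool stays empty and every open pays for a fresh (native-buffer-backed) allocation.

```java
import java.util.ArrayDeque;
import java.util.Deque;

public class LeakDemo {
    static int allocations = 0;

    static class ToyDecompressor {
        ToyDecompressor() {
            allocations++; // stands in for allocating native buffers
        }
    }

    static final Deque<ToyDecompressor> pool = new ArrayDeque<>();

    static ToyDecompressor borrow() {
        ToyDecompressor d = pool.poll();
        // Pool empty: allocate a new decompressor, as CodecPool would.
        return d != null ? d : new ToyDecompressor();
    }

    public static void main(String[] args) {
        // A caller that never returns the decompressor itself, relying on
        // the stream's close() to do it. With that return removed, nothing
        // ever refills the pool.
        for (int i = 0; i < 3; i++) {
            ToyDecompressor d = borrow();
            // ... read the file, then close() without returning d ...
        }
        System.out.println(allocations); // 3: one fresh allocation per open
    }
}
```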
Now that #103 was merged, would you like to clean up this unit test and update this PR? It would be a useful test.
I was kind of glad to see #91, because I too have run into this issue while processing LZO files (from S3) via Spark using a local config with multiple threads.
I dug into this and the issue seems to be in hadoop-lzo/src/main/java/com/hadoop/compression/lzo/LzopInputStream.java (line 349, commit e8c11c2).
When using Spark, the exception occurs because the TextInputFormat creates a LineRecordReader that loads the decompressor from the pool. When it gets closed, the LineRecordReader adds it back to the pool... the trouble is that the underlying LzopInputStream also adds the decompressor back to the pool, which means the same decompressor is now in the pool twice. This means that the next 2 calls to getRecordReader are going to use the same decompressor, and that's where you get the race condition. I attached a unit test that demonstrates the error. The fix here is just to remove the bit that adds the decompressor back to the pool on line 349.
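The double-return can be sketched with a toy pool model. This is not the hadoop-lzo code itself; ToyPool, ToyDecompressor, borrow, and giveBack are hypothetical names used only to illustrate the mechanism described above: the same decompressor is returned once by LineRecordReader.close() and again by LzopInputStream.close(), so the next two borrowers share a single instance.

```java
import java.util.ArrayDeque;
import java.util.Deque;

public class DoubleReturnDemo {
    static class ToyDecompressor {}

    static class ToyPool {
        private final Deque<ToyDecompressor> pool = new ArrayDeque<>();

        ToyDecompressor borrow() {
            ToyDecompressor d = pool.poll();
            return d != null ? d : new ToyDecompressor();
        }

        void giveBack(ToyDecompressor d) {
            // No duplicate check, mirroring the pool behavior in the bug report.
            pool.push(d);
        }

        int size() {
            return pool.size();
        }
    }

    public static void main(String[] args) {
        ToyPool pool = new ToyPool();
        ToyDecompressor d = pool.borrow();

        // LineRecordReader.close() returns the decompressor...
        pool.giveBack(d);
        // ...and the underlying LzopInputStream.close() returns the same one.
        pool.giveBack(d);

        // The same instance is now pooled twice, so the next two readers
        // get the same decompressor and race against each other.
        ToyDecompressor reader1 = pool.borrow();
        ToyDecompressor reader2 = pool.borrow();
        System.out.println(reader1 == reader2); // true
    }
}
```

A duplicate-aware pool (or removing one of the two returns, as proposed above) makes the second giveBack a no-op and the race disappears.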
Can someone have a look at this as well and let me know what you think? I can clean up the PR and resubmit, but just wanted to add some info about the bug.
Cheers!