
Conversation

@wangyum (Member) commented Apr 18, 2025

Rationale for this change

Switching from the original implementation (which used Channels.newChannel(out)) to writing directly via the OutputStream resolves the deadlock for the following reason:

In the original implementation, Channels.newChannel(out) wraps the OutputStream in a WritableByteChannelImpl, which guards every write with its own internal lock. When Spark's task interruption mechanism (the task reaper thread) attempts to interrupt or close the channel, it acquires locks in a different order than the executor thread that is writing data. Specifically:

  • The executor thread holds the DFSOutputStream lock and waits for the internal lock of WritableByteChannelImpl.

  • The task reaper thread holds the internal lock of WritableByteChannelImpl and waits for the DFSOutputStream lock (during hflush()).

This conflicting lock acquisition order results in a deadlock.

Writing directly to the OutputStream, without Channels.newChannel, eliminates the intermediate lock introduced by WritableByteChannelImpl. This removes the conflicting lock-acquisition order and thus resolves the deadlock.
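For illustration, here is a minimal, self-contained sketch of that lock-order inversion. The lock names are placeholders standing in for the DFSOutputStream monitor and the internal write lock of WritableByteChannelImpl; this is not the actual Parquet, HDFS, or JDK code.

public class LockOrderDeadlockSketch {
  // Placeholder monitors: stand-ins for the DFSOutputStream monitor and the
  // internal write lock of WritableByteChannelImpl.
  private static final Object streamLock = new Object();
  private static final Object channelLock = new Object();

  public static void main(String[] args) {
    Thread executor = new Thread(() -> {
      synchronized (streamLock) {          // executor: holds the stream lock while writing
        sleepQuietly(100);
        synchronized (channelLock) {       // ...then needs the channel lock -> blocks
          System.out.println("executor wrote data");
        }
      }
    }, "executor");

    Thread reaper = new Thread(() -> {
      synchronized (channelLock) {         // reaper: holds the channel lock while closing
        sleepQuietly(100);
        synchronized (streamLock) {        // ...then hflush() needs the stream lock -> blocks
          System.out.println("reaper closed the stream");
        }
      }
    }, "task-reaper");

    executor.start();
    reaper.start();                        // with these sleeps, both threads wait forever
  }

  private static void sleepQuietly(long millis) {
    try { Thread.sleep(millis); } catch (InterruptedException ignored) { }
  }
}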

What changes are included in this PR?

  • Removed the usage of Channels.newChannel(out) and WritableByteChannel.
  • Updated the writeAllTo method to write data directly to the OutputStream using ByteBuffer operations.
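A minimal sketch of what "write data directly to the OutputStream using ByteBuffer operations" can look like; the helper name writeBufferTo and the 8 KB copy buffer are illustrative assumptions, not the actual patched method:

import java.io.IOException;
import java.io.OutputStream;
import java.nio.ByteBuffer;

final class DirectWriteSketch {
  // Hypothetical helper: write the remaining bytes of a ByteBuffer straight to
  // the OutputStream, without wrapping it in Channels.newChannel(out).
  static void writeBufferTo(ByteBuffer buffer, OutputStream out) throws IOException {
    if (buffer.hasArray()) {
      // Heap buffer: write the backing array region in one call.
      out.write(buffer.array(), buffer.arrayOffset() + buffer.position(), buffer.remaining());
      buffer.position(buffer.limit());
    } else {
      // Direct buffer: copy through a small temporary array.
      byte[] tmp = new byte[8192];
      while (buffer.hasRemaining()) {
        int len = Math.min(tmp.length, buffer.remaining());
        buffer.get(tmp, 0, len);
        out.write(tmp, 0, len);
      }
    }
  }
}

The point of this shape is that the only lock involved is whatever the OutputStream itself takes; there is no extra channel-level lock for the task reaper thread to contend on.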

Are these changes tested?

Manual test.
There are no zombie (deadlocked) tasks after applying this patch:
[screenshot]

Are there any user-facing changes?

No.

@wangyum wangyum marked this pull request as draft April 18, 2025 02:56
@wangyum wangyum changed the title [WIP] [GH-3193] Avoid deadlock in writeAllTo by removing WritableByteChannel usage Apr 18, 2025
@wangyum wangyum marked this pull request as ready for review April 18, 2025 16:16
@wangyum (Member, Author) commented Apr 18, 2025

@gszadovszky @wgtmac Would you please take a look at this patch?

@vrozov (Member) commented Apr 21, 2025

The new implementation is uninterruptible compared to the previous implementation that uses WritableByteChannel.
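For context, here is a small standalone sketch of that difference (assumed typical JDK behavior, not Parquet code): a write that goes through Channels.newChannel(out) runs inside an interruptible channel, so interrupting the writing thread closes the channel and aborts the write, whereas a plain OutputStream.write ignores the interrupt flag and keeps writing.

import java.io.OutputStream;
import java.nio.ByteBuffer;
import java.nio.channels.Channels;
import java.nio.channels.WritableByteChannel;

public class InterruptibleWriteSketch {
  public static void main(String[] args) throws Exception {
    // A do-nothing sink keeps the example self-contained (Java 11+).
    OutputStream sink = OutputStream.nullOutputStream();
    WritableByteChannel channel = Channels.newChannel(sink);

    Thread writer = new Thread(() -> {
      ByteBuffer buf = ByteBuffer.allocate(1024);
      try {
        while (true) {
          buf.clear();
          // Once the writer is interrupted, the channel is closed and this call
          // fails (typically with a ClosedByInterruptException).
          channel.write(buf);
        }
      } catch (Exception e) {
        System.out.println("channel write aborted: " + e);
      }
    }, "writer");

    writer.start();
    Thread.sleep(100);
    writer.interrupt(); // a plain OutputStream.write would not react to this
    writer.join();
  }
}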

@gszadovszky (Contributor) commented:

@wangyum, not sure I get the purpose of this fix. We have already closed the related Parquet issue saying that the bug is in Spark. See here. Am I misinterpreting something?

@wangyum (Member, Author) commented Apr 23, 2025

Thanks for the reply @gszadovszky.

Although it is a bug in Spark, Spark never triggered it until Parquet was upgraded to 1.15.1.
Parquet 1.15.1 fixes a critical CVE (CVE-2025-30065), which is prompting many users to consider upgrading, especially those using the parquet-avro module.
I hope the Parquet team can also address this issue so that users on Spark versions below 4.0 can use the new Parquet version without problems.

@vrozov (Member) commented Apr 23, 2025

@wangyum There is a fix on the Spark side, and if this is a critical CVE, IMO it would be better to ask the Spark community to backport the fix to 3.5.x so existing Spark users can benefit from the Parquet upgrade. Can you please try to cherry-pick and test it?

@wgtmac (Member) commented Apr 23, 2025

I agree with @vrozov and @gszadovszky that Spark, not Parquet, should fix this. Since the root cause is not in Parquet, other user dependencies may also trigger this bug, and a workaround on the Parquet side does not help there either.

@wangyum (Member, Author) commented Apr 23, 2025

@vrozov What should users do if they are still using Spark 3.0 - Spark 3.4? The Spark community no longer maintains these branches.

@pan3793 (Member) commented Apr 23, 2025

@wgtmac Is it possible to trigger patch releases for older Parquet branches, e.g. 1.13 (used by Spark 3.5) and 1.14?

@vrozov (Member) commented Apr 23, 2025

What should users do if they are still using Spark 3.0 - Spark 3.4? The Spark community no longer maintains these branches.

@wangyum Those still using Spark 3.0 - 3.4 should upgrade to Spark 3.5.x so they can continue to receive CVE and other fixes from the Spark community. Additionally, from the discussion on the Apache Spark side, the Parquet CVE-2025-30065 does not affect Spark, so the Spark community even considered not upgrading parquet-java to 1.15.1.

I also think that avoiding WritableByteChannel is not the correct generic fix. While it may work for Spark, it may break other consumers of parquet-java, as it converts an interruptible write into an uninterruptible one.

As a last resort, both Spark and Parquet are open source, so it is possible to recompile either one from your own fork.

@wangyum (Member, Author) commented Apr 23, 2025

it converts interruptible write to uninterruptible

Before Parquet 1.14.0, the write was also uninterruptible:

@Override
public void writeAllTo(OutputStream out) throws IOException {
  for (byte[] slab : slabs) {
    out.write(slab);
  }
}

@vrozov (Member) commented Apr 24, 2025

It is interruptible starting with 1.14, and going back to uninterruptible is not a good option, IMO. That would even impact Spark, as it would then have to write all the data instead of the I/O (write) being interrupted and aborted. If Spark needs this task to run without being interrupted, it should use runUninterruptibly.

@wangyum wangyum closed this Jun 23, 2025
@wangyum wangyum deleted the PARQUET-3193 branch June 23, 2025 01:40
