Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI][Python][Parquet] Failed to open local file column_chunk_key_value_metadata.parquet #43785

Closed
wgtmac opened this issue Aug 22, 2024 · 7 comments

Comments

@wgtmac
Copy link
Member

wgtmac commented Aug 22, 2024

Describe the bug, including details regarding any error messages, version, and platform.

https://github.com/apache/arrow/actions/runs/10493727075/job/29068353962?pr=40594

=================================== FAILURES ===================================
_____________________ test_column_chunk_key_value_metadata _____________________

parquet_test_datadir = PosixPath('/Users/runner/work/arrow/arrow/cpp/submodules/parquet-testing/data')

    def test_column_chunk_key_value_metadata(parquet_test_datadir):
>       metadata = pq.read_metadata(parquet_test_datadir /
                                    'column_chunk_key_value_metadata.parquet')

/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/pyarrow/tests/parquet/test_metadata.py:788: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/site-packages/pyarrow/parquet/core.py:2297: in read_metadata
    file_ctx = where = filesystem.open_input_file(where)
pyarrow/_fs.pyx:789: in pyarrow._fs.FileSystem.open_input_file
    ???
pyarrow/error.pxi:155: in pyarrow.lib.pyarrow_internal_check_status
    ???
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

>   ???
E   FileNotFoundError: [Errno 2] Failed to open local file '/Users/runner/work/arrow/arrow/cpp/submodules/parquet-testing/data/column_chunk_key_value_metadata.parquet'. Detail: [errno 2] No such file or directory

pyarrow/error.pxi:92: FileNotFoundError

Component(s)

Parquet, Python

@wgtmac
Copy link
Member Author

wgtmac commented Aug 22, 2024

This commit is suspicious: #41580

cc @mapleFU

@wgtmac
Copy link
Member Author

wgtmac commented Aug 22, 2024

A similar one but with different file: https://github.com/apache/arrow/actions/runs/10493727068/job/29068354546?pr=40594

[ RUN      ] TestArrowReadWrite.FuzzReader
unknown file: Failure
C++ exception with description "IOError: Failed to open local file '/Users/runner/work/arrow/arrow/cpp/submodules/parquet-testing/data/../bad_data/ARROW-GH-41317.parquet'. Detail: [errno 2] No such file or directory" thrown in the test body.
[  FAILED  ] TestArrowReadWrite.FuzzReader (1 ms)

@wgtmac
Copy link
Member Author

wgtmac commented Aug 22, 2024

Never mind, I made a mistake in the rebase: #40594

@jorisvandenbossche
Copy link
Member

This is actually an issue for some of the nightly crossbow builds, see also my comment at #41580 (comment), so going to reuse this issue for that, since it already has the correct title :)

jorisvandenbossche added a commit to jorisvandenbossche/arrow that referenced this issue Aug 22, 2024
@mapleFU
Copy link
Member

mapleFU commented Aug 22, 2024

I'll tackle this tonigh

@jorisvandenbossche
Copy link
Member

I already opened a PR for the wheel tests, but as mentioned in #41580 (comment), also the python-emscriptem test currently is failing because of this (that is not yet tackled in my current PR)

jorisvandenbossche added a commit that referenced this issue Aug 22, 2024
…43786)

### Rationale for this change

Starting with #41580, the pyarrow tests now also rely on a file in the parquet-testing submodule. And the path to that directory is controlled by `PARQUET_TEST_DATA`, which appears to be set wrongly in the wheel test scripts, causing all wheel builds to fail at the moment.
 
* GitHub Issue: #43785

Authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Signed-off-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
@jorisvandenbossche jorisvandenbossche added this to the 18.0.0 milestone Aug 22, 2024
@jorisvandenbossche
Copy link
Member

Issue resolved by pull request 43786
#43786

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants