-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Closed
Description
Describe the enhancement requested
The docstring for parquet.read_table doesn't specify that the source can be a list of strings:
arrow/python/pyarrow/parquet/core.py
Lines 1629 to 1632 in e2a5b4e
| source : str, pyarrow.NativeFile, or file-like object | |
| If a string passed, can be a single file name or directory name. For | |
| file-like objects, only read a single file. Use pyarrow.BufferReader to | |
| read a file contained in a bytes or buffer-like object. |
There is also a #TODO for a check on the source argument to validate is of the valid type in case dataset is not enabled.
arrow/python/pyarrow/parquet/core.py
Lines 1826 to 1829 in e2a5b4e
| if filesystem is not None: | |
| source = filesystem.open_input_file(path) | |
| # TODO test that source is not a directory or a list | |
| dataset = ParquetFile( |
This should only tackle those. There is another issue to test the fallback case successfully:
Component(s)
Documentation, Python