error reading iceberg table #2603

djouallah · 2024-02-06T07:19:47Z

data can be read fine using duckdb

vrongmeal · 2024-02-28T13:00:36Z

This seems like an issue with v1 format.

tychoish · 2024-03-06T13:46:59Z

I believe that this was addressed in #2718, which was released yesterday in 0.9.1.

@djouallah, let us know whether this works or you still have an issue, and whether you have any example datasets (or ways of producing data) that you think we should have integration tests for.

djouallah · 2024-03-06T14:17:50Z

Getting a new error now:
ExecutionException: External error: Data is invalid: Failed to read table metadata: missing field snapshot-log at line 81 column 1

You can generate Iceberg tables locally using pyiceberg:
https://colab.research.google.com/drive/1EjffJO75-8Rj4V0MGKUsoFHDOGgicKgK?usp=sharing
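For anyone reproducing this, a minimal sketch for checking what a given metadata file actually contains before handing it to a reader. It only relies on the two top-level keys `format-version` and `snapshot-log` (the field named in the error above); the helper name `summarize_metadata` is made up for illustration and is not part of glaredb or pyiceberg.

```python
import json
import sys
from pathlib import Path


def summarize_metadata(path):
    """Report the format version and whether `snapshot-log` is present.

    Hypothetical helper: it reads the metadata JSON as a plain dict and
    does not validate it against the Iceberg spec.
    """
    meta = json.loads(Path(path).read_text())
    return {
        "format-version": meta.get("format-version"),
        "has-snapshot-log": "snapshot-log" in meta,
    }


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python check_metadata.py <path to *.metadata.json>
    print(summarize_metadata(sys.argv[1]))
```

If `has-snapshot-log` comes back false, the file matches the "missing field snapshot-log" error reported here.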

vrongmeal · 2024-03-06T17:56:43Z

Thanks! I'll definitely take a look at this on priority. Once we have a variety of datasets, we should be able to resolve most of the incompatibility issues with v1.

tychoish · 2024-03-06T19:43:33Z

Just collecting some notes after looking at the notebook you posted @djouallah:

glaredb.sql("""
    select * from iceberg_scan('/content/warehouse/default.db/taxi_dataset/metadata/00001-c13c72f3-6082-444c-9256-bea980ff7e0e.metadata.json')
""")

And the error:

ExecutionException: External error: Failed to canonicalize path "/content/warehouse/default.db/taxi_dataset/metadata/00001-c13c72f3-6082-444c-9256-bea980ff7e0e.metadata.json/metadata/version-hint.text": Not a directory (os error 20)

It seems that both glaredb and duckdb expect to be pointed at the top-level directory that contains the Iceberg table, in this case /content/warehouse/default.db/taxi_dataset/, but when I do that I get a different error:

 Failed to canonicalize path "/content/warehouse/default.db/taxi_dataset/metadata/version-hint.text": No such file or directory (os error 2)

DuckDB has the same error (it's looking for the same file), so I imagine there's something unexpected about this dataset or the way it's saved. My look through the pyiceberg API hasn't turned up anything useful yet, but I will look into it again.
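A quick way to confirm what is on disk, as a sketch: the check below looks for the `metadata/version-hint.text` file named in both errors and lists whatever `*.metadata.json` files exist next to it. The function name `inspect_table_dir` is hypothetical, and the layout it assumes (a `metadata` directory directly under the table directory) is taken only from the paths in the error messages above.

```python
from pathlib import Path


def inspect_table_dir(table_dir):
    """Check a table directory for the files the readers look for.

    Hypothetical helper: returns whether metadata/version-hint.text
    exists and the sorted names of any *.metadata.json files.
    """
    metadata_dir = Path(table_dir) / "metadata"
    hint = metadata_dir / "version-hint.text"
    if metadata_dir.is_dir():
        files = sorted(p.name for p in metadata_dir.glob("*.metadata.json"))
    else:
        files = []
    return {"has_version_hint": hint.is_file(), "metadata_files": files}


if __name__ == "__main__":
    # Path taken from the notebook discussed in this thread.
    print(inspect_table_dir("/content/warehouse/default.db/taxi_dataset"))
```

If `has_version_hint` is false while `metadata_files` is non-empty, the table was written without a hint file, which would be consistent with the "No such file or directory" error.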

djouallah added the bug Something isn't working label Feb 6, 2024

universalmind303 added blocked Not actionable due to a blocker support User-driven support priority-high ⛰️ and removed blocked Not actionable due to a blocker labels Feb 6, 2024

vrongmeal mentioned this issue Mar 25, 2024

fix: Iceberg fixes for reading table metadata #2810

Merged

vrongmeal closed this as completed in #2810 Apr 11, 2024

vrongmeal added a commit that referenced this issue Apr 11, 2024

fix: Iceberg fixes for reading table metadata (#2810)

15cd5dc

Fixes #2603 --------- Signed-off-by: Vaibhav <vrongmeal@gmail.com>
