-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Iceberg table (Glue catalog) - unable to fetch, InputFormat cannot be null #359
Comments
Another relevant finding for this issue - when creating new table from seed, a non-iceberg table is creates: |
This article about DBT and Glue doesn't mention this specifically but seems like DBT-Glue is not able to read Iceberg tables (InputFormat cannot be null). In their setup they use Hive tables for intermediate stage and Iceberg only for final layer. Can anyone confirm my conclusion that Iceberg table can be used only in a final stage of the processing pipeline? |
Have the same issue with Iceberg. Maybe also related to the fact I use LakeFormation |
hi, i have the same issue (dbt and dbt-glue 1.7, glue 4.0, with lake formation), so i tried replicating the dbt code and running it in a glue notebook, and i did get the exact same error in the notebook as well. adding instead - i added these configs:
that also worked in the notebook, since the job now used my catalog instead of the default one (named however, i still couldn't get dbt to work, even though i added these two configs in the profiles yaml. to conclude - i've identified 2 problems:
|
update - got it working, don't know why it didn't work before... the solution was adding the default configs -
|
When trying this I get the error: This is my conf now:
|
@JanKrl this is exactly what i have (only did you make sure to leave out the first conf: >
spark.sql.catalog.glue_catalog=org.apache.iceberg.spark.SparkCatalog
--conf spark.sql.catalog.glue_catalog.catalog-impl=org.apache.iceberg.aws.glue.GlueCatalog
... |
That did the trick! For sake of clarity, here is the full config:
|
Describe the bug
When reading a source table (Iceberg) I get the following error:
After some googling, I found suggestion to add
glue_catalog
before table name. This results with:Steps To Reproduce
Apache Iceberg
-
-
-
As far as I can tell this is expected outcome.
I also tried all sorts of additional configs based on what I found online:
Expected behavior
The source table should have been read and DBT should have created new table from the model.
System information
The output of
dbt --version
:The operating system you're using:
The output of
python --version
:Python 3.11.0
Additional context
The text was updated successfully, but these errors were encountered: