Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doc: Do Not Modify the Source Data Table During MergeIntoCommand Exec… #11787

Closed
wants to merge 3 commits into from
Closed
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions docs/docs/spark-writes.md
Original file line number Diff line number Diff line change
Expand Up @@ -101,6 +101,9 @@ Spark 3.5 added support for `WHEN NOT MATCHED BY SOURCE ... THEN ...` to update
WHEN NOT MATCHED BY SOURCE THEN UPDATE SET status = 'invalid'
```

!!! danger
Note: For copy on write table,if source data table is not iceberg table,Please Do Not Modify the Source Data Table During Execution(For example:hive table,jdbc view table,another datalake table,etc.).Due to the need for Spark to use the source table consecutively twice for computation,the relation which is created of source data must remain constant through the two different passes of the source data.If the source data query would return different results user will see odd behavior(For example: data loss).

### `INSERT OVERWRITE`

`INSERT OVERWRITE` can replace data in the table with the result of a query. Overwrites are atomic operations for Iceberg tables.
Expand Down