You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
0xdarkman
changed the title
delta v0.6.3
I do not use partitionBy when I write delta table. I do use mode overwrite. Although, I see new file being created each time after write.
1-...
2-...
...
6-...
I would expect same file being overwritten.
import pyarrow as pa
from deltalake.writer import write_deltalake
storage_options = {
"AZURE_STORAGE_ACCOUNT_NAME": account_name,
"AZURE_STORAGE_ACCOUNT_KEY": account_key,
}
table_path = "abfss://CONTAINERNAME@STORAGEACCOUNT.dfs.core.windows.net/TABLE_NAME"
dt = DeltaTable(table_path, storage_options=storage_options)
write_deltalake(table_or_uri=dt, df=df, mode="overwrite")
tb = pa.Table.from_pandas(df, preserve_index=False)
write_deltalake(table_or_uri=dt, data=tb, mode="overwrite")
Overwrite creates new file
Nov 27, 2022
version: delta v0.6.3
environment: cloud, azure
mode overwrite used.
no partition by option used
I see new file being created each time after write:
1-...
2-...
...
6-...
I would expect same file being overwritten so I would expect to see:
1-...
only.
import pyarrow as pa
from deltalake.writer import write_deltalake
storage_options = { "AZURE_STORAGE_ACCOUNT_NAME": account_name, "AZURE_STORAGE_ACCOUNT_KEY": account_key, }
table_path = "abfss://CONTAINERNAME@STORAGEACCOUNT.dfs.core.windows.net/TABLE_NAME"
dt = DeltaTable(table_path, storage_options=storage_options)
tb = pa.Table.from_pandas(df, preserve_index=False)
write_deltalake(table_or_uri=dt, data=tb, mode="overwrite")
The text was updated successfully, but these errors were encountered: