-
Notifications
You must be signed in to change notification settings - Fork 416
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: Partition value cannot be parsed from string. #2380
Comments
it appears that the
|
@thomasfrederikhoeck it shouldn't write the partition values with a Z. Also my PR didn't touch the partition value serialization. |
@thomasfrederikhoeck This issue seems to be related to how pyarrow engine is serializing the partition values |
Yes it appears that pyarrow serialize import pyarrow as pa
import pytz
tz = "UTC"
def get_data(with_tz):
tzinfo = pytz.timezone(tz) if with_tz else None
dates = pd.date_range(
datetime(2021,1,1,3,4,6,3, tzinfo=tzinfo),
datetime(2021,1,3,3,4,6, tzinfo=tzinfo)
)
return pd.DataFrame({"time":dates, "a":[i for i in range(len(dates))]})
schema = pa.schema(
[
("time", pa.timestamp("us")),
("a", pa.int64()),
]
)
dt = DeltaTable.create(
"mytable_timestampNtz", schema=schema, partition_by=["time"]
)
write_deltalake("mytable_timestampNtz",get_data(with_tz=False), partition_by="time", mode="append")
print(dt.schema())
schema = pa.schema(
[
("time", pa.timestamp("us",tz)),
("a", pa.int64()),
]
)
dt = DeltaTable.create(
"mytable_timestamp", schema=schema, partition_by=["time"]
)
write_deltalake("mytable_timestamp",get_data(with_tz=True), partition_by="time", mode="append")
print(dt.schema())
>Schema([Field(time, PrimitiveType("timestampNtz"), nullable=True), Field(a, PrimitiveType("long"), nullable=True)])
>Schema([Field(time, PrimitiveType("timestamp"), nullable=True), Field(a, PrimitiveType("long"), nullable=True)]) |
@ion-elgreco I wanted to try the rust engine but the problem is that it serialize like this which is invalid on Windows where you can't have colon (
|
@thomasfrederikhoeck can you make a separate issue for that |
Yes, #2382 :-) @ion-elgreco |
@thomasfrederikhoeck for this one, can you also create a separate issue? : P |
@ion-elgreco Done #2384 :-) |
Environment
Delta-rs version:
Main including 6f81b80
Binding:
python
Environment:
Bug
What happened:
When I try to create a checkpoint on a table partioned by timestamp I'm hit with a
ValueError. Note that I have build from master including
#2357:which gives:
What you expected to happen:
That the checkpoint was created.
How to reproduce it:
Run code above
More details:
The text was updated successfully, but these errors were encountered: