-
Notifications
You must be signed in to change notification settings - Fork 5.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TPCH connector return different results on prestissimo for l_quantity #24010
Comments
cc: @amitkdutta, @aditi-pandit |
CC: @pramodsatya I just had a chat with @aditi-pandit, she confirmed its a data generation issue - rather than a table reader bug. Its coming from here |
Yes this is a data generation issue and the velox Tpch connector seems to be generating incorrect data (when compared with Presto). For the column Also as per the Tpch specs, |
Thanks @pramodsatya. Is this one similar issue as well? #24011 |
Yes @amitkdutta, the checksum mismatch here appears to be because of data mismatch between Presto and Prestissimo Tpch connectors for column |
@pramodsatya, I know the Varchar data is mismatched as well. I think it should be the same as Java. Can you confirm this against DuckDB as well? |
@pramodsatya can we fix this to bring parity with Presto Java? I think double would be okay since the decimal scale seems to be atmost 2. |
@majetideepak, yes the Varchar data is mismatched too and it differs from Presto Java, this is from checking data with SF1 for following columns from three tables: |
Sure, I opened a PR in velox to fix the data generated for column Upon checking the Tpch SF1 data generated by Prestissimo and Presto and comparing them against Duckdb and the spec, it looks like there is a parity only for Varchar columns and the column |
@pramodsatya Can you clarify this further? How different are the Java and Prestissimo results after your fix? All SF should have the same data. |
@majetideepak, apologies for the confusion. This comment was only with respect to the Is it fine if we take up all these dbgen fixes in a separate PR? |
@pramodsatya : Yes, lets handle the varchar issues in #24011 |
Java
Prestissimo
The text was updated successfully, but these errors were encountered: