Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Presto does not support Chinese by parquet #23969

Open
i95271116 opened this issue Nov 7, 2024 · 5 comments
Open

Presto does not support Chinese by parquet #23969

i95271116 opened this issue Nov 7, 2024 · 5 comments
Labels

Comments

@i95271116
Copy link

For example :
Select * from table where col = '中文'.
For Now I must write sql like this if field contains Chinese.
Select * from table where to_utf8(col) = to_utf8('中文')

my version is 0.234.2-add98eb

@agrawalreetika
Copy link
Member

Hi @i95271116
Is it the same even with the latest presto as well?

@i95271116
Copy link
Author

I tested both presto-server-0.216 and presto-server-0.285.1, but the problem still exists.

@i95271116
Copy link
Author

In the where condition, for Chinese characters, "where trim(‘column_name’)"=‘切断压力’ needs to be added

@i95271116
Copy link
Author

Or it is normal to use "SELECT * from
table
where alias='切断压力' or rand() = 1 limit 10".

@hantangwangd
Copy link
Member

Hi @i95271116. Can you elaborate the issue? For example, which connector do you use, and what are the steps to reproduce the problem.

I cannot reproduce it on Iceberg connector with parquet format.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: 🆕 Unprioritized
Development

No branches or pull requests

3 participants