Allow unparser to override the alias name for the specific dialect #16540

goldmedal · 2025-06-24T15:47:41Z

Which issue does this PR close?

No corresponding issue at DataFusion. I encountered it in the Wren AI case.

Fix count syntax for BigQuery Canner/wren-engine#1089

Rationale for this change

Consider the following SQL:

select count(*) from (select 1)

It will be simplified to the following SQL after we move CountWildcardRule to the logical planner #14689

SELECT count(1) AS \"count(*)\" FROM (SELECT 1)

However, some dialects (e.g. BigQuery) don't allow some special characters((, *, @, ...) in an alias name. We should have a way to handle this case for the unparser.

What changes are included in this PR?

Introduce the interface col_alias_overrides for the unparser dialect.

    /// Allows the dialect to override column alias unparsing if the dialect has specific rules.
    /// Returns None if the default unparsing should be used, or Some(String) if there is
    /// a custom implementation for the alias.
    fn col_alias_overrides(&self, _alias: &str) -> Result<Option<String>> {
        Ok(None)
    }

Add BigQueryDialect, which will encode the special character in the alias name.

Are these changes tested?

Add the unit test.
This change comes from Wren AI fork Bigquery col alias override Canner/datafusion#1 which is used in the production for a while. It works well.

Are there any user-facing changes?

new method for the unparser dialect.

alamb

Makes sense to me

alamb · 2025-06-24T19:53:20Z

datafusion/sql/tests/cases/plan_to_sql.rs


+#[test]
+fn roundtrip_statement_with_dialect_special_char_alias() -> Result<(), DataFusionError> {
+    roundtrip_statement_with_dialect_helper!(


it might also help to add a test case here with unparser_dialect GenericDialect too to show the difference in dialects

alamb · 2025-06-25T19:33:50Z

🚀

goldmedal · 2025-06-26T00:30:53Z

Thanks @alamb 👍

…pache#16540) * allow override col alias for specific dialect * improve test case * add generic dialect case

goldmedal added 2 commits June 24, 2025 23:20

allow override col alias for specific dialect

3bfaeb9

improve test case

196622e

github-actions bot added the sql SQL Planner label Jun 24, 2025

alamb approved these changes Jun 24, 2025

View reviewed changes

add generic dialect case

fed7a6f

alamb merged commit 6f2747f into apache:main Jun 25, 2025
27 checks passed

goldmedal deleted the feat/bigquery-alias branch June 26, 2025 00:30

goldmedal added a commit to Canner/datafusion that referenced this pull request Jul 4, 2025

Allow unparser to override the alias name for the specific dialect (a…

7608b2d

…pache#16540) * allow override col alias for specific dialect * improve test case * add generic dialect case

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow unparser to override the alias name for the specific dialect #16540

Allow unparser to override the alias name for the specific dialect #16540

Uh oh!

goldmedal commented Jun 24, 2025 •

edited

Loading

Uh oh!

alamb left a comment

Uh oh!

alamb Jun 24, 2025

Uh oh!

Uh oh!

alamb commented Jun 25, 2025

Uh oh!

goldmedal commented Jun 26, 2025

Uh oh!

Uh oh!

Allow unparser to override the alias name for the specific dialect #16540

Allow unparser to override the alias name for the specific dialect #16540

Uh oh!

Conversation

goldmedal commented Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

alamb left a comment

Choose a reason for hiding this comment

Uh oh!

alamb Jun 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

alamb commented Jun 25, 2025

Uh oh!

goldmedal commented Jun 26, 2025

Uh oh!

Uh oh!

goldmedal commented Jun 24, 2025 •

edited

Loading