Skip to content

Conversation

@amaliujia
Copy link
Contributor

What changes were proposed in this pull request?

This PR replace column names and columns type with a schema (which is a struct).

Why are the changes needed?

Before this PR, AnalyzeResult separates column names and column types. However these two can be combined to form a schema which is a struct. This PR will simplify that proto message.

Does this PR introduce any user-facing change?

No

How was this patch tested?

UT

@amaliujia
Copy link
Contributor Author

R: @grundprinzip @hvanhovell

@amaliujia amaliujia force-pushed the return_schema_use_struct branch 2 times, most recently from 7cb9507 to 9c7617b Compare October 19, 2022 06:09
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: for readability, move this to the other int type?

@AmplabJenkins
Copy link

Can one of the admins verify this patch?

@amaliujia amaliujia force-pushed the return_schema_use_struct branch from 9c7617b to 4d3a83f Compare October 19, 2022 21:45
@amaliujia
Copy link
Contributor Author

R: @cloud-fan

@amaliujia amaliujia force-pushed the return_schema_use_struct branch from 4d3a83f to c6bd24e Compare October 21, 2022 23:23
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we also need to set nullablity.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok let me fix the nullability in this PR then. Maybe it is a right time now.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Entire nullability is ignored so far in current Connect implementation.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done. I set nullablity now.

@amaliujia amaliujia force-pushed the return_schema_use_struct branch from 6faa578 to bfcd053 Compare October 25, 2022 04:58
@cloud-fan
Copy link
Contributor

thanks, merging to master!

@cloud-fan cloud-fan closed this in 0b3d954 Oct 25, 2022
SandishKumarHN pushed a commit to SandishKumarHN/spark that referenced this pull request Dec 12, 2022
### What changes were proposed in this pull request?

This PR replace column names and columns type with a schema (which is a struct).

### Why are the changes needed?

Before this PR, AnalyzeResult separates column names and column types. However these two can be combined to form a schema which is a struct.  This PR will simplify that proto message.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

UT

Closes apache#38301 from amaliujia/return_schema_use_struct.

Authored-by: Rui Wang <rui.wang@databricks.com>
Signed-off-by: Wenchen Fan <wenchen@databricks.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants