Allow expressions to have tuple-valued output type #854

Open
senderista opened this issue Sep 9, 2016 · 0 comments

There are cases where an expression must emit multiple columns that cannot be computed independently (i.e., by separate expressions). One example (via @parmitam) is a chunking expression that explodes an input value into a sequence of chunks, numbering each chunk by its position within the output for its input tuple: ["abc", "def"] -> [(1, "a"), (2, "b"), (3, "c"), (1, "d"), (2, "e"), (3, "f")]. These sequence numbers cannot be assigned after the fact, because by then the association between outputs and their inputs has been lost (as, possibly, has the original order of the outputs). This example could be solved by an expression that outputs tuples of type (LONG_TYPE, STRING_TYPE).
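
To make the chunking example concrete, here is a minimal sketch in plain Python (not code from this project). The names `chunk_with_index` and `explode` are hypothetical, and single-character chunks are assumed only because that matches the example above. The point is that the sequence number and the chunk are produced together, per input value, so the pair has to come from one expression.

```python
from typing import Iterator, List, Tuple


def chunk_with_index(value: str, chunk_size: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (sequence number, chunk) pairs for a single input value.

    Numbering starts at 1 and restarts for every input value.
    chunk_size is an assumed parameter; the example above uses 1.
    """
    for seq, start in enumerate(range(0, len(value), chunk_size), start=1):
        yield seq, value[start:start + chunk_size]


def explode(values: List[str]) -> List[Tuple[int, str]]:
    """Run the chunking expression on each input and concatenate the outputs."""
    return [pair for value in values for pair in chunk_with_index(value)]


print(explode(["abc", "def"]))
# [(1, 'a'), (2, 'b'), (3, 'c'), (1, 'd'), (2, 'e'), (3, 'f')]
```

Once the pairs are flattened into one list, nothing in a bare chunk such as "d" says that it was the first chunk of its input, which is why the numbering cannot be added by a later, separate expression.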

This will require extensive changes in Apply and related operators, since we can no longer associate emit expressions 1:1 with output columns.
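
As a rough illustration of what losing the 1:1 mapping means, the sketch below is plain Python and not the actual Apply operator. The `Expression` class, `output_schema`, `column_ranges`, and `apply_row` are all hypothetical names, and the type constants are plain strings standing in for the real column types. For simplicity it assumes each expression emits exactly one tuple per input row and ignores the multi-row (exploding) case. Each expression declares a list of output types, and the operator derives the output schema and the column range owned by each expression.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-ins for column types, named after the types in this issue.
LONG_TYPE = "LONG_TYPE"
STRING_TYPE = "STRING_TYPE"


class Expression:
    """An emit expression that may fill several output columns."""

    def __init__(self, output_types: Sequence[str], fn: Callable[[tuple], tuple]):
        self.output_types = list(output_types)
        self.fn = fn  # maps one input row to one tuple of len(output_types) values


def output_schema(expressions: List[Expression]) -> List[str]:
    """Concatenate the output types of all expressions, in order."""
    return [t for e in expressions for t in e.output_types]


def column_ranges(expressions: List[Expression]) -> List[Tuple[int, int]]:
    """Half-open range of output column indexes filled by each expression."""
    ranges, start = [], 0
    for e in expressions:
        end = start + len(e.output_types)
        ranges.append((start, end))
        start = end
    return ranges


def apply_row(expressions: List[Expression], row: tuple) -> tuple:
    """Evaluate every expression on one row and lay the results out side by side."""
    out = []
    for e in expressions:
        values = e.fn(row)
        if len(values) != len(e.output_types):
            raise ValueError("expression produced the wrong number of columns")
        out.extend(values)
    return tuple(out)


exprs = [
    Expression([STRING_TYPE], lambda row: (row[0].upper(),)),
    Expression([LONG_TYPE, STRING_TYPE], lambda row: (len(row[0]), row[0][:1])),
]
print(output_schema(exprs))        # ['STRING_TYPE', 'LONG_TYPE', 'STRING_TYPE']
print(column_ranges(exprs))        # [(0, 1), (1, 3)]
print(apply_row(exprs, ("abc",)))  # ('ABC', 3, 'a')
```

Here two expressions fill three output columns, so any code that indexes output columns by expression position would need to go through something like `column_ranges` instead.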
