@dongjoon-hyun
Member

@dongjoon-hyun dongjoon-hyun commented Oct 25, 2018

What changes were proposed in this pull request?

Our current documentation does not explain how data-source-specific options are passed to the underlying data source. Following the review comment, this PR adds more detailed information and examples. This is a backport of #22801. `orc.column.encoding.direct` is removed since it's not supported in ORC 1.5.2.
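
For readers unfamiliar with the mechanism, here is a minimal PySpark sketch of forwarding extra options to the ORC data source through `DataFrameWriter.option`. The option names and file paths come from the examples in this PR; the helper names `write_orc_with_options` and `demo` are hypothetical and only illustrate the idea, they are not part of the patch.

```python
# Option names taken from the examples in this PR.
ORC_OPTIONS = {
    "orc.bloom.filter.columns": "favorite_color",
    "orc.dictionary.key.threshold": "1.0",
}


def write_orc_with_options(df, path, options):
    """Write `df` as ORC, forwarding each extra option to the data source.

    Hypothetical helper: `df` is expected to be a Spark DataFrame.
    """
    writer = df.write.format("orc")
    for key, value in options.items():
        # DataFrameWriter.option returns the writer, so calls can be chained.
        writer = writer.option(key, value)
    writer.save(path)


def demo(spark):
    """Hypothetical usage; `spark` is an existing SparkSession."""
    df = spark.read.orc("examples/src/main/resources/users.orc")
    write_orc_with_options(df, "users_with_options.orc", ORC_OPTIONS)
```

Calling `demo(spark)` needs a working Spark installation and the `users.orc` sample file from the Spark source tree. Keys that Spark itself does not interpret are handed through to the underlying ORC writer, which is the behavior the updated documentation describes.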

How was this patch tested?

Manual.

# $example on:manual_save_options_orc$
df <- read.df("examples/src/main/resources/users.orc", "orc")
write.orc(df, "users_with_options.orc", orc.bloom.filter.columns = "favorite_color", orc.dictionary.key.threshold = 1.0)
# $example off:manual_save_options_orc$
Member Author

Hi, @felixcheung . This is a backport of #22801 (on master branch).
Could you review this?

orc.bloom.filter.columns 'favorite_color',
orc.dictionary.key.threshold '1.0'
)
{% endhighlight %}
Member Author

Hi, @dbtsai . This is a backport of #22801 without orc.column.encoding.direct.

@SparkQA
SparkQA commented Oct 25, 2018

Test build #98046 has finished for PR 22839 at commit c9d89f4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@dbtsai
Member

dbtsai commented Oct 25, 2018

Thanks, @dongjoon-hyun. This LGTM!

@dongjoon-hyun
Member Author

Thank you so much for the review, @dbtsai .
Merged to branch-2.4.

asfgit pushed a commit that referenced this pull request Oct 25, 2018
…bout extra data source options

## What changes were proposed in this pull request?

Our current doc does not explain how we are passing the data source specific options to the underlying data source. According to [the review comment](#22622 (comment)), this PR aims to add more detailed information and examples. This is a backport of #22801. `orc.column.encoding.direct` is removed since it's not supported in ORC 1.5.2.

## How was this patch tested?

Manual.

Closes #22839 from dongjoon-hyun/SPARK-25656-2.4.

Authored-by: Dongjoon Hyun <dongjoon@apache.org>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
@dongjoon-hyun dongjoon-hyun deleted the SPARK-25656-2.4 branch October 25, 2018 21:16