News and comment
nealrichardson committed Apr 26, 2024
1 parent 878fb5a commit b57daae
Showing 2 changed files with 4 additions and 0 deletions.
1 change: 1 addition & 0 deletions r/NEWS.md
@@ -21,6 +21,7 @@

* R functions that users write that use functions that Arrow supports in dataset queries now can be used in queries too. Previously, only functions that used arithmetic operators worked. For example, `time_hours <- function(mins) mins / 60` worked, but `time_hours_rounded <- function(mins) round(mins / 60)` did not; now both work. These are automatic translations rather than true user-defined functions (UDFs); for UDFs, see `register_scalar_function()`. (#41223)
* `summarize()` supports more complex expressions, and correctly handles cases where column names are reused in expressions.
* The `na_matches` argument to the `dplyr::*_join()` functions is now supported. This argument controls whether `NA` values are considered equal when joining. (#41358)

# arrow 16.0.0

3 changes: 3 additions & 0 deletions r/src/compute-exec.cpp
@@ -418,6 +418,9 @@ std::shared_ptr<acero::ExecNode> ExecNode_Join(
  for (auto&& name : left_keys) {
    left_refs.emplace_back(std::move(name));
    // Populate key_cmps in this loop, one for each key
    // Note that Acero supports having different values for each key, but dplyr
    // only supports one value for all keys, so we're only going to support that
    // for now.
    key_cmps.emplace_back(na_matches ? acero::JoinKeyCmp::IS : acero::JoinKeyCmp::EQ);
  }
  for (auto&& name : right_keys) {
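To illustrate what the comment in the hunk above describes, here is a minimal, self-contained sketch with no Arrow dependency. The `JoinKeyCmp` enum below is a hypothetical stand-in for `acero::JoinKeyCmp`, and `MakeKeyCmps` and `KeysMatch` are illustrative helpers that do not exist in the Arrow codebase. The sketch assumes that `EQ` never matches a null key and that `IS` treats two nulls as equal, which is how the NEWS entry describes `na_matches`.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <vector>

// Hypothetical stand-in for acero::JoinKeyCmp (not the real Arrow type).
enum class JoinKeyCmp { EQ, IS };

// Build one comparison per join key. Every key gets the same setting,
// mirroring the single na_matches value that dplyr exposes.
std::vector<JoinKeyCmp> MakeKeyCmps(const std::vector<std::string>& left_keys,
                                    bool na_matches) {
  std::vector<JoinKeyCmp> key_cmps;
  key_cmps.reserve(left_keys.size());
  for (std::size_t i = 0; i < left_keys.size(); ++i) {
    key_cmps.emplace_back(na_matches ? JoinKeyCmp::IS : JoinKeyCmp::EQ);
  }
  return key_cmps;
}

// Toy model of the two comparisons on nullable integer keys
// (std::nullopt plays the role of NA).
bool KeysMatch(std::optional<int> left, std::optional<int> right,
               JoinKeyCmp cmp) {
  if (!left.has_value() || !right.has_value()) {
    // Assumed semantics: only IS lets two missing keys match each other.
    return cmp == JoinKeyCmp::IS && !left.has_value() && !right.has_value();
  }
  return *left == *right;
}
```

Under these assumptions, `MakeKeyCmps({"a", "b"}, true)` yields two `IS` entries, so rows whose keys are both missing would be joined; with `false`, every entry is `EQ` and missing keys never match.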