Skip to content

Latest commit

 

History

History
126 lines (77 loc) · 4.5 KB

NEWS.md

File metadata and controls

126 lines (77 loc) · 4.5 KB

dtplyr (development version)

  • Grouped filter, mutate, and slice no longer affect ordering of output (#178).

  • Grouped filters now use a much faster translation based on .I rather than .SD (and involving an intermediate assignemnt) (#176). Thanks to suggestion from @myoung3 and @ColeMiller1.

  • compute() now creates an intermediate assignment within the translation. This will generally have little impact on performance but it allows you to use intermediate variables to simplify complex translations.

  • All verbs now have (very basic) documentation pointing back to the dplyr generic, and providing a (very rough) description of the translation accompanied with a few examples.

  • Show total rows when printing a lazy_dt().

  • select() after grouped filter no longer incorrectly collapses into a single [ call (#127).

  • dtplyr can now translate across() (#154).

  • Objects now printing grouping if present.

  • dtplyr now supports group_map() and group_walk() (#108).

  • dtplyr now supports relocate() (@smingerson, #162).

  • dtplyr now supports rename_with() (#160)

  • rename() and select() support dplyr 1.0.0 tidyselect syntax (apart from predicate functions which can't easily work on lazily evaluted data tables).

  • x[[1]] is now translated correctly.

  • Added translations for cur_data() (.SD), cur_group() (.BY), cur_group_id() (.GRP), and cur_group_rows() (.I`) (#166).

  • as.data.table() always calls [] so that the result will print (#146).

  • Implement count() method for compatibility with dplyr 1.0.0 (#159).

  • Anonymous functions work in mutate() and summarise() (@smingerson, #155)

  • Environment variables used in the i argument of [.data.table are now correctly inlined when not in the global environment (#164).

  • T and F are correctly translated to TRUE and FALSE (#140).

dtplyr 1.0.1

  • Better handling for .data and .env pronouns (#138).

  • dplyr verbs now work with NULL inputs (#129).

  • joins do better job at determining output variables in the presence of duplicated outputs (#128). When joining based on different variables in x and y, joins consistently preserve column from x, not y (#137).

  • lazy_dt() objects now have a useful glimpse() method (#132).

  • group_by() now has an arrange parameter which, if set to FALSE, sets the data.table translation to use by rather than keyby (#85).

  • rename() now works without data.table attached, as intended (@michaelchirico, #123).

  • dtplyr has been re-licensed as MIT (#165).

dtplyr 1.0.0

  • Converted from eager approach to lazy approach. You now must use lazy_dt() to begin a translation pipeline, and must use collect(), as.data.table(), as.data.frame(), or as_tibble() to finish the translation and actually perform the computation (#38).

    This represents a complete overhaul of the package replacing the eager evaluation used in the previous releases. This unfortunately breaks all existing code that used dtplyr, but frankly the previous version was extremely inefficient so offered little of data.table's impressive speed, and was used by very few people.

  • dtplyr provides methods for data.tables that warning you that they use the data frame implementation and you should use lazy_dt() (#77)

  • Joins now pass ... on to data.table's merge method (#41).

  • ungroup() now copies its input (@christophsax, #54).

  • mutate() preserves grouping (@christophsax, #17).

  • if_else() and coalesce() are mapped to data.table's fifelse() and fcoalesce() respectively (@michaelchirico, #112).

dtplyr 0.0.3

  • Maintenance release for CRAN checks.

  • inner_join(), left_join(), right_join(), and full_join(): new suffix argument which allows you to control what suffix duplicated variable names receive, as introduced in dplyr 0.5 (#40, @christophsax).

  • Joins use extended merge.data.table() and the on argument, introduced in data.table 1.9.6. Avoids copy and allows joins by different keys (#20, #21, @christophsax).

dtplyr 0.0.2

  • This is a compatibility release. It makes dtplyr compatible with dplyr 0.6.0 in addition to dplyr 0.5.0.

dtplyr 0.0.1

  • distinct() gains .keep_all argument (#30, #31).

  • Slightly improve test coverage (#6).

  • Install devtools from GitHub on Travis (#32).

  • Joins return data.table. Right and full join are now implemented (#16, #19).

  • Remove warnings from tests (#4).

  • Extracted from dplyr at revision e5f2952923028803.