-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
furrr stumbles over grouped dataframes #28
Comments
This is because the way you have it set up, With a grouped data frame like this one, with 200 groups, If you don't believe me, run So, yes! It is going to be slower this way, and hopefully in your real example you can think of another way to do it so that |
Hi @DavisVaughan, Anyway, keep up the good work. |
I've added a new issue for documentation updates. Thanks! |
Wouldn't it be a good idea for It would align better with the It is quite common to use grouped dataframes with Also see nest vignette:
I found it quite surprising, that |
I don't think so, that would be very different from what purrr does |
@DavisVaughan it is indeed frustrated to troubleshoot why |
I've run into some wired behavior when using
dplyr
andmutate
and the dataframe is grouped. Calculations withfuture_map
take forever compared to usage ofpurrr::map
. This becomes cumbersome for my workflow which resemblesdf %>% group_by(some_var) %>% nest() %>% mutate(results = future_map(data, some_expensive_calculation)
I've attached a (hopefully helpful) reprex:
Example from github works as expected.
Using
dplyr
'smutate
for adding the new columns.Now for the grouped dataframes:
The calculation with
furrr
on the grouped dataframe takes considerably longer thanpurrr
which I would not expect.Created on 2018-08-02 by the reprex package (v0.2.0).
The text was updated successfully, but these errors were encountered: