Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NotImplementedError: Groupby with lists of columns not yet supported. #75

Closed
mathlf2015 opened this issue Aug 30, 2018 · 2 comments · Fixed by #987
Closed

NotImplementedError: Groupby with lists of columns not yet supported. #75

mathlf2015 opened this issue Aug 30, 2018 · 2 comments · Fixed by #987
Labels
new feature/request 💬 Requests and pull requests for new features
Milestone

Comments

@mathlf2015
Copy link

mathlf2015 commented Aug 30, 2018

System information

  • OS Platform and Distribution (e.g., Linux centos):
  • Modin installed from pip:
  • Modin version 0.1.1:
  • Python version 3.6.6:
  • t1 = t1.groupby(["subid","label"])["cnt"].count().reset_index():

0.1.1

Describe the problem

Groupby with lists of columns not yet supported.

Source code / logs

Waiting for redis server at 127.0.0.1:59835 to respond...
Waiting for redis server at 127.0.0.1:16671 to respond...
Starting local scheduler with the following resources: {'CPU': 56, 'GPU': 4}.
Traceback (most recent call last):
File "gen_fea_online.py", line 318, in
df = get_data_log_hive(pre_time=(2018,8,7))
File "gen_fea_online.py", line 290, in get_data_log_hive
df_vod40 = gen_fea_active_log(df_vod40)
File "gen_fea_online.py", line 238, in gen_fea_active_log
t1 = t1.groupby(["subid","label"])["cnt"].count().reset_index()
File "/root/anaconda3/lib/python3.6/site-packages/modin/pandas/dataframe.py", line 823, in groupby
"Groupby with lists of columns not yet supported.")
NotImplementedError: Groupby with lists of columns not yet supported.

@devin-petersohn devin-petersohn added the new feature/request 💬 Requests and pull requests for new features label Aug 30, 2018
@devin-petersohn
Copy link
Collaborator

Hi @mathlf2015, thanks for the request.

We are currently rewriting the majority of the backend for scalability, performance, and maintainability. With this update, groupby will have better performance, and we should be able to extend it to this. #70 contains the rewrite.

@mathlf2015
Copy link
Author

mathlf2015 commented Aug 31, 2018

thank you for your reply.

@devin-petersohn devin-petersohn added this to the 0.6.4 milestone Dec 19, 2019
devin-petersohn referenced this issue in devin-petersohn/modin Jan 13, 2020
* Resolves #75
* Adds support for grouping by multiple columns.
* Does this grouping by broadcasting the columns.
  * A preliminary performance evaluation shows that it is significantly
    faster than before, but still has some room for improvement.
* Minimal code changes to add this new feature.
* We still default to pandas when the user is looping over the dataframe
  * Even though this is common, it is exceptionally hard to optimize,
    and out of scope for this PR.
devin-petersohn added a commit that referenced this issue Jan 13, 2020
* Resolves #75
* Adds support for grouping by multiple columns.
* Does this grouping by broadcasting the columns.
  * A preliminary performance evaluation shows that it is significantly
    faster than before, but still has some room for improvement.
* Minimal code changes to add this new feature.
* We still default to pandas when the user is looping over the dataframe
  * Even though this is common, it is exceptionally hard to optimize,
    and out of scope for this PR.
dchigarev pushed a commit to dchigarev/modin that referenced this issue Aug 25, 2020
REFACTOR-modin-project#1879: Move logic for `groupby.agg` into query compiler
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Feb 27, 2023
…roject#75) [core]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Feb 27, 2023
…roject#75) [service]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
mvashishtha added a commit to mvashishtha/modin that referenced this issue Feb 27, 2023
…roject#75)

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [core]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [service]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [core]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [service]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [core]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [service]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [core]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
vnlitvinov pushed a commit to vnlitvinov/modin that referenced this issue Mar 16, 2023
…roject#75) [service]

Note for upstream and rebase: Don't worry about the client query compiler change
here as it's just deleting an unused method. Don't need to upstream or add into
the next modin branch on the next rebase.

Signed-off-by: mvashishtha <mahesh@ponder.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new feature/request 💬 Requests and pull requests for new features
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants