Review notebook 6 "Calculate statistics" #8

jorisvandenbossche · 2019-09-11T06:52:54Z

No description provided.

review-notebook-app · 2019-09-11T06:52:59Z

Check out this pull request on

You'll be able to see the Jupyter notebook diff and discuss changes. Powered by ReviewNB.

justmarkham · 2019-09-27T18:16:06Z

notebooks/6_calculate_statistics.ipynb

@@ -0,0 +1,873 @@
+{


This is an excellent attempt at explaining groupby with a diagram! However, I think it would be easier to understand if there were only 2 groups in the diagram, rather than 3. Part of the reason is that the 4 shades of green are difficult to distinguish between.
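For reference, the split-apply-combine idea that the diagram depicts can also be written out by hand. This is only a sketch on hypothetical toy data with two groups (not the real Titanic dataset, and not the notebook's own code), compared against the usual one-liner:

```python
import pandas as pd

# Hypothetical toy data with only two groups, not the real Titanic dataset.
df = pd.DataFrame({
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0],
})

# Split: iterating over the groupby object yields one sub-DataFrame per value of "Sex".
# Apply: compute the mean of "Age" within each sub-DataFrame.
manual = {key: group["Age"].mean() for key, group in df.groupby("Sex")}

# Combine: collect the per-group results into a single Series.
combined = pd.Series(manual, name="Age").rename_axis("Sex")

# The one-liner performs the same three steps internally.
one_liner = df.groupby("Sex")["Age"].mean()

print(combined)
print(one_liner)
```

With two groups, the manual version stays short enough to read next to a two-group diagram.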


justmarkham · 2019-09-27T18:16:07Z

notebooks/6_calculate_statistics.ipynb

@@ -0,0 +1,873 @@
+{


I didn't quite follow why you created `titanic_subset` and did a groupby on it, and then separately did a groupby on `titanic`, even though they give the same result (except for the data type of the result). I think there is a teaching point you were trying to emphasize by making that distinction, but I couldn't quite figure it out!
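To make the data type difference concrete, here is a minimal sketch of the two variants. The rows are hypothetical toy data, not the real Titanic dataset; only the names `titanic` and `titanic_subset` come from the notebook:

```python
import pandas as pd

# Hypothetical toy data, not the real Titanic dataset.
titanic = pd.DataFrame({
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0],
    "Fare": [7.25, 71.28, 7.92, 8.05],
})

# Variant 1: subset first, then group.
# The result is a DataFrame with a single column, "Age".
titanic_subset = titanic[["Sex", "Age"]]
as_frame = titanic_subset.groupby("Sex").mean()

# Variant 2: group first, then select the column.
# The result is a Series named "Age".
as_series = titanic.groupby("Sex")["Age"].mean()

print(type(as_frame).__name__)
print(type(as_series).__name__)
```

The values are identical in both cases; only the container differs.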


stijnvanhoey · 2019-10-14

Thanks for noticing, I'll try to improve the _storyline_. The reason for having the subset is to avoid the need to select a column (e.g. `Age`) after the groupby operation. That way, I can first explain `titanic.groupby("Sex").mean()` before doing `titanic.groupby("Sex")["Age"].mean()`.

Maybe it would be better to work without the subset: just do `titanic.groupby("Sex").mean()`, get the per-group average of all the columns (`PassengerId`, `Survived`, `Pclass`, `Age`, `SibSp`, `Parch` and `Fare`), and then explain how to select a single column of interest?

After trying it out, I'd rather stick to `titanic[["Sex", "Age"]].groupby("Sex").mean()` as the solution to the specific question. It draws less attention to the subset than before, but still keeps the focus on the two columns involved.
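The progression discussed above could look roughly like the sketch below. The data is a hypothetical toy frame, not the real Titanic dataset, and the `Name` column only stands in for the dataset's non-numeric columns. One caveat worth flagging: in recent pandas releases (2.0 and later, as far as I know), `groupby(...).mean()` no longer silently drops non-numeric columns, so the all-columns step needs `numeric_only=True`:

```python
import pandas as pd

# Hypothetical toy data; "Name" stands in for the non-numeric columns of the real dataset.
titanic = pd.DataFrame({
    "Name": ["Passenger A", "Passenger B", "Passenger C", "Passenger D"],
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, 26.0, 35.0],
    "Fare": [7.25, 71.28, 7.92, 8.05],
})

# Step 1: the per-group average of every numeric column.
# numeric_only=True skips "Name"; without it, recent pandas versions raise a TypeError.
all_columns = titanic.groupby("Sex").mean(numeric_only=True)

# Step 2: keep only the two columns involved in the question, then group.
age_only = titanic[["Sex", "Age"]].groupby("Sex").mean()

print(all_columns)
print(age_only)
```

Selecting `["Sex", "Age"]` up front also sidesteps the `numeric_only` question, since the only column left to aggregate is numeric.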

justmarkham · 2019-09-27T18:16:07Z

notebooks/6_calculate_statistics.ipynb

@@ -0,0 +1,873 @@
+{


typo: "Nan" should be "NaN"


Add 6_calculate_statistics.ipynb from master

810c48a

stijnvanhoey Oct 14, 2019
