docs(blog): classification metrics on the backend #10501

IndexSeek · 2024-11-15T23:18:30Z

Description of changes

Adding a blog post breaking down how to perform binary classification metrics with Ibis. I did a fair amount of background explanation on these models and these metrics because many Ibis users may not be as familiar with these topics, but we can scale that back if needed and get more to the point.

deepyaman · 2024-11-16T03:32:07Z

Description of changes

Adding a blog post breaking down how to perform binary classification metrics with Ibis. I did a fair amount of background explanation on these models and these metrics because many Ibis users may not be as familiar with these topics, but we can scale that back if needed and get more to the point.

Seems like these would also be useful additions to IbisML! ibis_ml.metrics?

ibis-docs-bot · 2024-11-16T11:06:10Z

Docs preview: https://pr-10501-264061a498a2eea5081db287669bbe2c04f1b02a--ibis-quarto.netlify.app

docs/posts/classification-metrics-on-the-backend/index.qmd

IndexSeek · 2024-11-16T14:08:28Z

Seems like these would also be useful additions to IbisML! ibis_ml.metrics?

I think so! I have given that a good bit of thought and I think it would be worth adding that capability with IbisML. I opened feat: ibis_ml.metrics #174 over there, so hopefully, we can discuss further and plan the approach.

deepyaman

Took a quick look. I personally like the detailed explanations; as you said, a lot of people may not have much ML exposure. I also think this is illustrative, but not necessarily efficient.

I'm guessing this should be a lot more efficient:

>>> tp = (t.actual * t.prediction).sum()
>>> tp
┌───┐
│ 4 │
└───┘
>>> fp = t.prediction.sum() - tp
>>> fp
┌───┐
│ 2 │
└───┘
>>> fn = t.actual.sum() - tp
>>> fn
┌───┐
│ 3 │
└───┘
>>> tn = t.actual.count() - tp - fp - fn
>>> tn
┌───┐
│ 3 │
└───┘

(I borrowed the logic from https://github.com/scikit-learn/scikit-learn/blob/a2448b5ce8778b76f8d8c6e7b0ef9b6cca9c7313/sklearn/metrics/_classification.py#L445, since I was too lazy to think it through myself.)

Since you do explicitly make a point about performance, maybe it makes sense to show the more efficient method after going through the illustrative labeling approach?

Edit: An alternative would be to just show the illustrative approach, add the efficient approach to IbisML, and call the IbisML function to demo the "efficient" path.

IndexSeek · 2024-11-16T17:17:06Z

Thanks for the review and the feedback! I agree. The way you demonstrated calculating the true positives, false positives, etc., does seem much more efficient. It also demonstrates how we can break apart calculations and use them in other expressions with Ibis.

Since you do explicitly make a point about performance, maybe it makes sense to show the more efficient method after going through the illustrative labeling approach?

This is a great idea! The illustrative approach helps cement the concepts, and then the more efficient method would demonstrate assigning expressions as variables as using them in other expressions. Something that is far less convenient to do with pure SQL. I'm happy to incorporate this!

Edit: An alternative would be to just show the illustrative approach, add the efficient approach to IbisML, and call the IbisML function to demo the "efficient" path.

What if we added the above efficient approach to the article as it is now, I follow this up with another blog post on regression metrics. Then we have a third blog post to close out the series that throws back to the first two (e.g., we've previously reviewed and demonstrated how to calculate classification and regression metrics with Ibis, in this post, we'll demonstrate how we can perform these calculations out of the box with IbisML) so that we can tie it all together and create a nice mini series of blog posts.

deepyaman · 2024-11-16T17:49:27Z

Edit: An alternative would be to just show the illustrative approach, add the efficient approach to IbisML, and call the IbisML function to demo the "efficient" path.

What if we added the above efficient approach to the article as it is now, I follow this up with another blog post on regression metrics. Then we have a third blog post to close out the series that throws back to the first two (e.g., we've previously reviewed and demonstrated how to calculate classification and regression metrics with Ibis, in this post, we'll demonstrate how we can perform these calculations out of the box with IbisML) so that we can tie it all together and create a nice mini series of blog posts.

Sounds good to me! From my perspective, part of seeing your posts is also an indicator of what, if anything, somebody may actually want to use Ibis for in the ML space. Happy to use the blogs as a leading indicator. :)

IndexSeek · 2024-11-16T21:27:10Z

I just updated it to incorporate this approach. Thank you for sharing those snippets! Hopefully it flows well - I'm happy to adjust as necessary.

IndexSeek · 2024-11-16T21:32:01Z

docs/posts/classification-metrics-on-the-backend/index.qmd

+t.select(
+    accuracy=accuracy_expr,
+    precision=precision_expr,
+    recall=recall_expr,
+    f1_score=f1_score_expr,
+).limit(1)


Is there a better way we could render these results? I was fiddling around with:

print(f"{accuracy_expr=}, {precision_expr=}, {recall_expr=}, {f1_score_expr=}")

But it wasn't rendering nicely.

.execute() should work (or .to_pyarrow().as_py() or some of the other .to_* export methods)

I ended up using to_pyarrow().as_py(). I suspect some readers may like to see that we can bring this to a Python object.

ibis-docs-bot · 2024-11-16T22:14:24Z

Docs preview: https://pr-10501-66dce135710001b077d7ae067124023f9a4282a3--ibis-quarto.netlify.app

docs/posts/classification-metrics-on-the-backend/index.qmd

deepyaman

Added suggestions for the "efficient" paths, but I think for these there may be no meaningful difference if the computations are already warm on the backend? Probably something you could more easily test if you're interested; leave it up to you whether you want to use these shortcut formulas.

docs/posts/classification-metrics-on-the-backend/index.qmd

deepyaman

Some minor grammatical changes, but otherwise looks great to me!

docs/posts/classification-metrics-on-the-backend/index.qmd

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs/posts/classification-metrics-on-the-backend/index.qmd

IndexSeek · 2024-11-18T12:44:29Z

I'm ready to go with this one if we're good with it! (pending the date edit).

Thanks for your help and the thorough review @deepyaman, I think it greatly improves the post!

gforsyth

Hey @IndexSeek -- this looks good to me!
Do you have a particular date you'd like to release it on?

I feel like @lostmygithubaccount would tell us to not publish it on a Friday.

IndexSeek · 2024-11-21T22:49:21Z

Hey @IndexSeek -- this looks good to me! Do you have a particular date you'd like to release it on?

I feel like @lostmygithubaccount would tell us to not publish it on a Friday.

Sweet! Thank you for the review and approval.

I think this upcoming Monday would work out well, given later in the week many potential US readers would rather be consuming turkey than consuming information on classification metrics.

I edited my suggestion above so it is easier to tweak when we are ready to go if that date is okay.

ibis-docs-bot · 2024-11-21T23:46:09Z

Docs preview: https://pr-10501-746edcb9a5f5ad004cab4de949c8ce5ba67d01d9--ibis-quarto.netlify.app

docs/posts/classification-metrics-on-the-backend/index.qmd

lostmygithubaccount · 2024-11-22T02:54:20Z

I feel like @lostmygithubaccount would tell us to not publish it on a Friday.

generally wouldn't recommend publishing on Friday + a lot of people will be out all of next week for Thanksgiving. but idk, maybe people want something to read still

great blog! not necessary, but could be cool to demonstrate a plot of the confusion matrix with one of the visualization libraries

also this reminded me of what could be a cool follow up blog for using binary classification to detect data drift over time (described as two-sample tests here: https://arxiv.org/abs/1610.06545 and various other articles since). it's a really cool application and in theory Ibis + XGBoost or LightGBM makes it trivial to implement on a ton of backends

IndexSeek · 2024-11-22T12:47:33Z

great blog! not necessary, but could be cool to demonstrate a plot of the confusion matrix with one of the visualization libraries

Thank you! This is a great idea; I will tweak this to support plotting this either this evening or over the weekend.

also this reminded me of what could be a cool follow up blog for using binary classification to detect data drift over time (described as two-sample tests here: https://arxiv.org/abs/1610.06545 and various other articles since). it's a really cool application and in theory Ibis + XGBoost or LightGBM makes it trivial to implement on a ton of backends

I haven't previously used binary classification to detect drift, but it does seem like a clever application! I like the idea a lot; exploring and providing a write-up showing how Ibis can make this easy regardless of where the data is would be cool. We could also use Ibis to detect feature drift; that is something else I've been thinking about a lot. I think the implementation would be more straightforward than alternatives.

IndexSeek · 2024-11-23T20:19:00Z

@lostmygithubaccount - I added a visualization using Seaborn for the confusion matrix. As I was doing this, I felt there might have been a better way to get the Ibis expression into a compatible array, but this was the "cleanest" way I could come up with. I am open to tweaking if there might be a more optimal way!

gforsyth · 2024-12-04T20:16:00Z

Hey @IndexSeek -- sorry I dropped the ball on this over the holiday weekend. Do we want to release this today or tomorrow?

IndexSeek · 2024-12-04T22:59:00Z

Hey @IndexSeek -- sorry I dropped the ball on this over the holiday weekend. Do we want to release this today or tomorrow?

No worries at all! I'm happy to release this tomorrow if that works for you.

Co-authored-by: Tyler White <50381805+IndexSeek@users.noreply.github.com>

gforsyth · 2024-12-05T02:08:01Z

Sounds good -- I'm going to merge it after CI passes, but we can publicize it starting tomorrow!

docs(blog): classification metrics on the backend

264061a

github-actions bot added the docs Documentation related issues or PRs label Nov 15, 2024

cpcloud added the docs-preview Add this label to trigger a docs preview label Nov 16, 2024

ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Nov 16, 2024

IndexSeek mentioned this pull request Nov 16, 2024

feat: ibis_ml.metrics ibis-project/ibis-ml#174

Open

deepyaman reviewed Nov 16, 2024

View reviewed changes

docs(blog): incorporate efficient approach

66dce13

IndexSeek commented Nov 16, 2024

View reviewed changes

deepyaman added the docs-preview Add this label to trigger a docs preview label Nov 16, 2024

ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Nov 16, 2024

deepyaman reviewed Nov 16, 2024

View reviewed changes

deepyaman suggested changes Nov 16, 2024

View reviewed changes

IndexSeek and others added 7 commits November 16, 2024 21:13

docs(blog): update precision_expr assignment

cd58843

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): update recall_expr assignment

100ece7

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): update f1_score_expr assignment

3c93d03

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): tweak verbiage on post intent

d1470fe

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): update accuracy_expr assignment

180e8ff

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): make breakdown two words

6aa167c

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): explain confusion matrix more concisely

ef63e90

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

IndexSeek force-pushed the classification-metrics-blog branch from 64e025a to ef63e90 Compare November 17, 2024 02:24

IndexSeek and others added 2 commits November 16, 2024 21:26

docs(blog): fix sentence to make sense

37533c1

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

docs(blog): adjust wording on efficient approach

9eef753

Co-authored-by: Deepyaman Datta <deepyaman.datta@utexas.edu>

IndexSeek commented Nov 17, 2024

View reviewed changes

docs/posts/classification-metrics-on-the-backend/index.qmd Show resolved Hide resolved

docs(blog): print metrics as python types

7e9d2e9

IndexSeek commented Nov 18, 2024

View reviewed changes

docs/posts/classification-metrics-on-the-backend/index.qmd Outdated Show resolved Hide resolved

docs(blog): exclude explicit size on confusion matrix table

746edcb

gforsyth approved these changes Nov 21, 2024

View reviewed changes

lostmygithubaccount added the docs-preview Add this label to trigger a docs preview label Nov 21, 2024

ibis-docs-bot bot removed the docs-preview Add this label to trigger a docs preview label Nov 21, 2024

docs(blog): add confusion matrix seaborn viz

fddc762

Merge branch 'main' into classification-metrics-blog

77a49b4

Update docs/posts/classification-metrics-on-the-backend/index.qmd

e89341a

Co-authored-by: Tyler White <50381805+IndexSeek@users.noreply.github.com>

gforsyth merged commit aafb30f into ibis-project:main Dec 5, 2024
20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(blog): classification metrics on the backend #10501

docs(blog): classification metrics on the backend #10501

IndexSeek commented Nov 15, 2024

deepyaman commented Nov 16, 2024

Description of changes

ibis-docs-bot bot commented Nov 16, 2024

IndexSeek commented Nov 16, 2024

deepyaman left a comment •

edited

Loading

IndexSeek commented Nov 16, 2024

deepyaman commented Nov 16, 2024

IndexSeek commented Nov 16, 2024

IndexSeek Nov 16, 2024 •

edited

Loading

deepyaman Nov 16, 2024

IndexSeek Nov 17, 2024

ibis-docs-bot bot commented Nov 16, 2024

deepyaman left a comment

deepyaman left a comment

IndexSeek commented Nov 18, 2024

gforsyth left a comment

IndexSeek commented Nov 21, 2024

ibis-docs-bot bot commented Nov 21, 2024

lostmygithubaccount commented Nov 22, 2024

IndexSeek commented Nov 22, 2024

IndexSeek commented Nov 23, 2024

gforsyth commented Dec 4, 2024

IndexSeek commented Dec 4, 2024 •

edited

Loading

gforsyth commented Dec 5, 2024

docs(blog): classification metrics on the backend #10501

docs(blog): classification metrics on the backend #10501

Conversation

IndexSeek commented Nov 15, 2024

Description of changes

deepyaman commented Nov 16, 2024

Description of changes

ibis-docs-bot bot commented Nov 16, 2024

IndexSeek commented Nov 16, 2024

deepyaman left a comment • edited Loading

Choose a reason for hiding this comment

IndexSeek commented Nov 16, 2024

deepyaman commented Nov 16, 2024

IndexSeek commented Nov 16, 2024

IndexSeek Nov 16, 2024 • edited Loading

Choose a reason for hiding this comment

deepyaman Nov 16, 2024

Choose a reason for hiding this comment

IndexSeek Nov 17, 2024

Choose a reason for hiding this comment

ibis-docs-bot bot commented Nov 16, 2024

deepyaman left a comment

Choose a reason for hiding this comment

deepyaman left a comment

Choose a reason for hiding this comment

IndexSeek commented Nov 18, 2024

gforsyth left a comment

Choose a reason for hiding this comment

IndexSeek commented Nov 21, 2024

ibis-docs-bot bot commented Nov 21, 2024

lostmygithubaccount commented Nov 22, 2024

IndexSeek commented Nov 22, 2024

IndexSeek commented Nov 23, 2024

gforsyth commented Dec 4, 2024

IndexSeek commented Dec 4, 2024 • edited Loading

gforsyth commented Dec 5, 2024

deepyaman left a comment •

edited

Loading

IndexSeek Nov 16, 2024 •

edited

Loading

IndexSeek commented Dec 4, 2024 •

edited

Loading