Add Slack to the list of filtered web crawlers #5285

rcscott · 2017-04-24T18:21:01Z

This adds Slack to the list of filtered web crawlers. Slack lists 3 different user agents for their crawlers and the Slack regex covers all 3 (https://api.slack.com/robots).

Filtering Twitter's crawler was suggested in #5284, but this should already be filtered by the generic bot[\/\s\)\;] regex. I added tests for this just to ensure that it is actually filtered.

Fixes #5284

mattrobenolt

Can we add a couple tests to cover these?

https://github.com/getsentry/sentry/blob/master/tests/sentry/filters/test_web_crawlers.py

I'd say since these are so easy to test, it'd be worth just piling a few in here.

Again, if not, I'd be happy to take over from here.

mattrobenolt · 2017-04-24T18:57:00Z

src/sentry/filters/web_crawlers.py

+    # Slack - see https://api.slack.com/robots
+    r'Slack',
+    # Twitter - see https://dev.twitter.com/cards/getting-started#crawling
+    r'Twitterbot',


So it seems that this is already being covered by L30 above. No?

Yup I think you're right! I'll get rid of that.

rcscott · 2017-04-24T19:01:48Z

@mattrobenolt got it, I'll throw some tests in there this afternoon.

rcscott · 2017-04-24T19:26:55Z

@mattrobenolt I removed the Twitter filter and added unit tests. I figured I'd add tests for Twitter as well just to be sure.

mattrobenolt

Awesome! Thanks. 🍦

Add Slack and Twitter to the list of filtered web crawlers.

9136af5

rcscott mentioned this pull request Apr 24, 2017

Filter Slackbot crawler #5284

Closed

mattrobenolt reviewed Apr 24, 2017

View reviewed changes

Ryan Scott added 2 commits April 24, 2017 15:14

Remove Twitter filter.

5ebc30e

Add unit tests for filtering Twitterbot and Slack.

cc6bdc9

rcscott changed the title ~~Add Slack and Twitter to the list of filtered web crawlers~~ Add Slack to the list of filtered web crawlers Apr 24, 2017

mattrobenolt approved these changes Apr 24, 2017

View reviewed changes

mattrobenolt merged commit 3d3a81e into getsentry:master Apr 24, 2017

github-actions bot locked and limited conversation to collaborators Dec 23, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Slack to the list of filtered web crawlers #5285

Add Slack to the list of filtered web crawlers #5285

rcscott commented Apr 24, 2017 •

edited

Loading

mattrobenolt left a comment

mattrobenolt Apr 24, 2017

rcscott Apr 24, 2017

rcscott commented Apr 24, 2017

rcscott commented Apr 24, 2017

mattrobenolt left a comment

Add Slack to the list of filtered web crawlers #5285

Add Slack to the list of filtered web crawlers #5285

Conversation

rcscott commented Apr 24, 2017 • edited Loading

mattrobenolt left a comment

Choose a reason for hiding this comment

mattrobenolt Apr 24, 2017

Choose a reason for hiding this comment

rcscott Apr 24, 2017

Choose a reason for hiding this comment

rcscott commented Apr 24, 2017

rcscott commented Apr 24, 2017

mattrobenolt left a comment

Choose a reason for hiding this comment

rcscott commented Apr 24, 2017 •

edited

Loading