
fix: Add graphql max depth and aliases limits #955

Merged
merged 10 commits into main from sshin/fix/max-depth on Nov 6, 2024

Conversation

suejung-sentry
Contributor

@suejung-sentry suejung-sentry commented Nov 1, 2024

Add protections against Denial of Service attacks on the GraphQL API that use high query depth, high breadth, or high alias counts.

  • For high-depth attacks (deeply nested GraphQL queries) - I added a new validation_rule to ariadne that rejects queries past a max setting. There was no off-the-shelf rule for ariadne, so this is a custom one modeled on the one from the Apollo GraphQL SDK linked by the pentesters.
  • For high-breadth attacks (a ton of fields requested at a given level) - our schema doesn't really offer anything that wide, so the protection against aliases should cover this issue.
  • For high-alias attacks (requesting the same field over and over via aliases) - I added a new validation_rule that counts aliases and rejects queries beyond a max setting. See the same note on high-depth attacks.

Note that we also already have cost validation via ariadne. The additional rules in this PR catch cases where the cost validation is tuned correctly for our day-to-day use cases but may be too lenient against crafted "attacks", such as the ones composed by the pentesters.

We also already have rate limiting on the GraphQL endpoint so that pentest recommendation is already covered.

Closes https://github.com/codecov/internal-issues/issues/918
Closes https://github.com/codecov/internal-issues/issues/917



codecov bot commented Nov 1, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 96.25%. Comparing base (0273319) to head (903ed78).
Report is 1 commit behind head on main.

✅ All tests successful. No failed tests found.

Additional details and impacted files
```
@@           Coverage Diff           @@
##             main     #955   +/-   ##
=======================================
  Coverage   96.25%   96.25%
=======================================
  Files         826      827    +1
  Lines       19048    19090   +42
=======================================
+ Hits        18334    18376   +42
  Misses        714      714
```

| Flag | Coverage | Δ |
| --- | --- | --- |
| unit | 92.52% <100.00%> | +0.01% ⬆️ |
| unit-latest-uploader | 92.52% <100.00%> | +0.01% ⬆️ |

Flags with carried forward coverage won't be shown.


@suejung-sentry suejung-sentry marked this pull request as ready for review November 1, 2024 22:51
@suejung-sentry suejung-sentry requested a review from a team as a code owner November 1, 2024 22:51
```python
        self.max_depth_reached: bool = False
        self.max_depth: int = max_depth

    def enter_operation_definition(
```
Contributor

Just curious, are these base-class functions whose default behavior you had to override?

Similar story with enter_field, leave_field, and enter_document

Contributor Author

Yup! Though I think of it less as "overriding" and more as "implementing the interface" that is defined here. The function names enter_X and leave_X are dispatched dynamically per here, and any method that's not implemented explicitly behaves as a no-op (here).

This is the stuff I looked at in the Ariadne docs & this example implementation

Contributor

Very cool! Thanks for linking all those docs, that was a fun set of reads

```
@@ -201,7 +202,11 @@ def get_validation_rules(
            maximum_cost=settings.GRAPHQL_QUERY_COST_THRESHOLD,
            default_cost=1,
            variables=data.get("variables"),
        )
    ),
    create_max_depth_rule(max_depth=getattr(settings, "GRAPHQL_MAX_DEPTH", 15)),
```
Contributor

Nit: Do we need to do the getattr() here if the value will always be set in the settings_base file?

Nitpicking since the rest of the codebase is pulling off of settings directly 😅

Contributor Author

Yeah, my intention was to be defensive there, since I've seen a lot of dict accesses where only tribal knowledge tells you whether a missing key is a bug.
But thinking about it again, burying a default at this level is the wrong practice. Since settings is something we control, we'd prefer to fail early and actually throw an error rather than have it "be forgiving" here.
Fixed it!
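A plain-Python illustration of that trade-off (the `Settings` class and attribute names here are stand-ins, not the actual codebase):

```python
class Settings:
    """Stand-in for a settings module we fully control."""

    GRAPHQL_MAX_DEPTH = 15


settings = Settings()

# "Forgiving" access: a typo in the attribute name silently falls back
# to the buried default, hiding the misconfiguration.
depth = getattr(settings, "GRAPHQL_MAX_DEPHT", 15)  # typo goes unnoticed

# Fail-fast access: the same typo raises immediately, surfacing the bug
# at startup instead of burying a default at the call site.
try:
    depth = settings.GRAPHQL_MAX_DEPHT  # deliberate typo
except AttributeError:
    depth = None  # real code would crash loudly here, which is the point
```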

Contributor

Haha yep totally agreed! Thanks for making the update

```
@@ -106,6 +106,10 @@

GRAPHQL_INTROSPECTION_ENABLED = False

GRAPHQL_MAX_DEPTH = 15
```
Contributor

Wanted to hear a little more about how you landed on 15 for both of these values. Do you think stage/production/dev should have the same value for each?

Contributor Author

I skimmed the existing queries and the max depth was around ~10, so I thought 15 would give a good buffer. We could potentially go to 20 if we want more wiggle room. I think keeping it the same in all envs makes sense since it's a set-and-forget kind of setting. Also, we are the main consumers of the GraphQL API, so I'd want any queries that would error in prod to be caught first in the lower envs under the same settings.

For aliases, I don't think we ever use more than a handful (<5?) at a time, so a reasonable use case wouldn't go beyond say 15 anyway (and anything above that is likely a malicious actor).

Open to thoughts on those. If either becomes a problem, it's as simple as bumping the value up, and we would catch the issue during development of a new feature anyway.

Contributor

Totally makes sense! I agree, keeping the depth at ~15 makes a lot of sense. We could probably lower the alias one if we really wanted, though it's not a deal breaker.

Contributor

@ajay-sentry ajay-sentry left a comment

Looks good to me!

@suejung-sentry
Contributor Author

Some other things tested:

  • smoke tested around the staging app to confirm no pages regressed as a result of these rules
  • tested a query in staging with aliases above and below the threshold; it behaved as expected
  • will track any latency delta in our request latency histogram

@suejung-sentry suejung-sentry added this pull request to the merge queue Nov 6, 2024
Merged via the queue into main with commit 63124e2 Nov 6, 2024
30 of 32 checks passed
@suejung-sentry suejung-sentry deleted the sshin/fix/max-depth branch November 6, 2024 18:51