Relax parsing restrictions around host and hostname #1606

pvande · 2020-02-16T08:12:33Z

This approach allows us to match and process any potential input and still meet the contractual requirements of Request#split_authority without requiring the input to be well-formed or RFC-compliant. In turn, this eliminates unexpected failures on invalid-but-functional inputs.

If we deemed it necessary to do validation (particularly on IP addresses), the AUTHORITY regex still provides enough context on which validations should apply, though doing so would require some consideration on what should be returned upon validation failure.

If validations can be ruled unnecessary – I suspect this should be the case – the implementation could be replaced by the semantically identical:

def split_authority(authority)
  /\A(?<host>\[\g<addr>\]|(?<addr>.*?))(:(?<port>\d+))?\Z/ =~ authority
  return host, addr, port&.to_i
end

This gives up differentiation between IPv6, IPv4, and DNS addresses, but is arguably simpler.

(Resolves #1604)

This approach allows us to match and process any potential input and still meet the contractual requirements of `Request#split_authority` _without_ requiring the input to be well-formed or RFC-compliant. In turn, this eliminates unexpected failures on invalid-but-functional inputs. If we deemed it necessary to do validation (particularly on IP addresses), the `AUTHORITY` regex still provides enough context on which validations should apply, though doing so would require some consideration on what should be returned upon validation failure. If validations can be ruled unnecessary – I suspect this should be the case – the implementation could be replaced by the semantically identical: ``` ruby def split_authority(authority) /\A(?<host>\[\g<addr>\]|(?<addr>.*?))(:(?<port>\d+))?\Z/ =~ authority return host, addr, port&.to_i end ``` This gives up differentiation between IPv6, IPv4, and DNS addresses, but is arguably simpler.

ioquatix · 2020-02-16T10:19:26Z

This looks good to me, you took what I did and made it way better. Thanks!

ioquatix · 2020-02-16T10:20:08Z

I’ll backport to 2-2-stable as I consider this a bug fix.

ioquatix · 2020-02-16T10:47:35Z

lib/rack/request.rb

-      AUTHORITY = /^
-        # The host:
+      AUTHORITY = /
+        \A


@pvande what is the difference between \A and \Z vs ^ and $?

\A matches the beginning of the string. ^ matches any beginning of line in the string. \Z matches the end of string, allowing possible newline at the end of the string. $ matches the end of any line in the string.

In general, you are much more likely to want \A and \Z (or \z, which matches the end of string) than ^ and $. Using ^ and $ usually leads to bugs, in my experience.

pvande added 2 commits February 16, 2020 00:07

Add a CHANGELOG.md entry

97efb68

pvande requested a review from ioquatix February 16, 2020 08:23

ioquatix merged commit 766761b into rack:master Feb 16, 2020

ioquatix added this to the v2.2.3 milestone Feb 16, 2020

ioquatix reviewed Feb 16, 2020

View reviewed changes

pvande deleted the relax-hostname-validations branch February 16, 2020 19:11

This was referenced Apr 20, 2023

Port 80 being used even when using a proper host or SERVER_PORT ncr/rack-proxy#118

Open

HTTP_HOST not parsing port correctly at 2.X #2070

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relax parsing restrictions around host and hostname #1606

Relax parsing restrictions around host and hostname #1606

pvande commented Feb 16, 2020 •

edited

Loading

ioquatix commented Feb 16, 2020

ioquatix commented Feb 16, 2020

ioquatix Feb 16, 2020

jeremyevans Feb 16, 2020

Relax parsing restrictions around host and hostname #1606

Relax parsing restrictions around host and hostname #1606

Conversation

pvande commented Feb 16, 2020 • edited Loading

ioquatix commented Feb 16, 2020

ioquatix commented Feb 16, 2020

ioquatix Feb 16, 2020

Choose a reason for hiding this comment

jeremyevans Feb 16, 2020

Choose a reason for hiding this comment

pvande commented Feb 16, 2020 •

edited

Loading