Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spoofable extractors are used with the knowledge of the risks #2998

Open
yanns opened this issue Oct 19, 2024 · 19 comments
Open

Spoofable extractors are used with the knowledge of the risks #2998

yanns opened this issue Oct 19, 2024 · 19 comments
Milestone

Comments

@yanns
Copy link
Collaborator

yanns commented Oct 19, 2024

The ticket follows the discussion in #2507 (comment)

Some extractors, like Host or Scheme, can use the values of some HTTP headers that could be spoofed by malicious users.

We should find a way to make users aware of the risks of using those extractors.

Some ideas:

  • using unsafe. This is not the idea of unsafe and we would be mis-using it. I think that this can be discarded.
  • encapsulating the value in a new struct like SpoofableValue so that users have to call some function to get the value. The name and the documentation of the function should make the user aware of the risk. Example:
async fn handler(Host(host): Host) -> String {
  val value = host.spoofable_value();
  value
}
@bengsparks
Copy link
Contributor

Perhaps something along the lines of:

/// Wrap spoofable extractor
pub struct Spoofable<E>(pub E);

/// Allow `Spoofable` to be used with spoofable extractors in handlers
impl <S, E> FromRequestParts<S> for Spoofable<E> where E: FromSpoofableRequestParts<S> {

}

/// axum private trait
trait FromSpoofableRequestParts<S>: Sized {
    type Rejection: IntoResponse;

    async fn from_request_parts(
        parts: &mut Parts, 
        state: &S
    ) -> impl Future<Output = Result<Self, Self::Rejection>> + Send;
}

/// Mark `Host` as a spoofable extractor
impl <S> FromSpoofableRequestParts<S> for Host { ... } 

/// Use spoofable extractor
async fn handler(Spoofable(Host(host)): Spoofable<Host>) -> String {
    println("{host}");
}

yanns added a commit that referenced this issue Oct 20, 2024
PoC to check which solution to pick for #2998
@yanns
Copy link
Collaborator Author

yanns commented Oct 20, 2024

I've made one PoC so that we can better imagine how the API would be: #3000

@bengsparks could you also make one for the approach you're suggesting. It seems very interesting!

@mladedav
Copy link
Collaborator

Is it possible to add on either of the extractors something like Host::unspoofable_value(&self) -> Option<String>?

I don't think host can be extracted from anything that cannot be spoofed and scheme could theoretically be extracted from connect info, but the way it is implemented now, it prefers the scheme the client used originally if the server is behind a proxy, i.e. it tries to extract from the proxy headers first which might be what the user is interested in.

If we can only return values extracted from spoofable sources, I feel like the destructuring is the nicer syntax from the current two options, but that's just my opinion. Getting rid of the Spoofable wrapper first also allows users to pass around Host in type-safe manner and we can implement Deref and Into for convenience. If we go with the first option, users would either have to call spoofable_value at every usage site or they would have to pass around a String. Implementing Into or Deref would completely circumvent forcing users to be explicit about acknowledging the spoofable scenario so that could never be added.

For completeness, would you be opposed to just having spoofable-extractors feature which would gate Host and Scheme in their current implementation? It would reduce the noise in handler signatures and users still have to opt-in, although just once for all of them and not explicitly for each use. I guess the question is if it's explicit enough.

@jplatte
Copy link
Member

jplatte commented Oct 21, 2024

How about Host<WithProxyHeaders> and Host<WithoutProxyHeaders> as an alternative? I find "spoofable" sounds a bit awkward, and while the proxy thing may not sound as dangerous, it would still get people thinking.

@yanns
Copy link
Collaborator Author

yanns commented Oct 21, 2024

I personal like having to change the usage site.
I guess it would be very easy to have a function taking a Scheme and forgetting about the risks of using it.
Being force to call spoofable_value makes sure that the person taking care of this particular implementation will be reminded of the consequences.

@mladedav
Copy link
Collaborator

mladedav commented Oct 21, 2024

How about Host<WithProxyHeaders> and Host<WithoutProxyHeaders>

I would see that as another dimension because both the proxy headers and the host header can be spoofed.

@jplatte
Copy link
Member

jplatte commented Oct 21, 2024

Okay so I don't hate any of the options presented so far. They all seem a bit weird but that's almost the point, so not too surprising. @yanns, @mladedav if you can agree on a best solution, feel free to go ahead and merge the corresponding PR and close this issue.

@jplatte jplatte changed the title spoofable extractors are used with the knowledge of the risks Spoofable extractors are used with the knowledge of the risks Oct 21, 2024
@jplatte jplatte added this to the 0.8 milestone Oct 21, 2024
@mladedav
Copy link
Collaborator

@yanns I personally see it as the extractor doing the "unsafe" operation. You ask for the Host and acknowledge that you know what you're doing and then you can pass that value around and use it however you want. Plus the potential for implementing some traits I mentioned before.

This should also be easier to migrate to (which might be bad since people won't have to think about every usage site of Host).

But if you really think it would be better to have users call the spoofable_value method every time, I'll yield (unless @bengsparks wants to argue for the Spoofable<T> wrapper more).

@bengsparks
Copy link
Contributor

bengsparks commented Oct 23, 2024

I personally like my solution for the fact that migration is simple and that the use-site is clearly visible.

I'd argue that outside of creating different extractors for reading different headers, there is no fool-proof way to achieve the desired goal.
If a user doesn't want to read docs / ensure safety / take other precautions, then there is nothing to be done.

With my way, there is a handler-specific marker that calls for further inspection during code review instead of potentially being buried inside of the handler at any given position.

Suggestions and adjustments to the naming of Spoofable and other nomenclature in my PR are welcome :)

@yanns
Copy link
Collaborator Author

yanns commented Oct 29, 2024

My only fear is about:

  • one developer does a change to use a spoofable value.
  • later another developer is using this value without knowing that it can be spoofed.

But I don't have strong opinion here. I guess I'm very (too) sensitive to developers not being careful about security...

@yanns
Copy link
Collaborator Author

yanns commented Nov 7, 2024

I have the feeling that the community consensus is tending towards Spoofable<T> wrapper. To make progress, let's go for this. It's always a step in more security awareness.

@yanns
Copy link
Collaborator Author

yanns commented Nov 7, 2024

My other comment is that the current Host extractor is misleading. Personally, I'd have used it by assuming it's only using the Host header.
By making it Spoofable, it's clearer that the value van be read from different headers.

@sclu1034
Copy link

sclu1034 commented Nov 7, 2024

One thing to note is that while this does raise awareness, there is no way for the user to act on that knowledge. The user can choose to not use the extractor at all, but a far more valuable ability would be to selectively choose only the non-spoofable part, which is not possible, as far as I can tell.

Most notably, I don't see any way that the extractors could be used to implement things like the common TRUSTED_PROXIES setting, where the X-Forwarded-* headers are considered safe selectively based on the source IP address.

@yanns
Copy link
Collaborator Author

yanns commented Nov 13, 2024

@sclu1034 I agree with you.
What kind of approach would you see here?
Should we change those extractors to:

  • have extractors only about non-spoofable values
  • have separated extractors also considering the spoofable part
  • add extractor where we can configure the trusted_proxies
    ?

@sclu1034
Copy link

Ideally, it would be a solution where the logic whether a connection can be trusted has to be implemented only once, rather than again for every extractor or handler.

So I think some kind of middleware that can dynamically strip headers would be best. At that point, the extractors could stay as they are, and simply act as if that header was never sent. And a centralized solution like that would also ensure that manual access to these headers can be trusted likewise.

@bengsparks
Copy link
Contributor

+1 to the middleware suggestion by @sclu1034; the best solution would be such headers simply never arrive.
I was about to cite this comment in support, but then I realised that they're also the author thereof 😄

@jplatte jplatte modified the milestones: 0.8, 0.9 Nov 26, 2024
@Turbo87
Copy link
Collaborator

Turbo87 commented Dec 26, 2024

FWIW after reading up on the discussion here, I'm not a huge fan of the spoofable proposal 🫣

if the goal is to make users aware of the spoofing risks of the Host and Scheme extractors, how about making these extractors configurable and defaulting to only reading from "safe" sources? that way, the user has to explicitly configure the extractor to read from a spoofable source, making them aware of the risk.

if we introduce FromSpoofableRequestParts, do we then also introduce OptionalFromSpoofableRequestParts? it feels like this might be getting out of hand soon 😅

@mladedav
Copy link
Collaborator

mladedav commented Jan 6, 2025

Well, I don't think there is any non-spoofable source for host.

But just implementing impl FromRequestParts for Spoofable<Scheme> and any other types excplicitly wouldn't add more traits. That would mean users cannot implement the same for their own types and we couldn't in axum-extra, which might be desirable for some libraries but I don't think that's too bad.

@sclu1034
Copy link

I don't think there is any non-spoofable source for host.

What's there to "spoof" anyways? Host is not usually used by reverse proxies, so no third party involved.
And curl -H 'Host: foo.com' bar.com, curl -H 'Host: foo.com' foo.com or curl -H 'Host: foo.com' 1.2.3.4 all look the same to the server:

GET / HTTP/1.1
Host: foo.com

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants