
Using a client inside a web handler #1164

Closed
joelwurtz opened this issue Dec 19, 2024 · 7 comments

@joelwurtz
Contributor

joelwurtz commented Dec 19, 2024

I was looking into creating a reverse proxy with only xitca libs. I did manage to get a "working" setup, but I encountered several problems along the way, and I'm wondering whether changes to make this work properly would be accepted / wanted?

Streaming the request body

The first problem comes from streaming the request body from the web request into the client request. The web request is bound to the local thread, which makes it impossible for the body to be Send, and Send is required by the client.

I did manage to relax this constraint, but that breaks the middleware (they also require Send). There may be a way to relax the constraint on middleware too, but that would need some sort of factory for them (like the one on the web side).
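A minimal sketch (not xitca code; all names here are illustrative) of why the Send bound bites: a body tied to thread-local state, modeled with an `Rc`, cannot be handed to any API with a `Send` bound on the body type.

```rust
use std::rc::Rc;

// A body tied to thread-local state: the Rc makes it !Send, just like a
// web request body bound to its worker thread.
struct LocalBody {
    chunk: Rc<Vec<u8>>,
}

// A client-style API with the usual `Send` bound on the streamed body.
fn send_request<B: Send>(body: B) -> B {
    body
}

fn main() {
    let local = LocalBody { chunk: Rc::new(vec![1, 2, 3]) };
    // send_request(local); // error[E0277]: `Rc<Vec<u8>>` cannot be sent between threads safely
    // An owned, Rc-free copy of the data satisfies the bound:
    let owned = send_request(local.chunk.to_vec());
    assert_eq!(owned.len(), 3);
}
```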

Streaming the response body

The second main problem is that the response is tied to the client (due to the shared connection pool), so streaming it straight back is not possible: we don't know whether the client will still be alive while the response is being streamed back. My current workaround was to turn the body into an owned version, but I believe that makes it impossible for the connection to go back into the pool once it is owned.

Also, there would be one client per thread, which means each would have a separate pool of connections. That is not "ideal", but not that much of a problem.

I'm not sure how this can be resolved. I'm wondering if it's possible to split the pool from the client (maybe by default there is no connection pool at all, with builder APIs to opt into a pool that is either shared across threads or not). Or maybe there could be a shared pool where acquiring a connection clones it to the local thread and removes it from the pool, with the inverse operation when it's no longer needed (I need to look into the low-level API to be sure this is doable).
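To make the pool-splitting idea concrete, here is a hypothetical sketch: the pool becomes a trait, and the builder picks `NoPool`, a per-thread pool, or a shared one. None of these names exist in xitca-client; they only illustrate the shape of the proposal.

```rust
use std::collections::HashMap;

// Placeholder for an established connection.
struct Connection {
    id: u32,
}

// The pool as a swappable strategy instead of a fixed part of the client.
trait ConnectionPool {
    fn acquire(&mut self, host: &str) -> Option<Connection>;
    fn release(&mut self, host: &str, conn: Connection);
}

// No pooling at all: every request opens a fresh connection.
struct NoPool;

impl ConnectionPool for NoPool {
    fn acquire(&mut self, _host: &str) -> Option<Connection> {
        None
    }
    fn release(&mut self, _host: &str, _conn: Connection) {}
}

// One idle connection per host, kept on the local thread.
#[derive(Default)]
struct LocalPool {
    idle: HashMap<String, Connection>,
}

impl ConnectionPool for LocalPool {
    fn acquire(&mut self, host: &str) -> Option<Connection> {
        self.idle.remove(host)
    }
    fn release(&mut self, host: &str, conn: Connection) {
        self.idle.insert(host.to_owned(), conn);
    }
}

fn main() {
    let mut pool = LocalPool::default();
    assert!(pool.acquire("example.com").is_none());
    pool.release("example.com", Connection { id: 1 });
    assert_eq!(pool.acquire("example.com").map(|c| c.id), Some(1));
}
```

A client generic over `P: ConnectionPool` could then be thread-safe or thread-per-core depending on which pool it was built with.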

Feel free to close this if it's not wanted; I just wanted to share the problems I encountered while building a reverse proxy.

@fakeshadow
Collaborator

fakeshadow commented Dec 19, 2024

Yes, some of the issues you mention are the reason why xitca-client still has no crate release. Some design decisions explain why these issues exist:

  1. Send bound issue. Unlike the server framework, a thread-per-core design does not really make sense for an HTTP client, for the following reasons:

    • A remote peer only exposes limited resources. In most cases one can only open so many sockets before the remote starts rate limiting, so a global thread-safe client is the better fit, with better resource usage and less accidental breaking of limits. For example, the usual practice is 6 sockets per domain for HTTP/1 and one socket per domain for HTTP/2.
    • For a thread-safe client, the Send bound on the async API is the most sensible default. That said, nothing should stop the API from generating auto trait bounds only when necessary; it's current async Rust's limitations that prevent this. Concretely, xitca-client would need its own Service trait that is not compatible with the xitca_service::Service used in the server crates.
  2. Non-static lifetime issue. This is an intentional experiment in exploring non-static types throughout the xitca crates: the main design goal of xitca-xxx is to reduce memory allocation, and from our observation a clean, easy API that hides lifetimes behind smart pointers is heavily associated with more (non-deterministic) memory usage. This design goal does limit and complicate the user-facing API, and we are open to altering course toward a possible middle ground through the following routes:

    • Expose static-lifetime types, as xitca-http does. This eases the user experience, enabling cleaner types with no lifetime pollution. In exchange, the types have to use smart pointers with reference counting to track lifetimes in the background.
    • Expose non-static-lifetime types as-is and explore new Rust features like async generators to avoid writing lifetimes explicitly. This eases the user experience, but in exchange the API depends solely on the progression of async Rust features to move forward.
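The first route can be sketched in a few lines (illustrative names only, not actual xitca-client types): instead of a response body borrowing the client, the body owns an `Arc` back to the shared state, so it is `'static` and can be streamed after the handler's borrow of the client ends.

```rust
use std::sync::Arc;

// Shared state that a borrowing body would otherwise tie a lifetime to
// (connection pool, config, ...). `name` stands in for real fields.
struct ClientState {
    name: &'static str,
}

// Reference counting keeps the state alive while the body streams,
// at the cost of an allocation and atomic ops per response.
struct OwnedBody {
    state: Arc<ClientState>,
    bytes: Vec<u8>,
}

fn make_body(state: &Arc<ClientState>) -> OwnedBody {
    OwnedBody {
        state: Arc::clone(state),
        bytes: b"hello".to_vec(),
    }
}

fn main() {
    let state = Arc::new(ClientState { name: "client" });
    let body = make_body(&state);
    drop(state); // the body still keeps the shared state alive
    assert_eq!(body.bytes.len(), 5);
    assert_eq!(body.state.name, "client");
}
```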

In other words, my personal preference for resolving these issues is to wait for async Rust to progress and possibly utilize some cutting-edge features to improve the API, namely features like return_type_notation, gen_blocks, async_iterator, etc.

As for making the client more modular, I'm all for it. Making the connection pool generic is a good idea if it contributes to solving these issues.

@fakeshadow
Collaborator

BTW, we have a utility type called FakeSend to help bridge the Send trait bound within the xitca-xxx crates. Example usage can be found in the tower-http compat feature of xitca-web; the relevant code is here:

pub struct CompatReqBody<B, C> {

@fakeshadow
Collaborator

#1168 removed the lifetime param from the response body type.

For the Send bound there is no plan to change it yet. The following snippet is a simple forwarding proxy that can be used as a workaround.

use std::{
    io,
    pin::Pin,
    sync::Arc,
    task::{Context, Poll},
};

use futures::stream::Stream;
use xitca_client::Client;
use xitca_web::{
    App,
    body::{RequestBody, ResponseBody},
    error::{Error, ErrorStatus},
    handler::{body::Body, handler_service, state::StateRef},
    http::{WebRequest, WebResponse},
};

#[tokio::main]
async fn main() -> io::Result<()> {
    App::new()
        .at("/*", handler_service(handler))
        .with_state(Arc::new(Client::new()))
        .serve()
        .bind("0.0.0.0:8080")?
        .run()
        .await
}

async fn handler(
    StateRef(cli): StateRef<'_, Arc<Client>>,
    mut req: WebRequest<()>,
    Body(body): Body<RequestBody>,
) -> Result<WebResponse, Error> {
    if req.headers_mut().remove("proxy-connection").is_none() {
        return Err(ErrorStatus::bad_request().into());
    }

    let res = cli
        .request(req.map(|_| FakeSend::new(body)))
        .send()
        .await
        .map_err(|_| ErrorStatus::internal())?;

    Ok(res.into_inner().map(ResponseBody::box_stream))
}

// Wrapper that re-asserts Send for the thread-local request body (via
// xitca_unsafe_collection's FakeSend) so it satisfies the client's bound.
struct FakeSend<B>(xitca_unsafe_collection::fake::FakeSend<B>);

impl<B> FakeSend<B> {
    fn new(body: B) -> Self {
        Self(xitca_unsafe_collection::fake::FakeSend::new(body))
    }
}

impl<B> Stream for FakeSend<B>
where
    B: Stream + Unpin,
{
    type Item = B::Item;

    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>> {
        Pin::new(&mut *self.get_mut().0).poll_next(cx)
    }

    fn size_hint(&self) -> (usize, Option<usize>) {
        // (0, Some(0)) would claim the stream yields nothing; (0, None)
        // is the conservative hint that is always correct.
        (0, None)
    }
}

@joelwurtz
Contributor Author

joelwurtz commented Dec 26, 2024

Thanks a lot for your insight and for removing the lifetime. I updated my draft PR related to this change (I had already used FakeSend before, but your way of doing it seems simpler, so I'll look into it), and everything works nicely.

I will certainly open other PRs to allow better extensibility at some points in the client, though I'm not sure whether this is wanted. I plan the following changes:

  • allow using a specific socket address for a request: feat(client): allow to specify a specific remote adress on client request #1169
  • allow specific timeouts (connect / resolve / ...) per request (so the same client can be used with different timeouts)
  • allow specifying a specific SNI host per request
  • allow invalid certificates (this is already the case when using the dangerous feature; it will remain gated behind dangerous, but with a new method to allow them, and validation will be on by default)
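A hypothetical sketch of what such per-request options could look like; none of these builder methods exist in xitca-client, they only illustrate the proposed surface:

```rust
use std::net::SocketAddr;
use std::time::Duration;

// Hypothetical per-request options mirroring the list above.
#[derive(Default)]
struct RequestOptions {
    remote_addr: Option<SocketAddr>,      // override DNS resolution
    connect_timeout: Option<Duration>,    // per-request, not per-client
    sni_host: Option<String>,             // SNI different from the URL host
    accept_invalid_certs: bool,           // gated behind a `dangerous` feature
}

impl RequestOptions {
    fn remote_addr(mut self, addr: SocketAddr) -> Self {
        self.remote_addr = Some(addr);
        self
    }
    fn connect_timeout(mut self, t: Duration) -> Self {
        self.connect_timeout = Some(t);
        self
    }
    fn sni_host(mut self, host: impl Into<String>) -> Self {
        self.sni_host = Some(host.into());
        self
    }
}

fn main() {
    let opts = RequestOptions::default()
        .remote_addr("127.0.0.1:8443".parse().unwrap())
        .connect_timeout(Duration::from_secs(3))
        .sni_host("internal.example");
    assert_eq!(opts.sni_host.as_deref(), Some("internal.example"));
    assert!(opts.remote_addr.is_some());
    assert!(!opts.accept_invalid_certs);
}
```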

@fakeshadow
Collaborator

Thanks for your PRs; all contributions are welcome. I'll look into and review them later, as it's pretty busy here during the holiday. Hope you don't mind some delay.

@joelwurtz
Contributor Author

Don't worry, take your time. It's open source, it's normal to have other priorities. Thanks anyway for your quick response time 🫶

@joelwurtz
Contributor Author

I can close this now: all the PRs "needed" to build a reverse proxy have been made, and we can discuss each subject on its own PR. I will certainly open more depending on bugs / features I encounter. Thanks for your time.
