feat: add `coroutine::await_in_coroutine` to await awaitables in coroutine context #3611

wyfo · 2023-11-30T15:41:51Z

Relates to #1632

Draft based on #3610

codspeed-hq · 2023-11-30T16:35:29Z

CodSpeed Performance Report

Merging #3611 will not alter performance

_{Comparing wyfo:pyfuture (f9084e5) with main (ad5f6d4)}

Summary

✅ 83 untouched benchmarks

davidhewitt · 2023-12-17T08:04:36Z

@wyfo I'm sorry for the painfully slow review here.

Given that allow_threads is currently under intense scrutiny and we haven't yet decided on a concrete solution, #3610 risks getting stuck for a bit while we figure out a way forward for that API.

Does this patch rely on #3610 getting merged, or is it possible to rebase this on main? That would allow us to move forward with some of these remaining async PRs.

wyfo · 2024-04-07T02:00:29Z

There is one thing bothering me with this implementation: the name PyFuture. In fact, there is Future object in Python, and it doesn't match the underlying type of PyFuture.
PyFuture is the result of calling __await__, so it's an iterator, but that's all we know, and I don't know any naming convention for this object – PEP 492 doesn't give it a name.

I've chosen PyFuture because it sounds like Rust Future, but that's maybe not a good reason. If you have some name suggestion, I'm interested.

wyfo · 2024-04-07T11:46:06Z

The error in CI (other than non_local_definition lint) is due to the bug mentioned in #4055.

wyfo · 2024-05-07T08:37:16Z

As written above, the name PyFuture was not a good name (the Python object doesn't even have a name, see this discussion), so I dropped it here to reuse it in #4057 where it's more suited.
Instead, I chose the more explicit name await_in_coroutine, making it easier to document, and for the user to understand that it must be used in coroutine context.

…ine context

wyfo · 2025-01-15T23:09:35Z

@davidhewitt As promised, the PR has been rebased on main and is now ready to review.

davidhewitt

Thanks for this epic piece of engineering, and I'm very sorry for my long delay in review; it's a complex piece of code and I've only now reviewed!

Overall this looks great and makes sense to proceed with. I asked a ton of questions and had suggestions for cleanup, I think once these are addressed there will likely be another round of review. Now that my head is much more engaged with this code I promise the next review will come much sooner! 🙏

davidhewitt · 2025-04-11T15:46:23Z

guide/src/async-await/awaiting_python_awaitables.md

+
+## Restrictions
+
+As the name suggests, `await_in_coroutine` resulting future can only be awaited in coroutine context. Otherwise, it


By "coroutine context", I understand that means "within an async call stack underneath a #[pyfunction] or #[pymethods]"?

Maybe we can adjust wording to something like that? It took me a few reads to process this, and it's maybe not obvious to all users that async Rust functions become coroutines.

You're right, I will try to reword it. But it will maybe more easily understandable when the Coroutine type will be stabilized.

davidhewitt · 2025-04-11T15:48:59Z

pyo3-ffi/src/abstract_.rs

+    pub fn PyIter_Send(
+        iter: *mut PyObject,
+        arg: *mut PyObject,
+        presult: *mut *mut PyObject,
+    ) -> c_int;


I think this was fixed in #4746, will need to rebase.

davidhewitt · 2025-04-11T16:05:23Z

src/coroutine/awaitable.rs

+/// })
+/// # }
+/// ```
+pub fn await_in_coroutine(


API question: do you think it's better to have it this way or as a method on PyAnyMethods?

e.g. obj.bind(py).await_in_coroutine()?

Is there ever a reasonable way to await one a Python awaitable without being in coroutine context? I guess that there would be no way to send / throw / close because the enclosing context would not have such a mechanism. Even more, I suppose that driving a Python awaitable really requires you to be in a Python event loop (i.e. in coroutine context).

That makes me think, is the key difference that "coroutine context" means something to be run on the Python event loop, as opposed to in a Rust runtime?

Coroutine context technically means that the Waker inside the Context used to poll the Future is the one instantiated in Coroutine::poll_inner. That's not really related to being run on the Python event loop, even if you expect your Coroutine object to be run on it.
That concept should be properly documented, as you mentioned in another comment.

Is there ever a reasonable way to await one a Python awaitable without being in coroutine context?

Awaiting a Python awaitable means delegating send/throw/close when it's possible. That's the goal of this PR, to make things as much as it would be done in pure Python.

Actually, it would be technically feasible to "await" a python awaitable in a Rust runtime, but the main issue is that Python async stack is not supposed to be thread safe, and I assume that most awaitables assume to be run by a Python event loop by using asyncio API, so you should expect to have issue if you don't use the Python runtime.
Another point is that Python awaitables have a different semantic than Rust future regarding cancellation: Python awaitable should always be run until completion, there is no such thing as "dropped in the middle of execution" like Rust future (otherwise, finally block are not executed, and that's so much counterintuitive to debug; I speak from experience, as I've already seen encountered nasty things like this). So wrapping an awaitable in a Rust future that is not guaranteed to be polled into completion is not the best thing to do — Coroutine are not technically guaranteed to be run until completion, but again, they are supposed to be run on the Python event loop.
That's why I didn't think at that time it's worth to allow something almost guaranteed to behave or fail badly, and I put this conservative protection, i.e. only allows await_in_coroutine inside "coroutine context"; I do think that panicking is better that returning a PyErr wrapping things like RuntimeError: no running event loop.

API question: do you think it's better to have it this way or as a method on PyAnyMethods?

I didn't think about it, but yes, it could be better than a badly named free function like this one.

Makes total sense, thank you 👍

src/coroutine/awaitable.rs

src/lib.rs

src/coroutine.rs

davidhewitt · 2025-04-11T17:01:47Z

src/coroutine/asyncio.rs

+fn get_running_loop(py: Python<'_>) -> PyResult<Bound<'_, PyAny>> {
+    static GET_RUNNING_LOOP: GILOnceCell<PyObject> = GILOnceCell::new();
+    let import = || -> PyResult<_> {
+        let module = py.import("asyncio")?;
+        Ok(module.getattr("get_running_loop")?.into())
+    };
+    GET_RUNNING_LOOP
+        .get_or_try_init(py, import)?
+        .bind(py)
+        .call0()


Same comment that we can use GILOnceCell::import to simplify here.

A very welcomed QOL improvement indeed!

src/coroutine/asyncio.rs

davidhewitt · 2025-04-11T17:04:52Z

src/coroutine/asyncio.rs

+        // `asyncio.Future` must be awaited; in normal case, it implements  `__iter__ = __await__`,
+        // but `create_future` may have been overriden
+        let mut iter = match PyIterator::from_object(self.future.bind(py)) {


Sounds like this might go wrong if the overridden future defines __next__ and also has important logic in its __await__ method. Should we just always do the call to __await__ (maybe via a slot call again to avoid Python dispatch)?

Indeed, it might go wrong, even if it would be very surprising to do such thing. I only know two alternative event loops:

uvloop, by far the most used after asyncio I belive, reuse asyncio.Future;

leviathan defines __await__ as __iter__.
Also, alternative implementations would try to be compatible with asyncio.Future, and this one defines __iter__ = __await__. That's why I think it can be a reasonable assumption to rely on the iterator protocol for an object returned by create_future. And I obviously chose to use the iterator protocol for performance reason over Python dispatch.

However, I didn't think about using am_await slot (don't know how to do for now, will dig), so I should maybe benchmark both approaches to decide which one to keep.

See src/internal/get_slot.rs and also https://docs.python.org/3/c-api/typeobj.html#c.PyAsyncMethods.am_await

src/coroutine/asyncio.rs

Co-authored-by: David Hewitt <[email protected]>

wyfo

Glad to read you again! And thank you a lot for this detailed review!
As I've written in some comments, Rust 1.83 changes things radically. My hack is no more needed, but there is more. In fact, Waker::vtable could be used to detect this much written-about "coroutine context", and it could be use to handle cancellation in a much better way.
CancelHandle is indeed no longer needed. We could just provide an asynchronous function waiting for thrown exception, and awaiting it would register the exception catch in Coroutine state machine. We could also do the same for close and GeneratorExit exception.
In fact, I would like to use Coroutine static method for that, because it would make it clearer that this features are related to being polled in a coroutine context. For example:

impl Coroutine {
    /// Returns the argument of `Coroutine::throw` whenever it's called.
    async fn catch_throw() -> PyResult<()> {
        todo!()
    }
    /// Returns whenever `Coroutine::close` is called.
    async fn catch_close() {
        todo!()
    }
    /// Wrap a Python awaitable into a Rust `Future` that can be awaited in the context of a `Coroutine`.
    async fn delegate(
        awaitable: obj: &Bound<'_, PyAny>
    ) -> PyResult<impl Future<Output = PyResult<PyObject>> + Send + Sync + 'static> {
        todo!()
    }

What do you think about that idea?

wyfo · 2025-04-15T08:11:25Z

guide/src/async-await/awaiting_python_awaitables.md

+
+## Restrictions
+
+As the name suggests, `await_in_coroutine` resulting future can only be awaited in coroutine context. Otherwise, it


You're right, I will try to reword it. But it will maybe more easily understandable when the Coroutine type will be stabilized.

src/coroutine.rs

wyfo · 2025-04-15T10:32:44Z

src/coroutine.rs

+    fn close(&mut self, py: Python<'_>) -> PyResult<()> {
+        match self.poll(py, CoroOp::Close) {
+            Ok(_) => Ok(()),
+            Err(err) if err.is_instance_of::<PyGeneratorExit>(py) => Ok(()),


Because that's how coroutine are supposed to works: https://docs.python.org/3/reference/datamodel.html#coroutine.close, so I mimic the behavior.
For example, you can execute this code:

async def example(): try: import asyncio await asyncio.Future() except GeneratorExit: print("close") raise coro = example() coro.send(None) coro.close() # print "close" but don't reraise the exception

There is no close_callback now, but if there was, the error could be caught by the code and be reraised. And with the changes I want to do, it would indeed maybe be possible to catch it in the code, so why not reraising it.

wyfo · 2025-04-15T10:41:26Z

src/coroutine/asyncio.rs

+fn get_running_loop(py: Python<'_>) -> PyResult<Bound<'_, PyAny>> {
+    static GET_RUNNING_LOOP: GILOnceCell<PyObject> = GILOnceCell::new();
+    let import = || -> PyResult<_> {
+        let module = py.import("asyncio")?;
+        Ok(module.getattr("get_running_loop")?.into())
+    };
+    GET_RUNNING_LOOP
+        .get_or_try_init(py, import)?
+        .bind(py)
+        .call0()


A very welcomed QOL improvement indeed!

wyfo · 2025-04-15T11:39:48Z

src/coroutine/asyncio.rs

+        // `asyncio.Future` must be awaited; in normal case, it implements  `__iter__ = __await__`,
+        // but `create_future` may have been overriden
+        let mut iter = match PyIterator::from_object(self.future.bind(py)) {


Indeed, it might go wrong, even if it would be very surprising to do such thing. I only know two alternative event loops:

uvloop, by far the most used after asyncio I belive, reuse asyncio.Future;

leviathan defines __await__ as __iter__.
Also, alternative implementations would try to be compatible with asyncio.Future, and this one defines __iter__ = __await__. That's why I think it can be a reasonable assumption to rely on the iterator protocol for an object returned by create_future. And I obviously chose to use the iterator protocol for performance reason over Python dispatch.

However, I didn't think about using am_await slot (don't know how to do for now, will dig), so I should maybe benchmark both approaches to decide which one to keep.

src/lib.rs

wyfo · 2025-04-15T13:22:44Z

tests/test_await_in_coroutine.rs

+                "CancelledError"
+            )
+        });
+        assert!(!cancel.is_cancelled());


Indeed, the equivalent Python code would be

async def wrap_cancellable(awaitable): try: await awaitable except Exception as err: assert type(err).__name__ == "CancelledError"

Because cancellation is delegated to the awaitable, this one will reraise it (and not the CancelHandle). Then if we catch the error in wrap_cancellable and don't reraise it, then the outer task is not cancelled.

But your question makes me understand I should add some comments to better explain the tests.

wyfo · 2025-04-15T13:28:40Z

tests/test_await_in_coroutine.rs

+            res = future.fuse() => res,
+            _ = checkpoint().fuse() => unreachable!(),


The same panic is indeed raised at two different places in the code depending on the polling order. I will add some comments to clarify what happens.

wyfo · 2025-04-15T13:35:46Z

src/coroutine/waker.rs

+enum WakerHack {
+    Argument(PyObject),
+    Result(Poll<PyResult<PyObject>>),
 }


Yes, I'm completely for raising the MSRV of the async feature to 1.83!
When I wrote this code, back in 1.74, the feature was far from stabilized, which is why I came up with this hack. But using 1.83 would in fact bring other benefits (more details in the review comment).

wyfo · 2025-04-15T13:46:38Z

src/coroutine/awaitable.rs

+/// })
+/// # }
+/// ```
+pub fn await_in_coroutine(


Coroutine context technically means that the Waker inside the Context used to poll the Future is the one instantiated in Coroutine::poll_inner. That's not really related to being run on the Python event loop, even if you expect your Coroutine object to be run on it.
That concept should be properly documented, as you mentioned in another comment.

Is there ever a reasonable way to await one a Python awaitable without being in coroutine context?

Awaiting a Python awaitable means delegating send/throw/close when it's possible. That's the goal of this PR, to make things as much as it would be done in pure Python.

Actually, it would be technically feasible to "await" a python awaitable in a Rust runtime, but the main issue is that Python async stack is not supposed to be thread safe, and I assume that most awaitables assume to be run by a Python event loop by using asyncio API, so you should expect to have issue if you don't use the Python runtime.
Another point is that Python awaitables have a different semantic than Rust future regarding cancellation: Python awaitable should always be run until completion, there is no such thing as "dropped in the middle of execution" like Rust future (otherwise, finally block are not executed, and that's so much counterintuitive to debug; I speak from experience, as I've already seen encountered nasty things like this). So wrapping an awaitable in a Rust future that is not guaranteed to be polled into completion is not the best thing to do — Coroutine are not technically guaranteed to be run until completion, but again, they are supposed to be run on the Python event loop.
That's why I didn't think at that time it's worth to allow something almost guaranteed to behave or fail badly, and I put this conservative protection, i.e. only allows await_in_coroutine inside "coroutine context"; I do think that panicking is better that returning a PyErr wrapping things like RuntimeError: no running event loop.

API question: do you think it's better to have it this way or as a method on PyAnyMethods?

I didn't think about it, but yes, it could be better than a badly named free function like this one.

davidhewitt · 2025-05-12T17:39:58Z

Thanks for the thorough set of responses here (and sorry I took a few weeks to loop back round, I have been reviewing a LOT of overdue PRs 👀).

I think the proposal to bump MSRV on the experimental-async feature is totally reasonable and the APIs it enables look good to me. 👍

wyfo force-pushed the pyfuture branch from 55a0e82 to f9c7ec0 Compare November 30, 2023 15:42

wyfo mentioned this pull request Nov 30, 2023

feat: support anyio with a Cargo feature #3612

Draft

wyfo marked this pull request as draft November 30, 2023 15:43

wyfo force-pushed the pyfuture branch 17 times, most recently from d1e9e17 to 53cccb4 Compare December 7, 2023 19:17

jopemachine mentioned this pull request Jan 7, 2024

Make python binding's FSM API async lablup/raftify#70

Closed

adamreichold mentioned this pull request Feb 4, 2024

async fn tracking issue #1632

Open

wyfo force-pushed the pyfuture branch 2 times, most recently from 2eb4c74 to 732bb84 Compare April 7, 2024 01:53

wyfo force-pushed the pyfuture branch 2 times, most recently from e15675f to 053bfc2 Compare April 7, 2024 09:40

wyfo force-pushed the pyfuture branch from 053bfc2 to 60693ac Compare April 9, 2024 07:58

wyfo force-pushed the pyfuture branch 3 times, most recently from 2ebb268 to 18f59ea Compare April 25, 2024 09:20

wyfo force-pushed the pyfuture branch from 18f59ea to 1bf4f4c Compare May 7, 2024 08:28

wyfo changed the title ~~feat: add PyFuture to await Python awaitables~~ feat: add coroutine::await_in_coroutine to await awaitables in coroutine context May 7, 2024

feat: add coroutine::await_in_coroutine to await awaitables in corout…

e0b6cbf

…ine context

wyfo force-pushed the pyfuture branch from 1bf4f4c to e0b6cbf Compare January 15, 2025 23:03

wyfo marked this pull request as ready for review January 15, 2025 23:03

fix: typo

11d1ec2

wyfo added 5 commits January 16, 2025 00:42

fix: remove useless dependency

a5c2e0f

fix: typo

1b3fa98

fix: typo

28e4875

fix: lints

07c5406

fix: fix compilation with no_gil

f9084e5

davidhewitt reviewed Apr 11, 2025

View reviewed changes

wyfo and others added 3 commits April 15, 2025 12:41

Update src/coroutine.rs

c3881d8

Co-authored-by: David Hewitt <[email protected]>

Update src/coroutine/asyncio.rs

f0203e7

Co-authored-by: David Hewitt <[email protected]>

Update src/coroutine/awaitable.rs

5c8346e

Co-authored-by: David Hewitt <[email protected]>

wyfo commented Apr 15, 2025

View reviewed changes


		## Restrictions

		As the name suggests, `await_in_coroutine` resulting future can only be awaited in coroutine context. Otherwise, it

		res = future.fuse() => res,
		_ = checkpoint().fuse() => unreachable!(),

feat: add coroutine::await_in_coroutine to await awaitables in coroutine context #3611

Are you sure you want to change the base?

feat: add coroutine::await_in_coroutine to await awaitables in coroutine context #3611

Uh oh!

Conversation

wyfo commented Nov 30, 2023

Uh oh!

codspeed-hq bot commented Nov 30, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging #3611 will not alter performance

Summary

Uh oh!

davidhewitt commented Dec 17, 2023

Uh oh!

wyfo commented Apr 7, 2024

Uh oh!

wyfo commented Apr 7, 2024

Uh oh!

wyfo commented May 7, 2024

Uh oh!

wyfo commented Jan 15, 2025

Uh oh!

davidhewitt left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

wyfo left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidhewitt commented May 12, 2025

Uh oh!

Uh oh!

feat: add `coroutine::await_in_coroutine` to await awaitables in coroutine context #3611

feat: add `coroutine::await_in_coroutine` to await awaitables in coroutine context #3611

codspeed-hq bot commented Nov 30, 2023 •

edited

Loading