Completion-based I/O #915

Open
Ralith opened this issue Nov 13, 2020 · 10 comments
Labels: enhancement (New feature or request)

Comments

@Ralith (Collaborator) commented Nov 13, 2020

Modern high-performance I/O APIs (e.g. io_uring and I/O completion ports) are completion-oriented, unlike the traditional readiness-oriented epoll paradigm. Advantages include fewer syscalls and no copying. On Windows in particular, readiness-oriented I/O is poorly supported.
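
For readers unfamiliar with the model, here is a minimal sketch of a completion-style read using the io-uring crate (roughly its README example; the crate and exact calls are an illustration here, not anything Quinn uses today). Instead of waiting for readiness and then calling read(2), we submit an operation together with a buffer and later harvest a completion entry that tells us it finished:

```rust
use std::os::unix::io::AsRawFd;
use io_uring::{opcode, types, IoUring};

fn main() -> std::io::Result<()> {
    let mut ring = IoUring::new(8)?;
    let file = std::fs::File::open("/etc/hostname")?;
    let mut buf = vec![0u8; 1024];

    // Describe the read up front; the kernel uses `buf` until the completion arrives.
    let read_e =
        opcode::Read::new(types::Fd(file.as_raw_fd()), buf.as_mut_ptr(), buf.len() as _)
            .build()
            .user_data(0x42);

    unsafe {
        ring.submission().push(&read_e).expect("submission queue is full");
    }
    ring.submit_and_wait(1)?; // one syscall: submit and wait for the completion

    let cqe = ring.completion().next().expect("completion queue is empty");
    assert_eq!(cqe.user_data(), 0x42);
    let n = cqe.result(); // bytes read, or a negative errno on failure
    println!("read {} bytes: {:?}", n, &buf[..n.max(0) as usize]);
    Ok(())
}
```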

@Matthias247 has reported that a prototype variant of Quinn modified to use Registered I/O (RIO) on Windows performs significantly (~20%?) better than our Linux backend, which uses sendmmsg and recvmmsg for efficient batching, and drastically better than the fallback backend on Windows. Due to major changes in tokio/mio's Windows support, we should re-evaluate the latter result after moving to tokio 0.3.

On Linux, io_uring is only available on recent kernels. Other platforms (e.g. macOS, BSDs) may not offer completion-based I/O at all. Retaining a readiness-oriented fallback will therefore remain necessary for some time. However, the performance benefits seem to justify making completion-based I/O the first-class target.

A complicating factor is that tokio itself is, currently, 100% readiness-oriented. On Linux, we may be able to bridge this gap gracefully due to io_uring/epoll interop, but it's unclear if something similar is possible on Windows. If not, a background thread may be necessary.
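
If that interop pans out, one possible shape for the bridge is sketched below (Linux only; the io-uring crate's register_eventfd plus a plain poll(2) stand in for epoll or tokio's AsyncFd, so treat the exact calls as assumptions): register an eventfd with the ring, let the readiness-based reactor wait on it, and drain completions when it fires.

```rust
use std::os::unix::io::RawFd;
use io_uring::{opcode, IoUring};

fn main() -> std::io::Result<()> {
    let mut ring = IoUring::new(8)?;

    // Eventfd that the kernel signals whenever a completion is posted to the ring.
    let efd: RawFd = unsafe { libc::eventfd(0, libc::EFD_CLOEXEC) };
    assert!(efd >= 0);
    ring.submitter().register_eventfd(efd)?;

    // Submit a no-op just so something completes.
    unsafe {
        ring.submission()
            .push(&opcode::Nop::new().build().user_data(1))
            .expect("submission queue is full");
    }
    ring.submit()?;

    // A readiness-based reactor (epoll, or tokio's AsyncFd around `efd`) would wait
    // here; plain poll(2) stands in for it in this sketch.
    let mut pfd = libc::pollfd { fd: efd, events: libc::POLLIN, revents: 0 };
    let _ = unsafe { libc::poll(&mut pfd, 1, -1) };

    // Once the eventfd fires, drain the completion queue without blocking.
    for cqe in ring.completion() {
        println!("completed: user_data={} result={}", cqe.user_data(), cqe.result());
    }
    Ok(())
}
```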

@Matthias247 (Contributor):

I pushed my POC for RIO and completion-based I/O to #918

This might help in getting an idea of what is necessary. I think in a model where Quinn owns all I/O and the associated thread, it's not terribly complicated.
The fact that datagrams are received and transmitted in an all-or-nothing fashion, and that we mostly have to deal with a single socket rather than lots of them, makes this easier than, e.g., implementing support for completion-oriented TCP. But obviously one still has to be a bit careful around ownership.
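
To make the ownership point concrete, here is a toy sketch of the buffer-passing convention that completion APIs tend to force: the operation takes the buffer by value and only hands it back with the result, so the buffer can't be dropped while the kernel is still using it. The names (CompletionSocket, BufResult, BlockingUdp) are illustrative only, and a blocking std socket stands in for the real asynchronous operation:

```rust
use std::net::UdpSocket;

// The result and the buffer always come back together.
type BufResult<T> = (std::io::Result<T>, Vec<u8>);

trait CompletionSocket {
    // Ownership of `buf` moves into the operation and is returned only once
    // the (conceptually pending) operation has finished with it.
    fn recv_owned(&self, buf: Vec<u8>) -> BufResult<usize>;
}

// Toy implementation backed by a blocking std socket, just to make the flow concrete.
struct BlockingUdp(UdpSocket);

impl CompletionSocket for BlockingUdp {
    fn recv_owned(&self, mut buf: Vec<u8>) -> BufResult<usize> {
        let res = self.0.recv(&mut buf);
        (res, buf) // the buffer returns to the caller whether or not the recv succeeded
    }
}

fn main() -> std::io::Result<()> {
    let a = UdpSocket::bind("127.0.0.1:0")?;
    let b = UdpSocket::bind("127.0.0.1:0")?;
    a.send_to(b"hello", b.local_addr()?)?;

    let sock = BlockingUdp(b);
    let (res, buf) = sock.recv_owned(vec![0u8; 1500]);
    let n = res?;
    println!("received {} bytes: {:?}", n, &buf[..n]);
    Ok(())
}
```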

@KirillLykov commented May 26, 2022

I wonder if there are any plans to use io_uring? Have you considered integrating DataDog's glommio crate? Nowadays another way to proceed might be AF_XDP, which might be simpler(?) in some ways.

@Ralith (Collaborator, Author) commented May 27, 2022

See also the discussion at #1319. This issue was originally opened before GSO/GRO support was implemented, and it's unclear whether io_uring will be a net benefit on Linux, though we'd be happy to mentor someone who wants to experiment.
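
(For context on why GSO changes the calculus: with UDP_SEGMENT set, one send carries many datagrams that the kernel splits, so much of the per-packet syscall cost io_uring would amortize is already gone. A minimal, Linux-only sketch using constants from the libc crate and an arbitrary segment size:)

```rust
use std::net::UdpSocket;
use std::os::unix::io::AsRawFd;

fn main() -> std::io::Result<()> {
    let socket = UdpSocket::bind("127.0.0.1:0")?;

    // Ask the kernel to split each send into 1200-byte datagrams
    // (the segment size here is arbitrary, for illustration only).
    let segment_size: libc::c_int = 1200;
    let rc = unsafe {
        libc::setsockopt(
            socket.as_raw_fd(),
            libc::SOL_UDP,
            libc::UDP_SEGMENT,
            &segment_size as *const _ as *const libc::c_void,
            std::mem::size_of_val(&segment_size) as libc::socklen_t,
        )
    };
    if rc != 0 {
        return Err(std::io::Error::last_os_error());
    }

    // A single send of, say, 12_000 bytes now becomes ten datagrams on the wire,
    // so the per-packet syscall count is already largely amortized away.
    Ok(())
}
```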

@KirillLykov:

> See also the discussion at #1319. This issue was originally opened before GSO/GRO support was implemented, and it's unclear whether io_uring will be a net benefit on Linux, though we'd be happy to mentor someone who wants to experiment.

It sounds super interesting to check out, though I'm not sure I'll manage it yet. I would start by creating a hack to plug in monoio and checking the performance metrics. So the first question is whether there are any existing benchmarks I can use to measure performance, and the second is where to start.

@Ralith (Collaborator, Author) commented May 27, 2022

See the bench and perf directories for some benchmarking tools. As discussed in #1319, tokio-uring might be a more appropriate place to start. You could look at @Matthias247's prototype above for inspiration, or try modifying the endpoint driver in quinn directly, or try building directly on top of quinn-proto to avoid being influenced by the existing architecture.
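
For a starting point, this is roughly what a tokio-uring UDP round trip looks like per its docs (treat the exact signatures, e.g. whether bind is async, as assumptions). Note how buffers are moved into each operation and handed back with the completion rather than borrowed across a readiness check:

```rust
use tokio_uring::net::UdpSocket;

fn main() -> std::io::Result<()> {
    tokio_uring::start(async {
        // Two sockets on fixed ports so we can address them without extra lookups.
        let a = UdpSocket::bind("127.0.0.1:7000".parse().unwrap()).await?;
        let b = UdpSocket::bind("127.0.0.1:7001".parse().unwrap()).await?;

        // send_to takes the buffer by value and hands it back with the completion.
        let (res, _buf) = a
            .send_to(b"ping".to_vec(), "127.0.0.1:7001".parse().unwrap())
            .await;
        res?;

        // recv_from likewise returns (result, buffer); no readiness polling involved.
        let (res, buf) = b.recv_from(vec![0u8; 1500]).await;
        let (n, from) = res?;
        println!("got {} bytes from {}: {:?}", n, from, &buf[..n]);
        Ok(())
    })
}
```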

@djc (Member) commented Jun 27, 2022

@sowhu do you have more context on the talk? Not sure which Stephen you're referring to.

@sowhu commented Jun 27, 2022

Sorry guys. I just realized I posted on the wrong issue. Please just disregard my previous two comments. My mistake.

@KirillLykov:

For reference, @djc, regarding io_uring libraries in Rust: if the plots published by monoio are still up to date and the benchmark scenarios are fair, it looks like monoio is the fastest choice (see their published plots).

@Ralith (Collaborator, Author) commented Jun 27, 2022

Those benchmarks don't seem to involve tokio-uring, just regular epoll-based tokio.

@Icelk commented Feb 4, 2023

> Those benchmarks don't seem to involve tokio-uring, just regular epoll-based tokio.

I did some benchmarks with tokio-uring (which is single-threaded). It scores 370k/s, while a single-threaded monoio scores 270k/s. Monoio can scale to multiple threads, but so can tokio-uring (if you just spawn multiple executors!).

TL;DR: tokio-uring seems to be the way to go.

I would really like to use QUIC with io_uring! Thanks.
