[ML] Server-Sent Events for Inference response · prwhelan/elasticsearch@428b53d

Commit

[ML] Server-Sent Events for Inference response

Initial implementation of streaming inference responses as Server-Sent
Events for the `POST /_inference` API.

Bytes are requested and read from a Flow.Publisher and encoded in a
ChunkedRestResponseBodyPart before sent to the REST channel.  The
channel will request more bytes via the `getNextPart` API.

Encoding is done in two parts:
1. A wrapper encoding to format the messages as a Server-Sent Event
   stream.
2. The existing JSON (or requested) encoding for the data payload using
   XContent.

Example messages:
```
event: message
data: { "completion": [{"delta": "hello, world"}] }

```

Loading branch information

prwhelan committed Sep 5, 2024

1 parent c805f90 commit 428b53d

0 comments on commit `428b53d`

Please sign in to comment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit

There are no files selected for viewing

0 comments on commit `428b53d`

Commit

There are no files selected for viewing

0 comments on commit 428b53d

0 comments on commit `428b53d`