provide control over transcoded-port buffer sizes #708

owaddell · 2023-07-25T22:23:59Z

This pull request adds two new parameters that provide control over the size of the string buffer and the internal codec bytevector buffer associated with transcoded ports. It also includes changes to reduce allocation and copying in certain cases for bytevector->string, get-bytevector-all, get-bytevector-n, get-string-all, and get-string-n.

One use case for the new parameters is to allow transcoded custom binary ports to take advantage of larger buffer sizes. Previously, such ports were effectively constrained by a fixed 1024-byte internal codec buffer even if created with a large custom-port-buffer-size.
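
As a rough illustration of that use case, the sketch below wraps a custom binary input port in a transcoded port with larger buffers. It assumes a Chez Scheme build that includes this change, and it assumes that `make-codec-buffer` is a parameter whose value is a procedure that takes the binary port and returns a mutable bytevector of length at least 4, as discussed in the resolved questions. The helper names `make-demo-binary-port` and `read-all-text` and the 65536 sizes are hypothetical and chosen only for the example.

```scheme
;; Sketch only: assumes a Chez Scheme build that includes this pull request.
(import (chezscheme))

;; Hypothetical helper: a custom binary input port that reads from a bytevector.
(define (make-demo-binary-port bv)
  (let ([pos 0] [n (bytevector-length bv)])
    (make-custom-binary-input-port "demo"
      (lambda (dst start count)
        ;; copy at most count bytes into dst; returning 0 signals end of file
        (let ([k (min count (- n pos))])
          (bytevector-copy! bv pos dst start k)
          (set! pos (+ pos k))
          k))
      #f #f #f)))

;; Hypothetical helper: decode all of bv as UTF-8 through a transcoded port
;; created with enlarged buffers.
(define (read-all-text bv)
  (let ([bp (parameterize ([custom-port-buffer-size 65536])
              (make-demo-binary-port bv))])
    ;; Assumption: both parameters are consulted while they are bound here,
    ;; so the port is created and read inside the parameterize form.
    (parameterize ([transcoded-port-buffer-size 65536]
                   [make-codec-buffer
                    (lambda (bp) (make-bytevector 65536))])
      (let* ([tp (transcoded-port bp (make-transcoder (utf-8-codec)))]
             [s (get-string-all tp)])
        (close-port tp)
        s))))
```

For example, `(read-all-text (string->utf8 "hello"))` would decode the bytes through the enlarged buffers. Whether larger sizes help in practice depends on the underlying port and workload.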

Resolved questions:

- Could / should we avoid allocating the ioffsets fxvector if the underlying bp does not support port-position? (Yes. Updated to avoid allocating the ioffsets fxvector in this case.)
- Should the make-codec-buffer procedure take bp and min-size instead of just bp? (For now, keeping bp as the only argument.)
  - con: For existing codecs it will always be 4, so why pass it in? (Perhaps it could be less for latin-1-codec, but that seems pedantic.)
  - con: An immediate 4 is compact in the code stream.
  - con: If users know the minimum is 4, they could skip the check in cases where they know they have something of length >= 4 available.
  - pro: Having a min-size argument is a reminder that there is in fact a minimum length.
  - pro: Avoids ugly occurrences of "four" in documentation and 4 in user code.
- Should I merge the two error checks on the result from make-codec-buffer into one? (Yes. Done.)

csug/io.stex

mats/io.ms

jltaylor-us · 2023-07-29T18:20:43Z

Looks good to me.

Use get-string-all in fresh-line mats where the expected result from block-read relied on the input-port's buffer size.

The string-ports mat expects prettytest.ss to be ASCII. Unicode characters in the files comprising prettysrc can break the test. Add hint to help the next person who stumbles on this. We could update the prettytest.ss make rule to convert the output to ASCII, but that would add a build dependency on something like iconv. We could do the conversion in Scheme, but that would rely on the system under test.

Add transcoded-port-buffer-size to provide control over the string buffer allocated for a new transcoded port. For transcoded input ports this also governs the size of an internal fxvector used by port-position. Add make-codec-buffer to allow control over the internal bytevector buffer for a transcoded port. Note that we get a few new "Expected error" cases for some io.ms mats that were extended with new parameters clauses. A mat's parameters clause re-runs the mat body under each of the given parameter settings, so tests that check expected errors are repeated once per setting.

Avoid extra allocation and copying when get-bytevector-all, get-bytevector-n, get-string-all, and get-string-n can construct the result from a single file-buffer-size block.

Allocate minimum-sized buffers for transcoded port and codec when the bytevector argument is less than file-buffer-size. Avoid extra allocation and copy of result in that case. Take advantage of shared buffer if user has determined this is safe.

owaddell · 2023-08-02T14:06:46Z

Thanks for the reviews, folks!

jltaylor-us reviewed Jul 27, 2023

csug/io.stex (outdated, resolved)

mats/io.ms (outdated, resolved)

mats/io.ms (outdated, resolved)

owaddell force-pushed the owaddell/transcoded-port branch from ed5da1f to 5fe5434 on July 27, 2023 13:45

burgerrg approved these changes Jul 31, 2023

owaddell-beckman added 5 commits August 1, 2023 10:53

simplify test and remove buffer-size assumption

61b2ba6

avoid allocation and copying in single-block case

f11ad12

owaddell force-pushed the owaddell/transcoded-port branch from 5fe5434 to 821879d on August 2, 2023 02:28

burgerrg approved these changes Aug 2, 2023

owaddell merged commit 821879d into cisco:main Aug 2, 2023
20 checks passed
