Why implement `Buffer` and `Memory`? #125

warrenalphonso · 2022-10-10T20:46:24Z

Hey I've just been poking around this package and it seems overly complex. Can someone more familiar with it make sure I haven't just missed Chesterton's fence 😛 Happy to work on this if people agree it makes sense

AFAICT, TranscodingStream manages an input and output Buffer; data is read from the input stream into the input Buffer. This is passed to a codec via Memory (which is just an abstraction over the "used" and "free" parts of a Buffer). The codec writes to the output Buffer, and that's given to the user via an output stream when requested.

Remove Memory. I can't figure out why this is useful, aside from just making things slightly easier for downstream codecs that are C-based and need pointers. If that's the case, we could views instead.
Remove Buffer. We really don't need our own implementation; I think IOBuffer should work. Fundamentally, we just want to load data from the input stream into some input array, let a codec convert part of that to an output array, and send it to the user via a stream.

Less importantly:

Remove Error. Why not just try/catch instead of passing yet another variable around as state?

The text was updated successfully, but these errors were encountered:

warrenalphonso · 2022-10-13T22:03:39Z

Actually, IOBuffer doesn't work so well because it uses the same pointer when reading/writing. What we want is a buffered stream.

Maybe it makes sense to use BufferedStreams.jl so TranscodingStreams.jl doesn't need to ship its own fast buffer implementation? One problem is that BufferedStreams.jl requires choosing between BufferedInputStream and BufferedOutputStream structs. This doesn't seem compatible with the current TranscodingStream design which uses the given stream as either input or output depending on whether it's in :read or :write mode.

I think it's better design to have two separate structs for input and output transcoding, instead of keeping this logic inside some state component. It would also make the logic simpler since we don't have to worry about writing while in read state or vice versa.

Any thoughts? Maybe @jakobnissen ?

jakobnissen · 2023-01-09T13:59:24Z

It seems complex to me too, but my experience is that usually stuff is complicated because there are edge cases I havne't thought of.
Memory is a way to abstract over input memory, which may not be in the form of a Vector{UInt8}, i.e. it may be a SubString{String}. It may be possible to instead make it a parameterized Julia type instead of storing the raw pointer.

I don't think we can remove Buffer. We need low-level control of the implementation, and so would have to rely on implementation details of IOBuffer. We also need more features like Buffer.marked, which IOBuffer does not provide.

vtjnash linked a pull request Feb 2, 2024 that will close this issue

safety improvement to Memory #167

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why implement `Buffer` and `Memory`? #125

Why implement `Buffer` and `Memory`? #125

warrenalphonso commented Oct 10, 2022 •

edited

Loading

warrenalphonso commented Oct 13, 2022

jakobnissen commented Jan 9, 2023

Why implement Buffer and Memory? #125

Why implement Buffer and Memory? #125

Comments

warrenalphonso commented Oct 10, 2022 • edited Loading

warrenalphonso commented Oct 13, 2022

jakobnissen commented Jan 9, 2023

Why implement `Buffer` and `Memory`? #125

Why implement `Buffer` and `Memory`? #125

warrenalphonso commented Oct 10, 2022 •

edited

Loading