Should `write` with an `END_TOKEN` call `finalize` on the stream to prevent memory leaks? #117

KristofferC · 2022-02-10T12:58:21Z

The goal here is to finish a TranscodingStream without closing the underlying buffer. The documentation at https://juliaio.github.io/TranscodingStreams.jl/latest/examples/#Explicitly-finish-transcoding-by-writing-TOKEN_END-1 says to write a TOKEN_END token to the stream for this. However, doing so only flushes the stream and does not finalize it, which leads to memory leaks in code like:

using CodecZlib
using TranscodingStreams

function leak()
    buf = IOBuffer()
    data = rand(10^6)
    while true
        zWriter = ZlibCompressorStream(buf)
        write(zWriter, data)
        write(zWriter, TranscodingStreams.TOKEN_END)
        flush(zWriter)
    end
end

leak()

which leaks memory indefinitely. Manually calling finalize on the zWriter fixes the issue, but it is not clear from the documentation that this is required. There are a few possible solutions:

Attach a finalizer to the stream that calls finalize. This is not ideal because you want a more eager cleanup than whenever the GC gets to it.
Make writing a TOKEN_END call finalize on the stream.

Make writing a TOKEN_END set the stream mode to :closed, so that calling close on the wrapper stream does not close the underlying wrapped stream:

TranscodingStreams.jl/src/stream.jl

Lines 174 to 183 in 2fac971

    
function Base.close(stream::TranscodingStream)
    stopped = stream.state.mode == :stop
    if stream.state.mode != :panic
        changemode!(stream, :close)
    end
    if !stopped
        close(stream.stream)
    end
    return nothing
end

Alternatively, the code showing the leak above may itself be "faulty", but normal Julia code generally shouldn't leak like this, so at least a finalizer might be a good idea.
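
For reference, the manual cleanup mentioned above could look roughly like the sketch below. It rests on assumptions: that "calling finalize" means `TranscodingStreams.finalize` on the codec, and that constructing the codec explicitly and passing it to `TranscodingStream(codec, buf)` is acceptable here. The names `write_compressed!` and `no_leak_manual` are hypothetical and used only for illustration, and the loop is bounded so the example terminates.

```julia
using CodecZlib
using TranscodingStreams

# Hypothetical helper: compress `data` into `buf` once, then release the
# codec's zlib state explicitly while leaving `buf` itself open.
function write_compressed!(buf::IO, data)
    codec = ZlibCompressor()                # keep a handle so it can be finalized
    zWriter = TranscodingStream(codec, buf)
    try
        write(zWriter, data)
        write(zWriter, TranscodingStreams.TOKEN_END)
        flush(zWriter)
    finally
        # Assumption: this is the cleanup meant by "manually calling finalize".
        TranscodingStreams.finalize(codec)
    end
    return buf
end

# Hypothetical bounded variant of the `leak` function above.
function no_leak_manual(iterations = 10)
    buf = IOBuffer()
    data = rand(10^4)
    for _ in 1:iterations
        write_compressed!(buf, data)
    end
    return buf
end
```

The stream must not be written to again once its codec has been finalized, so this only fits code that creates a fresh stream per chunk, as the loop above does.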

nhz2 · 2024-03-16T23:16:21Z

The stream is expected to still be writable after writing TOKEN_END. For example, https://github.com/BioJulia/FASTX.jl/blob/v2.1.4/src/fastq/writer.jl#L53 uses TOKEN_END in a flush function.

With #178 you can do:

using CodecZlib
using TranscodingStreams

function no_leak()
    buf = IOBuffer()
    data = rand(10^6)
    while true
        zWriter = ZlibCompressorStream(seekstart(buf); stop_on_end=true)
        write(zWriter, data)
        close(zWriter)
    end
end

no_leak()

Adding a finalizer is still a good idea.

nhz2 · 2024-12-18T00:24:20Z

I am adding finalizers for CodecBzip2 in JuliaIO/CodecBzip2.jl#43

@KristofferC what do you mean by "This is not ideal because you want a more eager cleanup than whenever the GC gets to it."?

fredrikekre mentioned this issue Feb 10, 2022

Release allocated resources in Zlib, fixes #43. JuliaVTK/WriteVTK.jl#100

Merged

nhz2 mentioned this issue Mar 8, 2024

Fix stop_on_end = true closing underlying stream #178

Merged
