The use of heap ByteBuffers in putObject can lead to subtle OOM issues #247

Open
jgiunti-coatue opened this issue Aug 12, 2021 · 10 comments · Fixed by #251

Comments

@jgiunti-coatue

The usage of HeapByteBuffer in the putObject method can lead to a pretty subtle OOM issue that is tricky to figure out. If you are using many threads to do IO, in this case putting an object in s3, the JVM copies each heap buffer into a temporary direct buffer that it caches per thread, and by default there is no limit on the size or number of these cached buffers. As a result, if you create many threads for IO and the buffers are large, the app can use a lot of additional native memory in a way that looks like a leak.

For us, when running in production, this presented itself as the java process consuming several GB more memory than we had allocated for the heap, and thus getting killed when it ran out of memory.

This issue can be mitigated by defining jdk.nio.maxCachedBufferSize so that large buffers are not cached, but maybe it makes sense to consider using a direct buffer or something else.
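
For reference, the mitigation looks something like this on the command line (262144 bytes = 256 KB is just an example threshold, and my-app.jar is a placeholder):

# buffers larger than 256 KB will no longer be kept in the per-thread cache
java -Djdk.nio.maxCachedBufferSize=262144 -jar my-app.jar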

@regis-leray
Member

Thank you for the report, I will look into it.

@regis-leray
Member

Release v0.3.6 is on its way.

@regis-leray
Member

@jgiunti-coatue please don't hesitate to reopen the ticket if you find any other problems.
Thanks again.

@jgiunti-coatue
Author

Thank you!

@regis-leray
Member

The fix was wrong

@regis-leray reopened this Sep 12, 2021
@regis-leray
Member

It would be great to be able to reproduce this error.
It looks like we are accumulating all of the data in memory, which is weird since we are streaming the data.

@jgiunti-coatue
Can you show how you created the stream (content) passed as a parameter to the putObject function?

 def putObject[R](
    bucketName: String,
    key: String,
    contentLength: Long,
    content: ZStream[R, Throwable, Byte],
    options: UploadOptions
  )

@jgiunti-coatue
Author

@regis-leray we have since moved to using multiPartUpload, as it seems to actually stream the data to s3, and we haven't had the issue since. If I'm not mistaken, the conversion of the ZStream in the putObject function via toPublisher consumes the entire stream, which is why I think all of the data is being accumulated in memory.
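
Roughly what the multipartUpload-based approach looks like (sketch only, with placeholder names; the exact multipartUpload signature and parameters may differ between zio-s3 versions, so check the one you are on):

import java.nio.file.Paths

import zio.s3._
import zio.stream.ZStream

object UploadSketch {
  // Stream a large file to S3 with multipartUpload so only part-sized chunks are
  // buffered at a time instead of the whole payload. Bucket, key and path are placeholders.
  val content: ZStream[zio.blocking.Blocking, Throwable, Byte] =
    ZStream.fromFile(Paths.get("/tmp/big-file.bin"))

  // The trailing argument is the upload parallelism; drop it if your zio-s3 version
  // does not have that parameter list.
  val upload = multipartUpload("my-bucket", "my-key", content)(4)
  // `upload` still needs the S3 (and Blocking) layers provided, e.g. via zio.s3.live(...), before running.
}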

Check out the streamToPublisher method in the reactive streams interop

def streamToPublisher[R, E <: Throwable, O](stream: ZStream[R, E, O]): ZIO[R, Nothing, Publisher[O]] =
    ZIO.runtime.map { runtime => subscriber =>
      if (subscriber == null) {
        throw new NullPointerException("Subscriber must not be null.")
      } else {
        runtime.unsafeRunAsync_(
          for {
            demand <- Queue.unbounded[Long]
            _      <- UIO(subscriber.onSubscribe(createSubscription(subscriber, demand, runtime)))
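            // drain the stream into a sink that emits to the subscriber as demand is signalled via the queue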
            _ <- stream
                   .run(demandUnfoldSink(subscriber, demand))
                   .catchAll(e => UIO(subscriber.onError(e)))
                   .forkDaemon
          } yield ()
        )
      }
    }

The stream is consumed using a sink in order to create the publisher. I think this is why all the data is accumulated in memory.

@regis-leray
Member

Glad to hear you were able to solve it.

Thank you, your findings really help. I will try to see how we can build a safer publisher, or we will be forced to change the signature to a ByteBuffer.

I need to investigate whether, when building the bridge ZIO => Reactive Streams, we need to evaluate the whole stream in memory.

@regis-leray
Member

I'm still looking into what could cause this issue. Based on the discussion on the Discord channel with the ZIO team, we didn't find any clue or problem that could lead to accumulating data in memory.

@jgiunti-coatue
Author

jgiunti-coatue commented Sep 22, 2021

@regis-leray This article was very helpful for understanding the issue with using ByteBuffers. The problem we were encountering came from using many threads to upload multiple large files to s3. This was causing OOM issues even though our heap usage was relatively low, because the cached buffers are allocated in native memory. That isn't an issue per se, but it can cause problems if you don't set specific java options to limit the amount of native memory and the size of the buffers that are cached per thread. The implementation using byte buffers isn't wrong, but it can cause non-obvious OOM issues for users.
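
For reference, the JDK's Native Memory Tracking is one way to see where the native memory is going (the jar name and pid are placeholders; cached/direct buffer usage typically shows up under the Internal or Other categories, depending on JDK version):

# start the JVM with native memory tracking enabled (small runtime overhead)
java -XX:NativeMemoryTracking=summary -jar my-app.jar

# then inspect the native memory breakdown of the running process
jcmd <pid> VM.native_memory summary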
