
SNOW-1747415 SNOW-1215393: Out of Memory Issue in memory-limited environment while streaming data #790

Open
kartikgupta2607 opened this issue Mar 6, 2024 · 21 comments
Assignees
Labels
backend changes needed Change must be implemented on the Snowflake service, and not in the client driver. bug Something isn't working status-blocked Progress cannot be made to this issue due to an outside blocking factor. status-triage_done Initial triage done, will be further handled by the driver team

Comments

@kartikgupta2607

Please answer these questions before submitting your issue.
In order to accurately debug the issue this information is required. Thanks!

  1. What version of NodeJS driver are you using? -> snowflake-sdk; tried with 1.6.20 and 1.9.3

  2. What operating system and processor architecture are you using? -> macOS 14.3.1, arm64

  3. What version of NodeJS are you using? -> v18.12.1

  4. What are the component versions in the environment (npm list)? -> NA

  5. Server version: -> 8.9.1

  6. What did you do?
    Tried running this script (snowflake_OOM.txt) to export records from the SNOWFLAKE_SAMPLE_DATA.TPCH_SF1000.CUSTOMER table, with max-old-space-size=150 set while running it. With a plain SELECT * FROM CUSTOMER; the node process exited after exporting some rows (200,000) with FATAL ERROR: Reached heap limit Allocation failed - JavaScript heap out of memory. However, after modifying the same query with a LIMIT clause (limit of 1,000,000), it was able to export all 1M records with the same old-space limit. I tried the workarounds mentioned in SNOW-750472 Out of memory issue #43 and the linked issues, but none worked: I tried the streamResult option both in the connection config and while executing the query, and tried downgrading to 1.6.20.
    Following is the metadata of the source table

[screenshot: table metadata] As it doesn't have any VARIANT-type column, #536 and the issues related to it shouldn't matter in our case.
  7. What did you expect to see?

    It should be able to export the same number of records before any OOM occurs, whether or not LIMIT is present. It also seems a larger buffer is fetched when the query has no LIMIT, and GC appears to trigger later in that case.

  8. Can you set logging to DEBUG and collect the logs? -> Can't upload logs due to company security policies.

var snowflake = require('snowflake-sdk');
snowflake.configure({
  logLevel: 'trace'
});
  9. What is your Snowflake account identifier, if any? (Optional)
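Since the original script can't be attached here, the kind of reproduction described in question 6 might be sketched roughly as follows (the credential values are placeholders and makeCounter is an illustrative helper, not part of the SDK); running it with node --max-old-space-size=150 should reproduce the heap limit:

```javascript
// Pure helper: returns a row callback that counts rows and calls `report`
// every `logEvery` rows, without retaining the rows themselves.
function makeCounter(logEvery, report) {
  let count = 0;
  return () => {
    count += 1;
    if (count % logEvery === 0) report(count);
    return count;
  };
}

function runRepro() {
  // Lazy require so the helper above stays self-contained.
  const snowflake = require('snowflake-sdk');
  const connection = snowflake.createConnection({
    account: '<account>', username: '<user>', password: '<password>',
  });
  connection.connect((err, conn) => {
    if (err) throw err;
    const onRow = makeCounter(100000, (n) => console.log(`${n} rows streamed`));
    conn.execute({
      sqlText: 'SELECT * FROM SNOWFLAKE_SAMPLE_DATA.TPCH_SF1000.CUSTOMER',
      streamResult: true, // stream rows instead of materializing the result set
      complete: (execErr, stmt) => {
        if (execErr) throw execErr;
        stmt.streamRows()
          .on('data', onRow)
          .on('error', console.error)
          .on('end', () => console.log('done'));
      },
    });
  });
}
```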
@kartikgupta2607 kartikgupta2607 added the bug Something isn't working label Mar 6, 2024
@github-actions github-actions bot changed the title Out of Memory Issue while streaming data SNOW-1215393: Out of Memory Issue while streaming data Mar 6, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka self-assigned this Mar 7, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka added status-triage Issue is under initial triage and removed bug Something isn't working labels Mar 7, 2024
@sfc-gh-dszmolka
Collaborator

hi - thanks for raising this issue with us, i'll take a look and see how we can proceed

@shelbylsmith

I have the same issue with node 20.9.0 and snowflake-sdk 1.9.0.

@sfc-gh-dszmolka
Collaborator

thank you again for the detailed reproduction; the issue could be reproduced even with the TPCH_SF100 version of the CUSTOMER table, which 'only' has 15,000,000 rows.

Of course, if one leaves the memory limit flag off, the query runs to completion; with this table, memory usage can be observed going up into the ~1G range, then GC kicks in, and this repeats until the query completes.
Additionally, if a smaller table is created, e.g. with 1M rows from the original table, and the smaller table is queried instead with SELECT * FROM SMALLER_TABLE, then the query also runs to completion even with the memory limit.

I'm also not sure whether this is the expected behaviour of the 'streaming rows' functionality, and comparing heap snapshots from the 'bad' and 'good' scenarios raises additional questions, so I've now involved the driver team to take a look at this.
Will keep this thread posted with the progress.

@sfc-gh-dszmolka sfc-gh-dszmolka added status-triage_done Initial triage done, will be further handled by the driver team and removed status-triage Issue is under initial triage labels Mar 8, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka added the bug Something isn't working label Mar 8, 2024
@MC7771

MC7771 commented Mar 13, 2024

You can try to fetch the stream in a specific range like this:
const stream = stmt.streamRows({ start, end })
This way you can limit the number of records fetched from Snowflake, but you need to write your own loop and calculate the start and end indexes.
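The loop described above might be sketched as follows, assuming the statement's total row count is available via getNumRows(); BATCH, makeRanges, and streamInBatches are illustrative names, not SDK API:

```javascript
const BATCH = 100000; // rows per window; tune to the memory budget

// Pure helper: split [0, total) into inclusive { start, end } windows.
function makeRanges(total, batch) {
  const ranges = [];
  for (let start = 0; start < total; start += batch) {
    ranges.push({ start, end: Math.min(start + batch, total) - 1 });
  }
  return ranges;
}

function streamInBatches(connection, sqlText, processRow, done) {
  connection.execute({
    sqlText,
    streamResult: true,
    complete: (err, stmt) => {
      if (err) return done(err);
      const ranges = makeRanges(stmt.getNumRows(), BATCH);
      // Consume one window at a time so only BATCH rows are in flight.
      const next = (i) => {
        if (i >= ranges.length) return done(null);
        stmt.streamRows(ranges[i])
          .on('data', processRow)
          .on('error', done)
          .on('end', () => next(i + 1));
      };
      next(0);
    },
  });
}
```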

@sfc-gh-dszmolka
Collaborator

based on the same idea, the LIMIT ... OFFSET ... construct could perhaps also serve as a workaround: 'partition' the big query's result and iterate over the partitions. But it's still just a workaround until we find a solution.
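That partitioning loop might be sketched like this, assuming the base query carries a stable ORDER BY so pages don't overlap; PAGE, pageSql, and fetchAllPages are illustrative names:

```javascript
const PAGE = 1000000; // rows per page; tune to the memory budget

// Pure helper: append a page window to a base query.
function pageSql(baseSql, limit, offset) {
  return `${baseSql} LIMIT ${limit} OFFSET ${offset}`;
}

function fetchAllPages(connection, baseSql, processRow, done) {
  const fetchPage = (offset) => {
    connection.execute({
      sqlText: pageSql(baseSql, PAGE, offset),
      streamResult: true,
      complete: (err, stmt) => {
        if (err) return done(err);
        let rowsInPage = 0;
        stmt.streamRows()
          .on('data', (row) => { rowsInPage += 1; processRow(row); })
          .on('error', done)
          .on('end', () => {
            // A short page means we've reached the end of the result.
            if (rowsInPage < PAGE) return done(null);
            fetchPage(offset + PAGE);
          });
      },
    });
  };
  fetchPage(0);
}
```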

@sfc-gh-dszmolka
Collaborator

a very quick update: this seems to be deeply intertwined with how Snowflake, as the backend, currently creates the query result chunks on the server side, and might not be easily influenced solely on the client (driver) side, besides the workarounds already discussed. Multiple teams are working on this issue and I'll keep this thread posted.

@kartikgupta2607
Author

Hello @sfc-gh-dszmolka, do you have any updates regarding the previous request? I attempted the workaround mentioned earlier by passing {start, end} to streamRows, but the issue persists. Below is a sample script for your reference; please let me know if there are any adjustments I should make. Additionally, I noticed that the heap size increases significantly even when I'm not storing the data in the script. Are the rows referenced somewhere internally?
snowflake_OOM_batch.txt

@sfc-gh-pmotacki
Collaborator

Hi @kartikgupta2607, we are having an internal discussion about this issue and will keep you informed.

@sfc-gh-dszmolka
Collaborator

sfc-gh-dszmolka commented Apr 15, 2024

hey folks, a quick update. As my colleague mentioned above, we're discussing all the possibilities internally. Sadly the fix is not necessarily trivial because, as it seems, it requires server-side (driver-independent) changes, which might take a while.

Until it's implemented, please refer to the mitigations mentioned above:

  • fetch a range of the stream and iterate over the ranges (solution from MC7771)
  • use Snowflake's built-in LIMIT ... OFFSET ... clause to define the slice of results you get from Snowflake, and loop over these slices for very big results

thank you so much for bearing with us while this is discussed - i'll keep this thread posted

@kartikgupta2607
Author

thanks for the update @sfc-gh-dszmolka!
the workaround suggested by @MC7771 doesn't seem to work for me; I've attached a sample script for reference here

@sfc-gh-dszmolka
Collaborator

Understood, thanks for confirming. Can you please try the other method as well?

In the meantime, I have a further progress update, which is not that good. As mentioned above, to actually fix the issue we need server-side improvements (in the Snowflake architecture, it's the Snowflake engine itself that decides the number of query result chunks, their size, etc.; the client driver cannot do anything about it, and the issue is connected to how these chunks of the query result are generated).
As a possible proper solution, we plan to enhance the backend to give the client side more control over query result chunk creation, so clients can adapt to these memory-restricted situations.

It became clear very recently that due to other higher priority issues, the server-side improvement surely cannot be implemented in the next upcoming months, so the earliest it can be possibly addressed is the second half of this year.

I'm very sorry to bear such bad news, but wanted to set the expectations about the timeline.

Which also means the following things:

  1. people affected will need to run on some sort of workaround like LIMIT .. OFFSET .., which can 'partition' the very large query result sets (or create temporary/transient tables with partial query results from the big one and query those, or similar).

  2. Alternatively, run the workloads that must work on really big query results in an environment that is not so strictly memory-restricted.
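The temporary/transient-table variant of workaround 1 could be sketched like this (sliceTableSql and streamSlice are illustrative names; the slice table name and window sizes are up to the caller):

```javascript
// Pure helper: build the statement that materializes one slice of the big
// result into a transient table.
function sliceTableSql(sliceName, baseSql, limit, offset) {
  return `CREATE OR REPLACE TRANSIENT TABLE ${sliceName} AS ${baseSql} LIMIT ${limit} OFFSET ${offset}`;
}

// Materialize one slice, then stream the (much smaller) slice table.
function streamSlice(connection, sliceName, baseSql, limit, offset, processRow, done) {
  connection.execute({
    sqlText: sliceTableSql(sliceName, baseSql, limit, offset),
    complete: (err) => {
      if (err) return done(err);
      connection.execute({
        sqlText: `SELECT * FROM ${sliceName}`,
        streamResult: true,
        complete: (err2, stmt) => {
          if (err2) return done(err2);
          stmt.streamRows()
            .on('data', processRow)
            .on('error', done)
            .on('end', () => done(null));
        },
      });
    },
  });
}
```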

But most importantly: if you're already a Snowflake customer and affected by this issue, please do reach out to your Account Team and emphasize how important implementing this server-side improvement would be to your use case. This could bring some traction and possibly re-prioritize the backend change.

Again sorry to bring such news and the inconvenience the current behaviour causes - and thank you for bearing with us while the server-side change is implemented. Will keep this thread posted with the progress, if any.

@sfc-gh-dszmolka sfc-gh-dszmolka changed the title SNOW-1215393: Out of Memory Issue while streaming data SNOW-1215393: Out of Memory Issue in memory-limited environment while streaming data Apr 17, 2024
@kartikgupta2607
Author

Sure @sfc-gh-dszmolka , will try the other method. Please keep us posted if it gets re-prioritised.

@RobLuzmo

@kartikgupta2607 did partitioning with LIMIT .. OFFSET .. work?
We're hitting the same OOM issue even with 4 GB of memory :/
This workaround also isn't straightforward, as one needs to apply absolute ordering in the query and hope that no updates happen to the table while querying it...

@sfc-gh-dszmolka
Collaborator

sfc-gh-dszmolka commented Apr 30, 2024

We're hitting the same OOM issue even with 4 GB of memory :/

That's very strange (though of course possible, I guess, if your table is wide enough).
Just to be on the safe side, can you please double-check that you're actually using streaming results, rather than storing the whole result set in memory (the default behaviour)?

@RobLuzmo

Yeah we're doing the following:

const statement = conn.execute({ sqlText: query, streamResult: true, rowMode: 'array' });
const stream = statement.streamRows();

Note that this only happens when trying to retrieve an entire dataset containing almost 400 million rows.

@owlas

owlas commented Jun 22, 2024

@sfc-gh-dszmolka is there a way to serialize the statement object to enable async pagination of results using streamRows({ start, end }) ?

For example:

const statement = conn.execute({ ... })

// pseudo-code
// store statement, start=0, end=pageLength in database
const queryId = storeStatement(statement, start, end)

// load statement
const { statement, start, end } = loadStatement(queryId)

// stream page
const stream = statement.streamRows({ start, end })

This would be really useful for downloading results in parallel, or paginating in a load balancing scenario where multiple servers are handling query execution.

This seems possible with the SQL REST API: https://docs.snowflake.com/en/developer-guide/sql-api/handling-responses#retrieving-additional-partitions

But I guess using the node sdk will give higher performance? If that's not true, we could fall back to the REST API.

@sfc-gh-dszmolka
Collaborator

at this moment this does not seem to work, at least not this way, @owlas. In this very thread, folks tried using streamRows({ start, end }) and it did not lead to results that could handle this scenario appropriately. If you're able to launch multiple separate queries with LIMIT .. OFFSET .. clauses, all in parallel (maybe even async with the asyncExec flag), perhaps that can achieve what you're seeking?

Please also note that governing the size of the query result chunks is not possible with the SQL REST API either; the number and size of the partitions are unilaterally determined by the Snowflake engine itself, with no customer-exposed method of overriding them.
That's what is in discussion with the engine team, independently from this particular driver, since the issue affects all of them (a quick look in the PythonConnector, JDBC driver, etc. repos will yield similar issues over there, which makes sense: the behaviour is governed on the server side, not the client side, so results are similar in memory-limited environments).

@sfc-gh-dszmolka
Collaborator

I have an update. (Very) recently the server-side code has been changed to support smaller chunk sizes, which in turn hopefully makes OOM less likely. Once the server-side code has been released, setting CLIENT_RESULT_CHUNK_SIZE (values in MB; min: 16, max: 160, default: 160) should help control the (uncompressed) size of the query result chunks generated on the server side, which the client then needs to download.

I have yet to confirm the Snowflake (server) version with which this new change will be released.

Again, to be very clear: this is not a client library release we're waiting for, but a server-side one. Consequently, it is driver-independent and affects all of our driver libraries, not just this one.
The client-side parameter (CLIENT_RESULT_CHUNK_SIZE) has been there for years; it just did not work entirely as expected.

Will keep this thread posted.
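Once that server-side release is live, lowering the chunk size per session before running the large query might look like this (clampChunkSizeMb and setChunkSize are illustrative helpers; the 16-160 MB range is taken from the comment above):

```javascript
// Pure helper: clamp a requested size to the documented 16..160 MB range.
function clampChunkSizeMb(mb) {
  return Math.min(160, Math.max(16, mb));
}

// Set the session parameter before executing the large streaming query.
function setChunkSize(connection, mb, done) {
  connection.execute({
    sqlText: `ALTER SESSION SET CLIENT_RESULT_CHUNK_SIZE = ${clampChunkSizeMb(mb)}`,
    complete: (err) => done(err || null),
  });
}
```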

@sfc-gh-dszmolka sfc-gh-dszmolka added the status-fixed_awaiting_release The issue has been fixed, its PR merged, and now awaiting the next release cycle of the connector. label Oct 2, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka added the status-blocked Progress cannot be made to this issue due to an outside blocking factor. label Oct 2, 2024
@sfc-gh-dszmolka
Collaborator

update: server-side changes seem to be rolled out with Snowflake version 8.39, which is scheduled for next week. Will update this thread when the appropriate Snowflake (server) version is live in production.

@LalowiczB

@sfc-gh-dszmolka Hi, we are also experiencing the same issue (we have an internal support ticket opened with Snowflake support). I can see that the engine version has been 8.39.2 since yesterday.

The issue isn't resolved, at least for Node.js. I created a blank Node.js project where I stream 400k rows and simply log them to the console, to make sure no other logic is involved. Memory spikes up to 1GB, and changing the session setting for chunk size does not seem to have any effect.

Could you let me know if the release contained the announced fix?

@sfc-gh-dszmolka
Collaborator

hey @LalowiczB - 8.39 indeed contained the announced fix, and I can confirm that CLIENT_RESULT_CHUNK_SIZE can now be set to as low as 16MB (earlier: 48MB).
Unfortunately, I can also confirm that this change did not singlehandedly resolve the issue. Reproductions show results similar to what you saw, so we need to look further into what else needs to be done for the driver to seamlessly stream a large number of rows even in a memory-limited environment.

@sfc-gh-dszmolka sfc-gh-dszmolka removed status-fixed_awaiting_release The issue has been fixed, its PR merged, and now awaiting the next release cycle of the connector. status-blocked Progress cannot be made to this issue due to an outside blocking factor. labels Oct 17, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka changed the title SNOW-1215393: Out of Memory Issue in memory-limited environment while streaming data SNOW-1747415 SNOW-1215393: Out of Memory Issue in memory-limited environment while streaming data Oct 17, 2024
@sfc-gh-dszmolka sfc-gh-dszmolka added status-blocked Progress cannot be made to this issue due to an outside blocking factor. backend changes needed Change must be implemented on the Snowflake service, and not in the client driver. labels Dec 11, 2024