SNOW-692968: Async queries support #787

Merged
merged 62 commits into from
Mar 5, 2025
Changes from 1 commit
763153f
Add async query support
sfc-gh-ext-simba-nl Nov 28, 2024
974613b
Add async test file
sfc-gh-ext-simba-nl Nov 28, 2024
a2f2701
Fix linux build by including unistd.h
sfc-gh-ext-simba-nl Nov 28, 2024
3829b60
Remove unnecessary test
sfc-gh-ext-simba-nl Nov 28, 2024
d947159
Add query status to C API, refactor some query parameters, move getti…
sfc-gh-ext-simba-nl Dec 3, 2024
8ab3758
Fix typo
sfc-gh-ext-simba-nl Dec 3, 2024
43c5653
Fix typo
sfc-gh-ext-simba-nl Dec 3, 2024
76ac1f1
Add more test cases, fix async fetching bugs
sfc-gh-ext-simba-nl Dec 4, 2024
c4fa76d
Fix bug with normal queries that go async after a while
sfc-gh-ext-simba-nl Dec 4, 2024
387a59f
fix typo
sfc-gh-ext-simba-nl Dec 4, 2024
cb067ec
Remove status check from fake table
sfc-gh-ext-simba-nl Dec 4, 2024
6cd5c78
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Dec 10, 2024
582913e
Fix linux warnings
sfc-gh-ext-simba-nl Dec 10, 2024
98cb876
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-jszczerbinski Dec 12, 2024
d3b94a0
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Dec 12, 2024
f84fc25
Improve error handling and logging
sfc-gh-ext-simba-nl Dec 12, 2024
97531c8
Merge branch 'SNOW-692968-async-queries-support' of https://github.co…
sfc-gh-ext-simba-nl Dec 12, 2024
f32ada0
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Dec 19, 2024
942ae0a
Fix memory issues in test cases
sfc-gh-ext-simba-nl Dec 19, 2024
ed33af6
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Dec 20, 2024
75e6565
organize enums, add test
sfc-gh-ext-simba-nl Dec 21, 2024
9e457ae
Lower the rowcount for the test
sfc-gh-ext-simba-nl Dec 21, 2024
2aaea2a
Have get_query_metadata return a struct instead of a string
sfc-gh-ext-simba-nl Jan 20, 2025
9003f03
merge mastre
sfc-gh-ext-simba-nl Jan 20, 2025
eb4ae08
Fix build issue with merge master
sfc-gh-ext-simba-nl Jan 20, 2025
a793549
Fix merge issue
sfc-gh-ext-simba-nl Jan 20, 2025
45fa5a3
Fix formatting
sfc-gh-ext-simba-nl Jan 20, 2025
734a064
remove sf_sleep_ms from platform header file
sfc-gh-ext-simba-nl Jan 22, 2025
67e6db3
Move sf_sleep_ms from platform to util
sfc-gh-ext-simba-nl Jan 22, 2025
7671e14
Fix linux compilation and remove extra newline in platform.h
sfc-gh-ext-simba-nl Jan 22, 2025
4efbd8b
Fix build errors
sfc-gh-ext-simba-nl Jan 22, 2025
04100c3
Add util.h to test file
sfc-gh-ext-simba-nl Jan 22, 2025
c89f086
Merge master
sfc-gh-ext-simba-nl Jan 29, 2025
8b39c03
Add missing files from merge
sfc-gh-ext-simba-nl Jan 29, 2025
c9d9bbb
Fix formatting, add test case for retries
sfc-gh-ext-simba-nl Jan 30, 2025
17d6236
Minor logic fix, uncomment test cases
sfc-gh-ext-simba-nl Jan 30, 2025
f298aa2
change get results logic to query /queries/{sfqid}/result instead of …
sfc-gh-ext-simba-nl Jan 31, 2025
73a1a98
merge master
sfc-gh-ext-simba-nl Jan 31, 2025
79a8df0
Fix async timeout test
sfc-gh-ext-simba-nl Feb 1, 2025
2f99bdd
Remove unused variables
sfc-gh-ext-simba-nl Feb 1, 2025
31ac782
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 3, 2025
ae46e5a
Forgot to close connection on test case
sfc-gh-ext-simba-nl Feb 4, 2025
fa9b3e0
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 4, 2025
32f127a
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 4, 2025
6891894
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 5, 2025
5788a0f
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 6, 2025
bde8ccc
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 10, 2025
2030329
Make less unnecessary calls to get query metadata, rename to make thi…
sfc-gh-ext-simba-nl Feb 11, 2025
d2371b7
Uncomment test cases
sfc-gh-ext-simba-nl Feb 11, 2025
5b1198f
Fix formatting
sfc-gh-ext-simba-nl Feb 11, 2025
a6c707c
Minor fix
sfc-gh-ext-simba-nl Feb 12, 2025
222b94b
Fix code quality warnings
sfc-gh-ext-simba-nl Feb 13, 2025
79e5e20
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-dprzybysz Feb 19, 2025
11b2c40
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 19, 2025
2313d73
Merge branch 'SNOW-692968-async-queries-support' of https://github.co…
sfc-gh-ext-simba-nl Feb 19, 2025
6b37e6a
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-ext-simba-nl Feb 20, 2025
e86aa64
Fix build error
sfc-gh-ext-simba-nl Feb 21, 2025
e711c27
Fix filename
sfc-gh-ext-simba-nl Feb 21, 2025
2c27a38
Fix windows VS17 build error
sfc-gh-ext-simba-nl Mar 3, 2025
463e016
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-jszczerbinski Mar 4, 2025
284de79
Revert fix in SnowflakeCommon, remove unused stats from SF_QUERY_META…
sfc-gh-ext-simba-nl Mar 4, 2025
6e448c8
Merge branch 'master' into SNOW-692968-async-queries-support
sfc-gh-jszczerbinski Mar 5, 2025
13 changes: 12 additions & 1 deletion include/snowflake/client.h
@@ -495,6 +495,8 @@ typedef struct SF_STMT {
SF_STATS *stats;
void *stmt_attrs;
sf_bool is_dml;
sf_bool is_async;
sf_bool is_async_initialized;

/**
* User realloc function used in snowflake_fetch
@@ -640,7 +642,16 @@ SF_STMT *STDCALL snowflake_stmt(SF_CONNECT *sf);
*
* @return sfstmt SNOWFLAKE_STMT context for async queries.
*/
SF_STMT* STDCALL snowflake_async_stmt(SF_CONNECT *sf, const char *query_id);
SF_STMT* STDCALL snowflake_create_async_query_result(SF_CONNECT *sf, const char *query_id);

/**
* Get the status of a query
*
* @param sfstmt The SF_STMT context.
*
* @return The query status.
*/
SF_QUERY_STATUS STDCALL snowflake_get_query_status(SF_STMT *sfstmt);

/**
* Frees the memory used by a SF_QUERY_RESULT_CAPTURE struct.
134 changes: 88 additions & 46 deletions lib/client.c
@@ -98,26 +98,28 @@ SF_QUERY_STATUS get_status_from_string(const char *query_status) {

/**
* Get the metadata of the query
* @param sf the SF_CONNECT context
* @param query_id the query id
*
* @param sfstmt The SF_STMT context.
*
* @return The query metadata.
*/
char *get_query_metadata(SF_CONNECT *sf, const char *query_id) {
char *get_query_metadata(SF_STMT* sfstmt) {
cJSON *resp = NULL;
cJSON *data = NULL;
cJSON *queries = NULL;
char *s_resp = NULL;
const char *error_msg;
size_t url_size = strlen(QUERY_MONITOR_URL) -2 + strlen(query_id) + 1;
size_t url_size = strlen(QUERY_MONITOR_URL) -2 + strlen(sfstmt->sfqid) + 1;
char *status_query = (char*)SF_CALLOC(1, url_size);
sf_sprintf(status_query, url_size, QUERY_MONITOR_URL, query_id);
sf_sprintf(status_query, url_size, QUERY_MONITOR_URL, sfstmt->sfqid);

if (request(sf, &resp, status_query, NULL, 0, NULL, NULL,
GET_REQUEST_TYPE, &sf->error, SF_BOOLEAN_TRUE,
0, sf->retry_count, get_retry_timeout(sf),
if (request(sfstmt->connection, &resp, status_query, NULL, 0, NULL, NULL,
GET_REQUEST_TYPE, &sfstmt->error, SF_BOOLEAN_TRUE,
0, sfstmt->connection->retry_count, get_retry_timeout(sfstmt->connection),
NULL, NULL, NULL, SF_BOOLEAN_FALSE)) {

s_resp = snowflake_cJSON_Print(resp);
log_info("Here is JSON response:\n%s", s_resp);
log_trace("Here is JSON response:\n%s", s_resp);

data = snowflake_cJSON_GetObjectItem(resp, "data");

@@ -131,29 +133,38 @@ char *get_query_metadata(SF_CONNECT *sf, const char *query_id) {
return metadata;
}
SF_FREE(status_query);
log_trace("Error getting query metadata.");
log_error("Error getting query metadata. Query id: %s", sfstmt->sfqid);
return NULL;
}
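
The URL buffer in `get_query_metadata` is sized as the template length, minus 2 for the `%s` placeholder, plus the query id length, plus 1 for the terminating NUL. A minimal sketch of that arithmetic follows. The value of `QUERY_MONITOR_URL` is not visible in this diff, so `DEMO_MONITOR_URL` and `demo_build_monitor_url` are hypothetical stand-ins, and standard `calloc`/`snprintf` are used in place of `SF_CALLOC`/`sf_sprintf`.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical template with a single "%s"; the real QUERY_MONITOR_URL
 * is not shown in this diff. */
#define DEMO_MONITOR_URL "/monitoring/queries/%s"

/* Hypothetical helper: builds the URL for one query id.
 * Returns a heap-allocated string the caller must free, or NULL on
 * allocation failure. */
char *demo_build_monitor_url(const char *query_id) {
    assert(query_id != NULL);

    /* template length - 2 ("%s" is replaced) + id length + 1 (NUL) */
    size_t size = strlen(DEMO_MONITOR_URL) - 2 + strlen(query_id) + 1;

    char *url = calloc(1, size);
    if (url == NULL) {
        return NULL;
    }
    /* snprintf never writes more than `size` bytes, NUL included. */
    snprintf(url, size, DEMO_MONITOR_URL, query_id);
    return url;
}
```

The `- 2` only holds for a template with exactly one `%s`; a template with more placeholders would need a different size computation.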

/**
* Get the status of the query
* @param sf the SF_CONNECT context
* @param query_id the query id
*/
SF_QUERY_STATUS get_query_status(SF_CONNECT *sf, const char *query_id) {

SF_QUERY_STATUS snowflake_get_query_status(SF_STMT *sfstmt) {
SF_QUERY_STATUS ret = SF_QUERY_STATUS_NO_DATA;
char *metadata = get_query_metadata(sf, query_id);
char *metadata = get_query_metadata(sfstmt);
if (metadata) {
cJSON* metadataJson = snowflake_cJSON_Parse(metadata);

cJSON* status = snowflake_cJSON_GetObjectItem(metadataJson, "status");
if (snowflake_cJSON_IsString(status))
{
if (snowflake_cJSON_IsString(status)) {
char* queryStatus = snowflake_cJSON_GetStringValue(status);
ret = get_status_from_string(queryStatus);
}
else {
SET_SNOWFLAKE_STMT_ERROR(&sfstmt->error,
SF_STATUS_ERROR_GENERAL,
"Error retrieving the status from the metadata.",
NULL,
sfstmt->sfqid);
}
snowflake_cJSON_Delete(metadataJson);
}
else {
SET_SNOWFLAKE_STMT_ERROR(&sfstmt->error,
SF_STATUS_ERROR_GENERAL,
"Error retrieving query metadata.",
NULL,
sfstmt->sfqid);
}

return ret;
}
@@ -174,23 +185,26 @@ sf_bool is_query_still_running(SF_QUERY_STATUS query_status) {
* Get the results of the async query
* @param sfstmt The SF_STMT context
*/
void get_real_results(SF_STMT * sfstmt) {
SF_QUERY_STATUS query_status = get_query_status(sfstmt->connection, sfstmt->sfqid);
void get_real_results(SF_STMT *sfstmt) {
//Get status until query is complete or timed out
SF_QUERY_STATUS query_status = snowflake_get_query_status(sfstmt);
int retry = 0;
int no_data_retry = 0;
int no_data_max_retries = 30;
int retry_pattern[] = {1, 1, 2, 3, 4, 8, 10};
int max_retries = 7;
while (query_status != SF_QUERY_STATUS_SUCCESS) {
if (!is_query_still_running(query_status) && query_status != SF_QUERY_STATUS_SUCCESS) {
log_error("Query status is done running and did not succeed. Status is %s", query_status_names[query_status]);
log_error("Query status is done running and did not succeed. Status is %s",
query_status_names[query_status]);
return;
}
if (query_status == SF_QUERY_STATUS_NO_DATA) {
no_data_retry++;
if (no_data_retry >= no_data_max_retries) {
log_error(
"Cannot retrieve data on the status of this query. No information returned from server for queryID=%s", sfstmt->sfqid);
"Cannot retrieve data on the status of this query. No information returned from server for queryID=%s",
sfstmt->sfqid);
SET_SNOWFLAKE_STMT_ERROR(&sfstmt->error,
SF_STATUS_ERROR_GENERAL,
"Cannot retrieve data on the status of this query.",
@@ -199,28 +213,44 @@ void get_real_results(SF_STMT * sfstmt) {
return;
}
}
}
int sleep_time = retry_pattern[retry] * 500;

int sleep_time = retry_pattern[retry] * 500;
#ifdef _WIN32
Sleep(sleep_time);
Sleep(sleep_time);
#else
usleep(sleep_time * 1000);
usleep(sleep_time * 1000);
#endif
if (retry < max_retries) {
retry++;
} else {
log_error(
"Cannot retrieve data on the status of this query. Max retries hit with queryID=%s", sfstmt->sfqid);
if (retry < max_retries) {
retry++;
}
else {
log_error(
"Cannot retrieve data on the status of this query. Max retries hit with queryID=%s", sfstmt->sfqid);
}
query_status = snowflake_get_query_status(sfstmt);
}
query_status = get_query_status(sfstmt->connection, sfstmt->sfqid);

// Get query results
char query[1024];
char* query_template = "select * from table(result_scan('%s'))";
sf_sprintf(query, strlen(query_template) - 2 + strlen(sfstmt->sfqid) + 1, query_template, sfstmt->sfqid);
SF_STATUS ret = snowflake_query(sfstmt, query, strlen(query));
if (ret != SF_STATUS_SUCCESS) {
snowflake_propagate_error(sfstmt->connection, sfstmt);
}

// Get query stats
char* metadata_str = get_query_metadata(sfstmt);
if (metadata_str) {
cJSON* metadata = snowflake_cJSON_Parse(metadata_str);
cJSON* stats = snowflake_cJSON_GetObjectItem(metadata, "stats");
if (snowflake_cJSON_IsObject(stats)) {
if (sfstmt->stats) {
SF_FREE(sfstmt->stats);
}
sfstmt->stats = set_stats(stats);
}
}
}
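
One thing to check in the polling loop above: `retry_pattern` has seven entries (indices 0 to 6), while `retry` is incremented as long as it is below `max_retries` (7), so it appears `retry` can reach 7 and index one past the end of the array. A minimal sketch of a clamped lookup, assuming the intent is to keep reusing the last multiplier once the table is exhausted; `backoff_sleep_ms` and `total_backoff_ms` are hypothetical helpers, not part of this PR.

```c
#include <assert.h>
#include <stddef.h>

/* Multipliers copied from retry_pattern in get_real_results. */
static const int retry_pattern[] = {1, 1, 2, 3, 4, 8, 10};
static const size_t retry_pattern_len =
    sizeof(retry_pattern) / sizeof(retry_pattern[0]);

/* Hypothetical helper: sleep time in milliseconds for a zero-based
 * attempt. Attempts past the end of the table reuse the last entry,
 * so the index can never go out of bounds. */
int backoff_sleep_ms(size_t attempt) {
    assert(retry_pattern_len > 0);
    size_t idx = attempt < retry_pattern_len ? attempt
                                             : retry_pattern_len - 1;
    return retry_pattern[idx] * 500;
}

/* Hypothetical helper: total time slept, in milliseconds, over the
 * first `attempts` polls. */
long total_backoff_ms(size_t attempts) {
    long total = 0;
    for (size_t i = 0; i < attempts; i++) {
        total += backoff_sleep_ms(i);
    }
    return total;
}
```

Deriving the table length from `sizeof` instead of a separate `max_retries` constant keeps the bound and the array from drifting apart.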

#define _SF_STMT_TYPE_DML 0x3000
@@ -1739,7 +1769,7 @@ SF_STMT *STDCALL snowflake_stmt(SF_CONNECT *sf) {
return sfstmt;
}

SF_STMT *STDCALL snowflake_async_stmt(SF_CONNECT *sf, const char *query_id) {
SF_STMT *STDCALL snowflake_create_async_query_result(SF_CONNECT *sf, const char *query_id) {
if (!sf) {
return NULL;
}
@@ -1749,18 +1779,8 @@ SF_STMT *STDCALL snowflake_async_stmt(SF_CONNECT *sf, const char *query_id) {
_snowflake_stmt_reset(sfstmt);
sfstmt->connection = sf;
sf_strcpy(sfstmt->sfqid, SF_UUID4_LEN, query_id);
}

get_real_results(sfstmt);

char *metadata_str = get_query_metadata(sfstmt->connection, query_id);
if (metadata_str) {
cJSON* metadata = snowflake_cJSON_Parse(metadata_str);
cJSON* stats = snowflake_cJSON_GetObjectItem(metadata, "stats");
if (snowflake_cJSON_IsObject(stats)) {
_snowflake_stmt_row_metadata_reset(sfstmt);
sfstmt->stats = set_stats(stats);
}
sfstmt->is_async = SF_BOOLEAN_TRUE;
sfstmt->is_async_initialized = SF_BOOLEAN_FALSE;
}

return sfstmt;
@@ -1941,6 +1961,11 @@ SF_STATUS STDCALL snowflake_fetch(SF_STMT *sfstmt) {
return SF_STATUS_ERROR_STATEMENT_NOT_EXIST;
}

if (sfstmt->is_async && !sfstmt->is_async_initialized) {
get_real_results(sfstmt);
sfstmt->is_async_initialized = SF_BOOLEAN_TRUE;
}

clear_snowflake_error(&sfstmt->error);
SF_STATUS ret = SF_STATUS_ERROR_GENERAL;
sf_bool get_chunk_success = SF_BOOLEAN_TRUE;
@@ -2634,13 +2659,24 @@ int64 STDCALL snowflake_num_rows(SF_STMT *sfstmt) {
return -1;
}

if (sfstmt->is_async && !sfstmt->is_async_initialized) {
get_real_results(sfstmt);
sfstmt->is_async_initialized = SF_BOOLEAN_TRUE;
}

return sfstmt->total_rowcount;
}

int64 STDCALL snowflake_num_fields(SF_STMT *sfstmt) {
if (!sfstmt) {
return -1;
}

if (sfstmt->is_async && !sfstmt->is_async_initialized) {
get_real_results(sfstmt);
sfstmt->is_async_initialized = SF_BOOLEAN_TRUE;
}

return sfstmt->total_fieldcount;
}

@@ -2649,6 +2685,12 @@ uint64 STDCALL snowflake_num_params(SF_STMT *sfstmt) {
// TODO change to -1?
return 0;
}

if (sfstmt->is_async && !sfstmt->is_async_initialized) {
get_real_results(sfstmt);
sfstmt->is_async_initialized = SF_BOOLEAN_TRUE;
}

ARRAY_LIST *p = (ARRAY_LIST *) sfstmt->params;
return p->used;
}
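
The same lazy-initialization check (`is_async && !is_async_initialized`) now appears in `snowflake_fetch`, `snowflake_num_rows`, `snowflake_num_fields` and `snowflake_num_params`. A minimal sketch of how it could be factored into one helper follows. Everything prefixed `demo_` is hypothetical: `demo_stmt` stands in for `SF_STMT` with only the fields needed here, and `demo_load_results` stands in for `get_real_results`.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for SF_STMT. */
typedef struct {
    bool is_async;
    bool is_async_initialized;
    long total_rowcount;
    int load_calls; /* how many times results were loaded */
} demo_stmt;

/* Stand-in for get_real_results: pretends the query returned 3 rows. */
void demo_load_results(demo_stmt *stmt) {
    stmt->total_rowcount = 3;
    stmt->load_calls++;
}

/* Hypothetical helper holding the repeated check in one place. */
void demo_ensure_async_results(demo_stmt *stmt) {
    assert(stmt != NULL);
    if (stmt->is_async && !stmt->is_async_initialized) {
        demo_load_results(stmt);
        stmt->is_async_initialized = true;
    }
}

/* Mirrors the shape of snowflake_num_rows after this change. */
long demo_num_rows(demo_stmt *stmt) {
    if (stmt == NULL) {
        return -1;
    }
    demo_ensure_async_results(stmt);
    return stmt->total_rowcount;
}
```

With the check in a single function, each accessor needs only one call, and the results are loaded at most once no matter which accessor is called first.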
4 changes: 2 additions & 2 deletions tests/test_async.c
@@ -79,7 +79,7 @@ void test_new_connection(void** unused) {
}
assert_int_equal(status, SF_STATUS_SUCCESS);

SF_STMT* async_sfstmt = snowflake_async_stmt(sf, sfqid);
SF_STMT* async_sfstmt = snowflake_create_async_query_result(sf, sfqid);

/* get results */
int64 out = 0;
@@ -128,7 +128,7 @@ void test_invalid_query_id(void** unused) {
assert_int_equal(status, SF_STATUS_SUCCESS);

char* fake_sfqid = "fake-query-id";
SF_STMT* async_sfstmt = snowflake_async_stmt(sf, fake_sfqid);
SF_STMT* async_sfstmt = snowflake_create_async_query_result(sf, fake_sfqid);

assert_non_null(async_sfstmt);
assert_non_null(async_sfstmt->connection);