bugfix: await on size, assuming it can be an async function #1281

mukhery · 2023-05-29T16:57:02Z

Checking size may require a call to the async _info function. On line 277, _cp_file checks if f1.size is async (callback.set_size(await maybe_await(f1.size))), but then later it simply references f1.size which fails if this function is async. As such, this PR corrects the code to ensure f1.size references are wrapped in await maybe_await().

mukhery · 2023-06-01T13:07:52Z

@martindurant what do you think?

martindurant · 2023-06-01T13:12:21Z

I think you are right in your argument. However, I have a feeling that f.size is never async actually, so it won't matter. open_async/AsyncStreamedFiles could use work!

fsspec/generic.py

mukhery · 2023-06-01T13:20:29Z

I think you are right in your argument. However, I have a feeling that f.size is never async actually, so it won't matter. open_async/AsyncStreamedFiles could use work!

fsspec/s3fs#742 makes it async in order to call the async _info needed for _cp_file

martindurant · 2023-06-01T13:23:47Z

Funny, because S3 is the only one where for sure we should know the size before starting the download: the content-size header is always populated and, possibly outside compressive transcoding, will be correct.

mukhery · 2023-06-10T15:28:27Z

Funny, because S3 is the only one where for sure we should know the size before starting the download: the content-size header is always populated and, possibly outside compressive transcoding, will be correct.

Perhaps I should change https://github.com/fsspec/filesystem_spec/blob/master/fsspec/generic.py#L277 to just remove the assumption that f1.size could be async? Or do you think this still might be useful for other implementations even though it isn't an issue for s3?

martindurant · 2023-06-10T17:36:28Z

maybe_await should never be harmful and not add any overhead

await on size, assuming it can be an async function

8b2bf0c

mukhery mentioned this pull request May 29, 2023

add size property to enable _cp_file fsspec/s3fs#742

Closed

mukhery commented Jun 1, 2023

View reviewed changes

fsspec/generic.py Outdated Show resolved Hide resolved

black

69dd8af

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bugfix: await on size, assuming it can be an async function #1281

bugfix: await on size, assuming it can be an async function #1281

mukhery commented May 29, 2023

mukhery commented Jun 1, 2023

martindurant commented Jun 1, 2023

mukhery commented Jun 1, 2023

martindurant commented Jun 1, 2023

mukhery commented Jun 10, 2023

martindurant commented Jun 10, 2023

bugfix: await on size, assuming it can be an async function #1281

Are you sure you want to change the base?

bugfix: await on size, assuming it can be an async function #1281

Conversation

mukhery commented May 29, 2023

mukhery commented Jun 1, 2023

martindurant commented Jun 1, 2023

mukhery commented Jun 1, 2023

martindurant commented Jun 1, 2023

mukhery commented Jun 10, 2023

martindurant commented Jun 10, 2023