Replies: 3 comments 5 replies
-
Is this feature primarily intended for recursive directory listing ?
The drawback is that determining an optimal splitting strategy may be challenging, but this challenge also applies in manual partitioning(maybe users have prior knowledge to guide their partitioning strategy). |
Beta Was this translation helpful? Give feedback.
-
This is great, we are currently simulating concurrent listing by manually listing different prefix, this API will fit our requirement very well. BTW, here's an example for reference, OSS cost about 1 second to perform a single |
Beta Was this translation helpful? Give feedback.
-
I skimmed s3 list objects v2. I am not sure if s3 will return objects as contents or common prefixes. Will check this. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
The issue of adding concurrent support for opendal list has been discussed for a long time: #987.
Listing a large prefix on S3 is a very slow process, as demonstrated in this image.
It would be a great improvement for OpenDAL to support concurrent list. The challenge is how to offer this as a user-friendly and effective API.
My current idea is that we can introduce a
partitions
option for list. The partitions must not overlap to ensure that the list returns only unique results.The usage will be like:
opendal will start three list underhood:
list("dir/")
,list("dir/a")
,list("dir/m")
. Users can speed up the process by addingconcurrent
for them:After adding
partitions
, the return results is not in order anymore.What do you think?
Beta Was this translation helpful? Give feedback.
All reactions