Implement locked iteration for PyList #4789

ngoldbaum · 2024-12-10T22:09:57Z

Re-enables get_item_unchecked on the free-threaded build (with a new free-threaded-specific note about safety), adds locked_for_each, and implements a number of iterator methods for BouldListIterator on the free-threaded build to amortize synchronization overhead where possible.

Largely follows the implementation and tests from #4439, along with fixes similar to the ones I implemented for #4788.

src/types/list.rs

ngoldbaum · 2024-12-12T18:41:24Z

I was chatting with @epilys on an IRC channel we both use and he had a suggestion to avoid the inner struct. I've included a commit from him implementing that.

mejrs · 2024-12-13T00:01:23Z

src/types/list.rs

+macro_rules! split_borrow {
+    ($instance:expr, $index:ident, $length:ident, $list:ident) => {
+        let Self {
+            ref mut $index,
+            ref mut $length,
+            ref $list,
+        } = $instance;
+    };
 }


rather than just using this to split the borrow, maybe we should just have a macro that wraps

crate::sync::with_critical_section(list, || { ... })

as well?

I tried to do this this afternoon and got stuck. I'm pretty novice at writing macros - do you know of any similar examples I could look at somewhere?

Here's what I tried, but this doesn't compile if I used it in e.g. the fold implementation: https://gist.github.com/ngoldbaum/13ac11629f042ae0bee84559e4e8bb31

@ngoldbaum was the error about identifiers not being found? If yes it's because Rust macros are hygienic, all referenced identifiers in a macro argument must be defined outside of the macro. What you can do is something like this:

/// Helper to manage mutable borrows below macro_rules! split_borrow { ($instance:expr, $index:ident, $length:ident, $list:ident) => { let Self { ref mut $index, ref mut $length, ref $list, } = $instance; }; } macro_rules! op_with_critical_section { ($instance:expr, $fn:expr) => {{ split_borrow!($instance, index, length, list); crate::sync::with_critical_section(list, || $fn(index, length, list)) }}; } impl<'py> Iterator for BoundListIterator<'py> { type Item = Bound<'py, PyAny>; fn fold<B, F>(mut self, init: B, mut f: F) -> B where Self: Sized, F: FnMut(B, Self::Item) -> B, { op_with_critical_section!(self, |index: &mut Index, length: &mut Length, list| { let mut accum = init; while let Some(x) = unsafe { Self::next_unchecked(&mut *index, &mut *length, list) } { accum = f(accum, x); } accum }) } }

Though at this point I'm not sure the macro redirections makes it more readable anymore...

Personally I'd drop the macros and suggest we have

impl<'py> BoundListIterator<'py> { fn with_critical_section<R>(&mut self, f: impl FnOnce(&mut Index, &mut Length, &Bound<'py, PyList>) -> R) -> R { let Self { index, length, list } = self; crate::sync::with_critical_section(list, || $fn(index, length, list)) }

mejrs · 2024-12-13T00:23:05Z

src/types/list.rs

+
+        crate::sync::with_critical_section(list, || {
+            let mut accum = init;
+            while let Some(x) = unsafe { Self::next_unchecked(index, length, list) } {


Is it really safe to call next_unchecked here (and elsewhere)? Can't the closure modify the list?

More thoughts about thread safety here:

It is possible for another thread to try to acquire a critical section on the list, but we hold a critical section here so that thread will block until this thread exits the critical section.

That means the index and length are correct going into f, despite us getting them without any synchronization in next_unchecked.

99% of the time, the critical section never getd released. It does get released if f creates a new innermost critical section on the list, but then only this thread can access the list still. It is possible that fold is getting called recursively, in which case this thread would create new innermost critical sections until the recursion terminates.

(did some edits of the text above to drop irrelevant references to the GIL)

ngoldbaum · 2024-12-13T16:52:22Z

src/types/list.rs

+        length: &mut Length,
+        list: &Bound<'py, PyList>,
+    ) -> Option<Bound<'py, PyAny>> {
+        let length = length.0.min(list.len());


@mejrs if a closure updates the length and the old length stored in the iterator is out-of-bounds, this step means the index.0 < length check below is False so the iterator returns None and the iteration terminates.

We hold a critical section so other threads can't modify the list between here and the next get_item_unchecked call.

Does that make sense? Are you worried about other scenarios?

In any case, I'll try to add a test where the closure modifies the list to see what happens....

See test_iter_fold_out_of_bounds added in the last commit.

davidhewitt

Thanks, this looks like it's needed before we can land #4810, additionally it'd be great to include this in 0.23.4. Sorry for the very slow review!

I think main suggestions from me is that we can simplify away the macros and also we should probably use the _unchecked paths in PyPy.

davidhewitt · 2025-01-03T11:18:44Z

src/types/list.rs

-    #[cfg(not(any(Py_LIMITED_API, Py_GIL_DISABLED)))]
+    /// On the free-threaded build, caller must verify they have exclusive access to the list
+    /// via a lock or by holding the innermost critical section on the list.
+    #[cfg(not(any(Py_LIMITED_API)))]


Suggested change

#[cfg(not(any(Py_LIMITED_API)))]

#[cfg(not(Py_LIMITED_API))]

src/types/list.rs

davidhewitt · 2025-01-03T11:36:54Z

src/types/list.rs

+macro_rules! split_borrow {
+    ($instance:expr, $index:ident, $length:ident, $list:ident) => {
+        let Self {
+            ref mut $index,
+            ref mut $length,
+            ref $list,
+        } = $instance;
+    };
 }


Personally I'd drop the macros and suggest we have

impl<'py> BoundListIterator<'py> { fn with_critical_section<R>(&mut self, f: impl FnOnce(&mut Index, &mut Length, &Bound<'py, PyList>) -> R) -> R { let Self { index, length, list } = self; crate::sync::with_critical_section(list, || $fn(index, length, list)) }

davidhewitt · 2025-01-03T11:37:50Z

src/types/list.rs

@@ -493,14 +604,21 @@ impl<'py> Iterator for BoundListIterator<'py> {

    #[inline]
    fn next(&mut self) -> Option<Self::Item> {
-        let length = self.length.min(self.list.len());
+        split_borrow!(self, index, length, list);


Especially if we use my suggestion from above to avoid macros on the critical sections, I think this can just be:

Suggested change

split_borrow!(self, index, length, list);

let Self { index, length, list } = self;

Inline ListIterImpl implementations by using split borrows and destructuring let Self { .. } = self destructuring inside BoundListIterator impls. Signed-off-by: Manos Pitsidianakis <[email protected]>

* implement locked iteration for PyList * fix limited API and PyPy support * fix formatting of safety docstrings * only define fold and rfold on not(feature = "nightly") * add missing try_fold implementation on nightly * Use split borrows for locked iteration for PyList Inline ListIterImpl implementations by using split borrows and destructuring let Self { .. } = self destructuring inside BoundListIterator impls. Signed-off-by: Manos Pitsidianakis <[email protected]> * use a function to do the split borrow * add changelog entries * fix clippy on limited API and PyPy * use a macro for the split borrow * add a test that mutates the list during a fold * enable next_unchecked on PyPy * fix incorrect docstring for locked_for_each * simplify borrows by adding BoundListIterator::with_critical_section * fix build on GIL-enabled and limited API builds * fix docs build on MSRV --------- Signed-off-by: Manos Pitsidianakis <[email protected]> Co-authored-by: Manos Pitsidianakis <[email protected]>

ngoldbaum mentioned this pull request Dec 10, 2024

Add locked iterations APIs for dicts and lists #4571

Closed

ngoldbaum force-pushed the pylist-locking branch 2 times, most recently from 2fd3e6d to 7d3fad0 Compare December 11, 2024 19:29

ngoldbaum added the free-threading label Dec 11, 2024

ngoldbaum force-pushed the pylist-locking branch 2 times, most recently from d1d1824 to 2692d89 Compare December 11, 2024 20:59

ngoldbaum commented Dec 11, 2024

View reviewed changes

src/types/list.rs Show resolved Hide resolved

mejrs reviewed Dec 13, 2024

View reviewed changes

ngoldbaum commented Dec 13, 2024

View reviewed changes

davidhewitt reviewed Jan 3, 2025

View reviewed changes

davidhewitt mentioned this pull request Jan 3, 2025

release: 0.23.4 #4835

Open

ngoldbaum and others added 14 commits January 6, 2025 10:35

implement locked iteration for PyList

a23499a

fix limited API and PyPy support

6b97a56

fix formatting of safety docstrings

2ef8f17

only define fold and rfold on not(feature = "nightly")

b911887

add missing try_fold implementation on nightly

4e2fae6

Use split borrows for locked iteration for PyList

0ee19bf

Inline ListIterImpl implementations by using split borrows and destructuring let Self { .. } = self destructuring inside BoundListIterator impls. Signed-off-by: Manos Pitsidianakis <[email protected]>

use a function to do the split borrow

d7f6abf

add changelog entries

6c99a5c

fix clippy on limited API and PyPy

f56802c

use a macro for the split borrow

0a57207

add a test that mutates the list during a fold

e6809f7

enable next_unchecked on PyPy

a27288a

fix incorrect docstring for locked_for_each

f57bab1

simplify borrows by adding BoundListIterator::with_critical_section

2967b21

ngoldbaum force-pushed the pylist-locking branch from 5b52e68 to 2967b21 Compare January 6, 2025 18:43

fix build on GIL-enabled and limited API builds

994bc49

ngoldbaum added this pull request to the merge queue Jan 8, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 8, 2025

ngoldbaum enabled auto-merge January 8, 2025 20:29

ngoldbaum added this pull request to the merge queue Jan 8, 2025

github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 8, 2025

fix docs build on MSRV

5562368

ngoldbaum force-pushed the pylist-locking branch from ccffdcb to 5562368 Compare January 8, 2025 22:34

ngoldbaum enabled auto-merge January 8, 2025 22:44

ngoldbaum added this pull request to the merge queue Jan 8, 2025

Merged via the queue into PyO3:main with commit c0f08c2 Jan 8, 2025
45 of 46 checks passed

ngoldbaum deleted the pylist-locking branch January 8, 2025 23:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement locked iteration for PyList #4789

Implement locked iteration for PyList #4789

ngoldbaum commented Dec 10, 2024 •

edited

Loading

ngoldbaum commented Dec 12, 2024

mejrs Dec 13, 2024 •

edited

Loading

ngoldbaum Dec 13, 2024

epilys Dec 14, 2024

davidhewitt Jan 3, 2025

mejrs Dec 13, 2024

ngoldbaum Dec 13, 2024 •

edited

Loading

ngoldbaum Dec 13, 2024 •

edited

Loading

ngoldbaum Dec 13, 2024

davidhewitt left a comment

davidhewitt Jan 3, 2025

davidhewitt Jan 3, 2025

davidhewitt Jan 3, 2025

	split_borrow!(self, index, length, list);
	let Self { index, length, list } = self;

Implement locked iteration for PyList #4789

Implement locked iteration for PyList #4789

Conversation

ngoldbaum commented Dec 10, 2024 • edited Loading

ngoldbaum commented Dec 12, 2024

mejrs Dec 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ngoldbaum Dec 13, 2024 • edited Loading

Choose a reason for hiding this comment

ngoldbaum Dec 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

davidhewitt left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ngoldbaum commented Dec 10, 2024 •

edited

Loading

mejrs Dec 13, 2024 •

edited

Loading

ngoldbaum Dec 13, 2024 •

edited

Loading

ngoldbaum Dec 13, 2024 •

edited

Loading