Conversation

@NimaSarajpoor
Collaborator

This PR addresses issue #22. The proposed pyfftw-based sdp will first be added as challenger_sdp.py so that we can compare it against the existing pyfftw_sdp. Once we are confident that there are no remaining concerns, we will move it to pyfftw_sdp.
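
For context, the FFT-based sliding dot product is essentially a cross-correlation of Q and T computed in the frequency domain. A minimal numpy-only sketch of the idea (illustrative only; the actual challenger_sdp.py uses pyfftw plans rather than numpy.fft):

import numpy as np

def sliding_dot_product(Q, T):
    # Illustrative only: returns QT with QT[i] == np.dot(Q, T[i : i + len(Q)]).
    m, n = len(Q), len(T)
    fft_len = n + m - 1  # full linear convolution length (a "fast" length could be used instead)
    # Correlating Q against T is the same as convolving T with the reversed query.
    QT = np.fft.irfft(np.fft.rfft(T, fft_len) * np.fft.rfft(Q[::-1], fft_len), fft_len)
    return QT[m - 1 : n]  # keep only the fully-overlapping ("valid") windows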

@NimaSarajpoor
Collaborator Author

timing.sh:

#!/bin/bash

rm -rf sdp/__pycache__
./timing.py -timeout 5.0 -pmin 2 -pmax 24 -pdiff 100 pyfftw challenger > timing.csv
rm -rf sdp/__pycache__

The performance of challenger relative to the existing pyfftw_sdp:

[figure: challenger-performance]

Observation
The proposed challenger is approximately 1.25x-2x faster than pyfftw_sdp for large T, where len(T) >= 2^15.

@NimaSarajpoor
Collaborator Author

NimaSarajpoor commented Dec 12, 2025

@seanlaw
If there are no particular concerns about the logic, please let me know and I will move the script to pyfftw_sdp.

@seanlaw
Contributor

seanlaw commented Dec 12, 2025

@NimaSarajpoor Please go ahead!

@NimaSarajpoor
Collaborator Author

NimaSarajpoor commented Dec 12, 2025

The script for pyfftw_sdp has been updated to reflect the proposal above. Given this change to our baseline performance, it is worth re-evaluating the performance of the other modules relative to this new baseline. The following figure shows the performance of pocketfft_r2c_c2r (blue plot) and scipy_oaconvolve (orange plot) benchmarked against the new baseline, pyfftw_sdp.

#!/bin/bash

rm -rf sdp/__pycache__
./timing.py -timeout 5.0 -pmin 6 -pmax 24 pyfftw pocketfft_r2c_c2r scipy_oaconvolve > timing.csv
rm -rf sdp/__pycache__

[figure: pyfftw-vs-others-sdp]

Observations

  • In all cases, pyfftw_sdp outperforms pocketfft_r2c_c2r_sdp.
  • For len(Q) < 2^15, scipy_oaconvolve_sdp outperforms the (single-threaded) pyfftw_sdp when len(T) >> len(Q) (a sketch of the oaconvolve-based idea follows this list).
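
For reference, the oaconvolve-based approach boils down to a "valid"-mode overlap-add convolution of T with the reversed query. A rough sketch of the idea (not the exact scipy_oaconvolve_sdp code):

from scipy.signal import oaconvolve

def sliding_dot_product_oa(Q, T):
    # Correlation expressed as convolution with the reversed query; "valid"
    # keeps only the windows where Q fully overlaps T.
    return oaconvolve(T, Q[::-1], mode="valid")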

Conclusion
For len(Q) >= 2^6, our focus can be on the following two modules:

  • scipy_oaconvolve_sdp
  • pyfftw_sdp

I think it is worth checking whether a multi-threaded pyfftw_sdp can outperform scipy_oaconvolve_sdp.

@NimaSarajpoor
Collaborator Author

I think it is worth evaluating the performance of multi-threaded pyfftw_sdp to see whether it can outperform scipy_oaconvolve_sdp.

The baseline implementation of pyfftw_sdp is single-threaded. In what follows, I explore multi-threaded variants of pyfftw_sdp, where the number of threads (self.threads, see the line below) is set to 2, 4, and 8:

self.threads = 1
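
In pyfftw, the thread count is a parameter of the plan itself. A minimal sketch of how such a knob is typically wired (hypothetical helper, not the exact pyfftw_sdp.py code):

import pyfftw

def build_rfft_plan(fft_len, threads=1):
    # Hypothetical helper: builds a reusable real-to-complex FFT plan of
    # length `fft_len`; `threads` is the knob varied below (2, 4, and 8).
    buf = pyfftw.empty_aligned(fft_len, dtype="float64")
    return pyfftw.builders.rfft(buf, n=fft_len, threads=threads)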

The benchmark was run using the following timing.sh script:

#!/bin/bash

rm -rf sdp/__pycache__
./timing.py -timeout 5.0 -pmin 6 -pmax 24 pyfftw challenger_2threads challenger_4threads challenger_8threads scipy_oaconvolve > timing.csv
rm -rf sdp/__pycache__

Results

[figure: pyfftwThreads-vs-oaconvolve]

Observations

  • Multi-threaded pyfftw_sdp does not consistently outperform the single-threaded baseline. In fact, for shorter input arrays, the single-threaded version often performs better.
  • pyfftw_sdp with 4 or 8 threads outperforms scipy_oaconvolve.

Side note
It is important to clarify a key assumption underlying this comparison: the FFT planning time in pyfftw is excluded. This assumes that the FFT is applied repeatedly to inputs of the same size, in which case the one-time planning cost (which can also be paid in advance) can reasonably be ignored.
Under this assumption, single-threaded / multi-threaded pyfftw_sdp seems to be the solution we are looking for. However, if the application requires FFTs on new, unforeseen large input sizes, then scipy_oaconvolve may be the better choice, as it does not spend time on a "planning" phase.
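
As a rough illustration of that plan-once / execute-many assumption (hypothetical code, not pyfftw_sdp.py itself):

import numpy as np
import pyfftw

n = 2**20
buf = pyfftw.empty_aligned(n, dtype="float64")

# One-time planning cost; this can also be paid in advance and cached.
plan = pyfftw.builders.rfft(buf, n=n)

# Repeated executions on same-sized inputs reuse the plan; this is what the
# benchmark above measures.
for _ in range(10):
    buf[:] = np.random.rand(n)
    spectrum = plan()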

@NimaSarajpoor
Collaborator Author

@seanlaw
Do you think this PR is ready to be merged?

@seanlaw
Contributor

seanlaw commented Dec 14, 2025

Do you think this PR is ready to be merged?

Please give me some time to review it more thoroughly

Contributor

@seanlaw seanlaw left a comment

@NimaSarajpoor I basically have one primary suggestion for you to consider but I am not married to it (see below)

Contributor

@seanlaw seanlaw left a comment

@NimaSarajpoor I've left some comments for you to consider

@NimaSarajpoor
Collaborator Author

@seanlaw
I've addressed the comments. I've also improved the comments and docstrings in pyfftw_sdp.py. Additionally, I improved the comment in the new test function that was recently added to test.py.

Contributor

@seanlaw seanlaw left a comment

@NimaSarajpoor I think this is ready to be merged. I left a comment for you to consider but feel free to ignore if you disagree as I do not feel strongly about it

Excerpt under review:

Sliding dot product between `Q` and `T`.
"""
m = Q.shape[0]
if self.n != T.shape[0]:
Contributor

I don't think this if statement is needed. The lines below can still be executed regardless of whether the length has changed.

Collaborator Author

Right... the only reason for having that if statement is to avoid calling pyfftw.next_fast_len when possible. It should be fine to remove it, though, as doing so results in only a slight (0%-10%) drop in performance for lengths 2^18 to 2^21.
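
For context, pyfftw.next_fast_len returns the next "fast" transform length that is at least as large as its argument, so the if statement is just caching its result (a tiny self-contained illustration):

import pyfftw

n = 2**18 + 1
fft_len = pyfftw.next_fast_len(n)  # next "fast" FFT length >= n
# Caching this value between calls (the role of the if statement above) avoids
# repeating the lookup whenever the length of T has not changed.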
