-
Notifications
You must be signed in to change notification settings - Fork 14
Open
Description
Right now this line
Line 519 in 127aa0f
| scratch.as_slice_mut().add(full_matrix.row(i), 1); |
is extremely hot. I would even say it's one of the main bottlenecks to computing at large stems. Looking at active threads with
top and running pstack on them periodically, it seems I usually have only a handful of threads running, and they are all executing that line. (See #101 for more detailed logging)
If I'm reading this correctly, this code looks like it's doing some vector-vector operations repeatedly. Would this be a good candidate to replace with vector-matrix or matrix-matrix operations? In the latter case #100 will probably give a massive speedup.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels