Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sparse #1

Open
wants to merge 38 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
38 commits
Select commit Hold shift + click to select a range
c1a3cd7
rebsing
ABenC377 Oct 11, 2024
21366b4
rebsing
ABenC377 Oct 11, 2024
f2ed11f
Implementing cuSPARSE kernel
ABenC377 Mar 13, 2024
c208246
Trying to work out CSR malloc bug
ABenC377 Mar 13, 2024
de14a56
Trying to work out CSR malloc bug
ABenC377 Mar 13, 2024
49cddf0
cuSPARSE unified memory implementation
ABenC377 Mar 19, 2024
37ce8b4
Now compiles
ABenC377 Mar 21, 2024
143c1c0
Now compiles with fewer runtime errors
ABenC377 Mar 25, 2024
bcd7ae8
rebasing
ABenC377 Oct 11, 2024
2ffee16
All implemented and running. No checksum at the end
ABenC377 Mar 26, 2024
064ec57
Removing print statements
ABenC377 Mar 26, 2024
88a053f
Removing print statements
ABenC377 Mar 29, 2024
5b04a2c
rebasing
ABenC377 Apr 1, 2024
23d318b
rebasing
ABenC377 Oct 11, 2024
be9094c
rebasing
ABenC377 Oct 11, 2024
7cfa7be
Tidying up spGEMM classes to remove duplicated code
ABenC377 Apr 3, 2024
30d384e
rebasing
ABenC377 Oct 11, 2024
cc8e2a8
rebasing
ABenC377 Oct 11, 2024
de56ae1
rebasing
ABenC377 Oct 11, 2024
b972c23
rebasing
ABenC377 Oct 11, 2024
1f5f2dd
rebasing
ABenC377 Oct 11, 2024
b06250c
rebasing
ABenC377 Oct 11, 2024
42bdc58
Adding AOCL files
ABenC377 Aug 21, 2024
521cbf3
Working changes
ABenC377 Oct 1, 2024
a8e5c46
Adding AOCL files
ABenC377 Oct 11, 2024
9eb4646
No longer overwriting B_
ABenC377 Oct 3, 2024
7f82b7d
Adding AOCL files
ABenC377 Oct 11, 2024
0130b81
Adding AOCL files
ABenC377 Oct 11, 2024
4581637
Providing armpl with hints
ABenC377 Oct 11, 2024
477b7a0
Updating createGflopsGraphs.py to show sparsity
ABenC377 Oct 21, 2024
407c008
Beginning gemv ARMPL
ABenC377 Oct 21, 2024
8934588
Beginning gemv ARMPL
ABenC377 Oct 21, 2024
2e61261
still trying to figure out segfault...
ABenC377 Jan 2, 2025
bc70814
Getting rid of old oneMKL sparse file
ABenC377 Jan 2, 2025
52d5e91
Refactoring to make individual files relate to a single kernel
ABenC377 Jan 7, 2025
7819f6f
Moving spgemv into new format
ABenC377 Jan 8, 2025
d7ad2b7
Finishing off armpl and cusparse kernels
ABenC377 Jan 9, 2025
8bc9125
Finishing off OneMKL CPU support
ABenC377 Jan 14, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added .DS_Store
Binary file not shown.
2 changes: 2 additions & 0 deletions .idea/GPU-BLAS-Offload-Benchmark.iml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

5 changes: 5 additions & 0 deletions .idea/codeStyles/codeStyleConfig.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions .idea/misc.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

8 changes: 8 additions & 0 deletions .idea/modules.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

6 changes: 6 additions & 0 deletions .idea/vcs.xml

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

634 changes: 634 additions & 0 deletions .idea/workspace.xml

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions AOCL/gemm.hh
Original file line number Diff line number Diff line change
Expand Up @@ -23,6 +23,7 @@ class gemm_cpu : public gemm<T> {
private:
/** Make call to the GEMM kernel. */
void callGemm() override {

if constexpr (std::is_same_v<T, float>) {
bli_sgemm(BLIS_NO_TRANSPOSE, BLIS_NO_TRANSPOSE, m_, n_, k_, &alpha, A_,
rowStride, std::max(1, m_), B_, rowStride, std::max(1, k_),
Expand Down
Loading