[cuDNN] Add cudnn conv2d #435
Conversation
Thanks @yudi0201 !
Overall looks good to me. After merging this PR, we can add a primitive function that calls conv2d_cudnn in our runtime library and expose an operator like hidet.ops.conv2d_cudnn.
src/hidet/runtime/cuda/cudnn.cpp
Outdated
void *dev_ptrs[3] = {ptr_x, ptr_w, ptr_y};  // device pointers
int64_t uids[3] = {'x', 'w', 'y'};
void *workspace = hidet_cuda_malloc_async(workspaceSize, cur_stream);
It might be better to use the workspace shared by all hidet operators (i.e., https://github.com/hidet-org/hidet/blob/main/include/hidet/runtime/cuda/context.h#L46).
When we run the operator a second time, there will be no memory allocation, so it can also be used with cudaGraph.
CHECK_CUDNN(cudnnBackendDestroyDescriptor(xDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(wDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(yDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(cDesc));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(fprop));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(op_graph));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(engine));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(engcfg));
CHECK_CUDNN(cudnnBackendDestroyDescriptor(plan));
It might be good to benchmark our implementation against PyTorch's conv2d. I am not sure whether the overhead of creating/destroying descriptors is large enough to affect performance.
@yaoyaoding
It's doable, similar to cublas gemm: 072a606
What about adding cudnn*, cublas* etc. to the search space?
That's exactly what the commit I mentioned before does.
If cuDNN needs to be installed, could it be added to the README? It doesn't seem to be included in the CUDA Toolkit: link
We can add the
Similar to other packages like cublas.
Hi @c-fteixeira, the CI seems unable to initialize the VM for the tests. Could you help us take a look? Thank you!
LGTM, thanks @yudi0201 !
No description provided.