-
Notifications
You must be signed in to change notification settings - Fork 7
Open
Description
I just tested out the quickstart on Google Colab but it just hangs.
- Sign up for Google Colab
- Choose a T4 Runtime type
- Run
!pip install nsight-python - Run the quickstart code
import torch
import nsight
@nsight.analyze.kernel
def benchmark_matmul(n):
"""
The simplest possible benchmark.
We create two matrices and multiply them.
"""
# Create two NxN matrices on GPU
a = torch.randn(n, n, device="cuda")
b = torch.randn(n, n, device="cuda")
# Mark the kernel we want to profile
with nsight.annotate("matmul"):
c = a @ b
return c
result = benchmark_matmul(1024)After running for 10 mins it is still executing.

Metadata
Metadata
Assignees
Labels
No labels