Skip to content

Conversation

pghysels
Copy link
Contributor

@pghysels pghysels commented Sep 3, 2025

Motivation

This allows skipping the kernel for matrix initialization in hipblaslt-bench using the environment variable TENSILE_DB2=0x1, which is also used to skip GEMM kernel execution.
Useful for testing host code if a specific GPU is not available.

Submission Checklist

@pghysels pghysels requested a review from a team as a code owner September 3, 2025 00:50
@@ -126,6 +128,12 @@ void hipblaslt_init_device(ABC_dims abc,
size_t stride,
size_t batch_count)
{
if(TensileLite::Debug::Instance().skipKernelLaunch())
{
std::cout << "DEBUG: Skip execution of kernel for matrix initialization" << std::endl;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why do we need this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same reason why you would want to skip GEMM kernel execution (both with TENSILE_DB2=0x1). Perhaps for testing host code when a specific GPU is not available. This was requested by @bethune-bryant

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We shouldn't use TensileLite::Debug outside tensilelite.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tensile runtime env shouldn't be used by hipblaslt-bench tool. Don't mix those up.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why not?

eidenyoshida pushed a commit that referenced this pull request Sep 3, 2025
- Fix groovy multi-gpu test script.

[ROCm/rocFFT commit: 68037a6]
Copy link
Contributor

@KKyang KKyang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Don't use TensileLite::Debug outside Tensilelite.

@pghysels pghysels closed this Sep 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants