Question about pointer arrays for grouped GEMM in cublasLtMatmul #308

@Shmilkky

I am trying to understand how grouped GEMM is represented in cuBLASLt.

My confusion is the following:

For grouped GEMM, each operation in the group should have its own matrix pointers A[i], B[i], and C[i].
But from the cublasLtMatmul function signature, A/B/C seem to be single pointers rather than pointer-to-pointer arguments.

Because of this, I am unsure which of the following is correct:

  1. The API expects pointer arrays for A/B/C.
    If yes, are these arrays located in host memory or device memory?
  2. The API expects a single pointer, and grouped GEMM is expressed through offsets or strides.
  3. There is another grouped GEMM-specific mechanism in cuBLASLt that I may be missing.
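To make options 1 and 2 concrete, here is a minimal host-only sketch of the two addressing schemes I have in mind. All helper names are hypothetical and none of them are cuBLASLt calls; the sketch only illustrates the difference between the two representations and makes no claim about which one cublasLtMatmul actually uses.

```cpp
#include <cstddef>
#include <vector>

// Option 1 (hypothetical helper): one pointer per problem, collected in an array.
// The buffers may live anywhere and need not be evenly spaced.
const float* select_by_pointer_array(const float* const* ptrs, std::size_t i) {
    return ptrs[i];
}

// Option 2 (hypothetical helper): a single base pointer; problem i starts
// i * stride elements after the base, so all problems are evenly spaced.
const float* select_by_stride(const float* base, std::size_t stride, std::size_t i) {
    return base + i * stride;
}

// Builds the pointer array that describes the same evenly spaced layout.
std::vector<const float*> pointer_array_from_stride(const float* base,
                                                    std::size_t stride,
                                                    std::size_t count) {
    std::vector<const float*> ptrs(count);
    for (std::size_t i = 0; i < count; ++i) {
        ptrs[i] = base + i * stride;
    }
    return ptrs;
}

// Returns true when both schemes resolve to the same address for every problem.
bool schemes_agree(std::size_t count, std::size_t stride) {
    std::vector<float> storage(count * stride, 0.0f);
    std::vector<const float*> ptrs =
        pointer_array_from_stride(storage.data(), stride, count);
    for (std::size_t i = 0; i < count; ++i) {
        if (select_by_pointer_array(ptrs.data(), i) !=
            select_by_stride(storage.data(), stride, i)) {
            return false;
        }
    }
    return true;
}
```

Any strided layout can be rewritten as a pointer array, but not the other way around: a pointer array can also describe buffers that are not evenly spaced, which a single pointer plus one stride cannot express. That is why I would like to know which representation the API expects.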
