gemmt
Computes a matrix-matrix product with general matrices, but updates only
the upper or lower triangular part of the result matrix.
Description
The
gemmt
routines compute a scalar-matrix-matrix product and add the
result to the upper or lower part of a scalar-matrix product, with
general matrices. The operation is defined as:
where:
- op(X) is one of op(X) =X, or op(X) =XT, or op(X) =XH
- alphaandbetaare scalars
- A,B, andCare matrices
- op(A) isnxk, op(B) iskxn, andCisnxn
gemmt
supports the following precisions:T |
---|
float |
double |
std::complex<float> |
std::complex<double> |
gemmt (Buffer Version)
Syntax
namespace oneapi::mkl::blas::column_major {
void gemmt(sycl::queue &queue,
oneapi::mkl::uplo upper_lower,
oneapi::mkl::transpose transa,
oneapi::mkl::transpose transb,
std::int64_t n,
std::int64_t k,
T alpha,
sycl::buffer<T,1> &a,
std::int64_t lda,
sycl::buffer<T,1> &b,
std::int64_t ldb,
T beta,
sycl::buffer<T,1> &c,
std::int64_t ldc)
}
namespace oneapi::mkl::blas::row_major {
void gemmt(sycl::queue &queue,
oneapi::mkl::uplo upper_lower,
oneapi::mkl::transpose transa,
oneapi::mkl::transpose transb,
std::int64_t n,
std::int64_t k,
T alpha,
sycl::buffer<T,1> &a,
std::int64_t lda,
sycl::buffer<T,1> &b,
std::int64_t ldb,
T beta,
sycl::buffer<T,1> &c,
std::int64_t ldc)
}
Input Parameters
- queue
- The queue where the routine should be executed.
- upper_lower
- Specifies whether matrixCis upper or lower triangular. See Data Types for more details.
- transa
- transb
- n
- Number of rows of matrix op(A) and matrixC. Must be at least zero.
- k
- Number of columns of matrix op(A) and rows of matrix op(B). Must be at least zero.
- alpha
- Scaling factor for matrix-matrix product.
- a
- Buffer holding input matrixA. See Matrix Storage for more details.transa=transpose::nontranstransa=transpose::transortrans=transpose::conjtransColumn majorAisnxkmatrix. Size of arrayamust be at leastlda*kAiskxnmatrix. Size of arrayamust be at leastlda*nRow majorAisnxkmatrix. Size of arrayamust be at leastlda*nAiskxnmatrix. Size of arrayamust be at leastlda*k
- lda
- Leading dimension of matrixA. Must be positive.transa=transpose::nontranstransa=transpose::transortrans=transpose::conjtransColumn majorMust be at leastnMust be at leastkRow majorMust be at leastkMust be at leastn
- b
- Buffer holding input matrixB. See Matrix Storage for more details.transb=transpose::nontranstransb=transpose::transortrans=transpose::conjtransColumn majorBiskxnmatrix. Size of arraybmust be at leastldb*nBisnxkmatrix. Size of arraybmust be at leastldb*kRow majorBiskxnmatrix. Size of arraybmust be at leastldb*kBisnxkmatrix. Size of arraybmust be at leastldb*n
- ldb
- Leading dimension of matrixB. Must be positive.transb=transpose::nontranstransb=transpose::transortrans=transpose::conjtransColumn majorMust be at leastkMust be at leastnRow majorMust be at leastnMust be at leastk
- beta
- Scaling factor for matrixC.
- c
- Buffer holding input/output matrixC. See Matrix Storage for more details.Column majorCismxnmatrix. Size of arraycmust be at leastldc*nRow majorCismxnmatrix. Size of arraycmust be at leastldc*m
- ldc
- Leading dimension of matrixC. Must be positive.Column majorMust be at leastmRow majorMust be at leastn
Output Parameters
- c
- Output buffer overwritten by upper or lower triangular part ofalpha* op(A)*op(B) +beta*C.
If
beta
= 0, matrix C
does not need to be initialized before calling gemmt
. gemmt (USM Version)
Syntax
namespace oneapi::mkl::blas::column_major {
sycl::event gemmt(sycl::queue &queue,
oneapi::mkl::uplo upper_lower,
oneapi::mkl::transpose transa,
oneapi::mkl::transpose transb,
std::int64_t n,
std::int64_t k,
T alpha,
const T* a,
std::int64_t lda,
const T* b,
std::int64_t ldb,
T beta,
T* c,
std::int64_t ldc,
const std::vector<sycl::event> &dependencies = {})
}
namespace oneapi::mkl::blas::row_major {
sycl::event gemmt(sycl::queue &queue,
oneapi::mkl::uplo upper_lower,
oneapi::mkl::transpose transa,
oneapi::mkl::transpose transb,
std::int64_t n,
std::int64_t k,
T alpha,
const T* a,
std::int64_t lda,
const T* b,
std::int64_t ldb,
T beta,
T* c,
std::int64_t ldc,
const std::vector<sycl::event> &dependencies = {})
}
Input Parameters
- queue
- The queue where the routine should be executed.
- upper_lower
- Specifies whether matrixCis upper or lower triangular. See Data Types for more details.
- transa
- transb
- n
- Number of rows of matrix op(A) and matrixC. Must be at least zero.
- k
- Number of columns of matrix op(A) and rows of matrix op(B). Must be at least zero.
- alpha
- Scaling factor for matrix-matrix product.
- a
- Pointer to input matrixA. See Matrix Storage for more details.transa=transpose::nontranstransa=transpose::transortrans=transpose::conjtransColumn majorAisnxkmatrix. Size of arrayamust be at leastlda*kAiskxnmatrix. Size of arrayamust be at leastlda*nRow majorAisnxkmatrix. Size of arrayamust be at leastlda*nAiskxnmatrix. Size of arrayamust be at leastlda*k
- lda
- Leading dimension of matrixA. Must be positive.transa=transpose::nontranstransa=transpose::transortrans=transpose::conjtransColumn majorMust be at leastnMust be at leastkRow majorMust be at leastkMust be at leastn
- b
- Pointer to input matrixB. See Matrix Storage for more details.transb=transpose::nontranstransb=transpose::transortrans=transpose::conjtransColumn majorBiskxnmatrix. Size of arraybmust be at leastldb*nBisnxkmatrix. Size of arraybmust be at leastldb*kRow majorBiskxnmatrix. Size of arraybmust be at leastldb*kBisnxkmatrix. Size of arraybmust be at leastldb*n
- ldb
- Leading dimension of matrixB. Must be positive.transb=transpose::nontranstransb=transpose::transortrans=transpose::conjtransColumn majorMust be at leastkMust be at leastnRow majorMust be at leastnMust be at leastk
- beta
- Scaling factor for matrixC.
- c
- Pointer to input/output matrixC. See Matrix Storage for more details.Column majorCismxnmatrix. Size of arraycmust be at leastldc*nRow majorCismxnmatrix. Size of arraycmust be at leastldc*m
- ldc
- Leading dimension of matrixC. Must be positive.Column majorMust be at leastmRow majorMust be at leastn
- dependencies
- List of events to wait for before starting computation, if any. If omitted, defaults to no dependencies.
Output Parameters
- c
- Pointer to output matrixCoverwritten by upper or lower triangular part ofalpha* op(A)*op(B) +beta*C.
If
beta
= 0, matrix C
does not need to be initialized before calling gemmt
. Return Values
Output event to wait on to ensure computation is complete.