PaStiX Handbook
6.4.0
|
Data Structures | |
struct | symbol_function_s |
Cost functions to compute statistics on the symbolic structure. More... | |
Typedefs | |
typedef struct symbol_function_s | symbol_function_t |
Cost functions to compute statistics on the symbolic structure. | |
Functions | |
static double | sum1d (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum) |
Template function to compute cost on a column-block based approach with a single update per column block. More... | |
static double | sum2d (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum) |
Template function to compute cost on a block-based approach. More... | |
static double | sum2dext (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum, double *blokcost) |
Template function to compute cost on a block-based approach that keeps the cost per block. More... | |
static double | recursive_sum (pastix_int_t a, pastix_int_t b, double(*fval)(const symbol_function_t *, const symbol_matrix_t *, pastix_int_t), const symbol_function_t *fptr, const symbol_matrix_t *symbmtx) |
Recursive function to compute the cost of the full symbolic structure with either sum1d(), sum2d(), or sum2dext(). More... | |
Variables | |
symbol_function_t | flopstable [2][5] |
Array of pointers to the flops functions per factorization and arithmetic. | |
symbol_function_t | perfstable [2][5] |
Array of pointers to the performance functions per factorization and arithmetic. | |
Flops functions set | |
static double | flops_zpotrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_dpotrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_zgetrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_dgetrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_zsytrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_dsytrf_diag (pastix_int_t N) |
Computes the flops of the diagonal blocks. More... | |
static double | flops_zpotrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_dpotrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_zgetrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_dgetrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_zsytrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_dsytrf_trsm (pastix_int_t M, pastix_int_t N) |
Computes the flops of the solve step. More... | |
static double | flops_zpotrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_dpotrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_zgetrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_dgetrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_zsytrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_dsytrf_update (pastix_int_t K, pastix_int_t M) |
Computes the theoretical number of flops of the update step per column block (see sum1d()) More... | |
static double | flops_zpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
static double | flops_dpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
static double | flops_zgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
static double | flops_dgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
static double | flops_zsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
static double | flops_dsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Computes the theoretical number of flops of the update step per block (see sum2d()) More... | |
Performance functions set | |
static double | perfs_zpotrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_dpotrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_zgetrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_dgetrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_zsytrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_dsytrf_diag (pastix_int_t N) |
Time model of the computation of the diagonal block. More... | |
static double | perfs_zpotrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_dpotrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_zgetrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_dgetrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_zsytrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_dsytrf_trsm (pastix_int_t M, pastix_int_t N) |
Time performance model of the solve step. More... | |
static double | perfs_zpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
static double | perfs_dpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
static double | perfs_zgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
static double | perfs_dgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
static double | perfs_zsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
static double | perfs_dsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
Time performance model of the update step per block (see sum2d()) More... | |
struct symbol_function_s |
Cost functions to compute statistics on the symbolic structure.
Definition at line 27 of file symbol_cost.h.
Data Fields | |
double(* | diag )(pastix_int_t) |
double(* | trsm )(pastix_int_t, pastix_int_t) |
double(* | update )(pastix_int_t, pastix_int_t) |
double(* | blkupdate )(pastix_int_t, pastix_int_t, pastix_int_t) |
double(* symbol_function_s::diag) (pastix_int_t) |
Return a statistic based on the diagonal block
Definition at line 28 of file symbol_cost.h.
Referenced by sum1d(), sum2d(), and sum2dext().
double(* symbol_function_s::trsm) (pastix_int_t, pastix_int_t) |
Return a statistic based on the sum of all off-diagonal blocks of each column-block
Definition at line 29 of file symbol_cost.h.
Referenced by sum1d(), sum2d(), and sum2dext().
double(* symbol_function_s::update) (pastix_int_t, pastix_int_t) |
Return a statistic for a large accumulated update per column-block
Definition at line 31 of file symbol_cost.h.
Referenced by sum1d().
double(* symbol_function_s::blkupdate) (pastix_int_t, pastix_int_t, pastix_int_t) |
Return a statistic for each individual off-diagonal block
Definition at line 33 of file symbol_cost.h.
Referenced by sum2d() and sum2dext().
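The four data fields can be pictured as a small table of callbacks. The sketch below reproduces the documented field layout and fills it with hypothetical cost functions (`demo_diag`, `demo_trsm`, `demo_update`, `demo_blkupdate`) that use leading-order textbook flop counts. These are illustrative assumptions, not the formulas used in symbol_cost_flops.c, and `int64_t` merely stands in for `pastix_int_t`.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: int64_t stands in for the real pastix_int_t. */
typedef int64_t pastix_int_t;

/* Field layout as documented for symbol_function_s. */
typedef struct symbol_function_s {
    double (*diag)(pastix_int_t);
    double (*trsm)(pastix_int_t, pastix_int_t);
    double (*update)(pastix_int_t, pastix_int_t);
    double (*blkupdate)(pastix_int_t, pastix_int_t, pastix_int_t);
} symbol_function_t;

/* Hypothetical cost: leading term of a real Cholesky on an N x N block. */
static double demo_diag(pastix_int_t N)
{
    assert(N >= 0);
    return (double)N * (double)N * (double)N / 3.0;
}

/* Hypothetical cost: triangular solve of an M x N panel with an N x N triangle. */
static double demo_trsm(pastix_int_t M, pastix_int_t N)
{
    return (double)M * (double)N * (double)N;
}

/* Hypothetical cost: one accumulated rank-K update of an M x M block (leading term). */
static double demo_update(pastix_int_t K, pastix_int_t M)
{
    return (double)K * (double)M * (double)M;
}

/* Hypothetical cost: real GEMM with an M x K by K x N product. */
static double demo_blkupdate(pastix_int_t M, pastix_int_t N, pastix_int_t K)
{
    return 2.0 * (double)M * (double)N * (double)K;
}

/* One set of callbacks, in the spirit of an entry of flopstable or perfstable. */
static const symbol_function_t demo_flops = {
    demo_diag, demo_trsm, demo_update, demo_blkupdate
};
```

A caller only ever goes through the pointers, for example `demo_flops.blkupdate(2, 3, 4)`, so swapping a flops model for a time model would amount to passing a different `symbol_function_t`.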
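static double sum1d (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum) |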
static |
Template function to compute cost on a column-block based approach with a single update per column block.
[in] | fptr | The structure that contains the cost functions (diag, trsm and update are used) |
[in] | symbmtx | The symbolic matrix structure on which to compute the costs. |
[in] | cblknum | The index of the column-block for which the cost will be computed |
Definition at line 53 of file symbol_cost.c.
References symbol_cblk_s::bloknum, symbol_matrix_s::bloktab, symbol_matrix_s::cblktab, symbol_function_s::diag, symbol_matrix_s::dof, symbol_cblk_s::fcolnum, symbol_blok_s::frownum, symbol_cblk_s::lcolnum, symbol_blok_s::lrownum, pastix_int_t, symbol_function_s::trsm, and symbol_function_s::update.
Referenced by pastixSymbolGetFlops().
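A minimal sketch of the column-block approach described for sum1d() follows. It assumes trimmed-down, hypothetical structures (`demo_blok_t`, `demo_cblk_t`, `demo_matrix_t`, `demo_function_t`) that keep only the fields listed in the references above, assumes the first block of each column-block is its diagonal block, and uses toy cost functions chosen so that the result is easy to check by hand. It is not the code of symbol_cost.c.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: int64_t stands in for pastix_int_t. */
typedef int64_t pastix_int_t;

/* Hypothetical, trimmed-down versions of the referenced structures. */
typedef struct { pastix_int_t frownum, lrownum; } demo_blok_t;
typedef struct { pastix_int_t fcolnum, lcolnum, bloknum; } demo_cblk_t;
typedef struct {
    pastix_int_t       dof;
    const demo_cblk_t *cblktab; /* one extra trailing entry marks the end of bloktab */
    const demo_blok_t *bloktab;
} demo_matrix_t;
typedef struct {
    double (*diag)(pastix_int_t);
    double (*trsm)(pastix_int_t, pastix_int_t);
    double (*update)(pastix_int_t, pastix_int_t);
} demo_function_t;

/* Cost of one column-block: diagonal block, solve step, one accumulated update. */
static double demo_sum1d(const demo_function_t *fptr,
                         const demo_matrix_t   *symbmtx,
                         pastix_int_t           cblknum)
{
    const demo_cblk_t *cblk = symbmtx->cblktab + cblknum;
    pastix_int_t N = cblk->lcolnum - cblk->fcolnum + 1; /* width of the column-block */
    pastix_int_t M = 0;                                 /* total off-diagonal height */
    pastix_int_t k;
    double cost;

    assert(cblknum >= 0);
    for (k = cblk[0].bloknum + 1; k < cblk[1].bloknum; k++) {
        M += symbmtx->bloktab[k].lrownum - symbmtx->bloktab[k].frownum + 1;
    }
    if (symbmtx->dof > 0) {
        N *= symbmtx->dof;
        M *= symbmtx->dof;
    }

    cost = fptr->diag(N);
    if (M > 0) {
        cost += fptr->trsm(M, N);
        cost += fptr->update(N, M);
    }
    return cost;
}

/* Toy cost functions (hypothetical, picked for easy hand checking). */
static double demo_diag(pastix_int_t N) { return (double)N * (double)N; }
static double demo_trsm(pastix_int_t M, pastix_int_t N) { return (double)M * (double)N; }
static double demo_update(pastix_int_t K, pastix_int_t M) { return (double)K * (double)M * (double)M; }
static const demo_function_t demo_fn = { demo_diag, demo_trsm, demo_update };

/* Toy structure: column-block 0 has two off-diagonal blocks, column-block 1 has none. */
static const demo_blok_t demo_bloks[] = { {0, 1}, {2, 3}, {5, 5}, {2, 5} };
static const demo_cblk_t demo_cblks[] = { {0, 1, 0}, {2, 5, 3}, {6, 6, 4} };
static const demo_matrix_t demo_mtx = { 1, demo_cblks, demo_bloks };
```

With this toy data, column-block 0 has N = 2 and M = 3, so `demo_sum1d(&demo_fn, &demo_mtx, 0)` adds 4 (diag), 6 (trsm) and 18 (update).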
static double sum2d (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum) |
static
Template function to compute cost on a block-based approach.
As opposed to sum1d(), the updates are split into one update per off-diagonal block. This makes the evaluation of the performance cost of the GEMMs more precise, for example, as it exactly follows the 1D scheme used in the static scheduler of PaStiX.
[in] | fptr | The structure that contains the cost functions (diag, trsm and blkupdate are used) |
[in] | symbmtx | The symbolic matrix structure on which to compute the costs. |
[in] | cblknum | The index of the column-block for which the cost will be computed |
Definition at line 121 of file symbol_cost.c.
References symbol_function_s::blkupdate, symbol_cblk_s::bloknum, symbol_matrix_s::bloktab, symbol_matrix_s::cblktab, symbol_function_s::diag, symbol_matrix_s::dof, symbol_cblk_s::fcolnum, symbol_blok_s::frownum, symbol_cblk_s::lcolnum, symbol_blok_s::lrownum, pastix_int_t, and symbol_function_s::trsm.
Referenced by pastixSymbolGetFlops().
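The per-block variant can be sketched in the same way. The code below assumes that each off-diagonal block contributes one update whose row count is the height of that block plus all blocks below it. This is one reading of the 1D scheme mentioned above and should be taken as an assumption, as should the hypothetical `demo_*` types and the toy cost functions.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption: int64_t stands in for pastix_int_t. */
typedef int64_t pastix_int_t;

/* Hypothetical, trimmed-down versions of the referenced structures. */
typedef struct { pastix_int_t frownum, lrownum; } demo_blok_t;
typedef struct { pastix_int_t fcolnum, lcolnum, bloknum; } demo_cblk_t;
typedef struct {
    pastix_int_t       dof;
    const demo_cblk_t *cblktab; /* one extra trailing entry marks the end of bloktab */
    const demo_blok_t *bloktab;
} demo_matrix_t;
typedef struct {
    double (*diag)(pastix_int_t);
    double (*trsm)(pastix_int_t, pastix_int_t);
    double (*blkupdate)(pastix_int_t, pastix_int_t, pastix_int_t);
} demo_function_t;

/* Cost of one column-block with one update per off-diagonal block. */
static double demo_sum2d(const demo_function_t *fptr,
                         const demo_matrix_t   *symbmtx,
                         pastix_int_t           cblknum)
{
    const demo_cblk_t *cblk = symbmtx->cblktab + cblknum;
    pastix_int_t dof = (symbmtx->dof > 0) ? symbmtx->dof : 1;
    pastix_int_t K = (cblk->lcolnum - cblk->fcolnum + 1) * dof; /* column-block width */
    pastix_int_t M = 0;                                         /* remaining height   */
    pastix_int_t k, rows;
    double cost;

    assert(cblknum >= 0);
    for (k = cblk[0].bloknum + 1; k < cblk[1].bloknum; k++) {
        M += (symbmtx->bloktab[k].lrownum - symbmtx->bloktab[k].frownum + 1) * dof;
    }

    cost = fptr->diag(K);
    if (M > 0) {
        cost += fptr->trsm(M, K);
    }
    /* One update per off-diagonal block; blocks already handled leave the panel. */
    for (k = cblk[0].bloknum + 1; k < cblk[1].bloknum; k++) {
        rows = (symbmtx->bloktab[k].lrownum - symbmtx->bloktab[k].frownum + 1) * dof;
        cost += fptr->blkupdate(M, rows, K);
        M -= rows;
    }
    return cost;
}

/* Toy cost functions (hypothetical, picked for easy hand checking). */
static double demo_diag(pastix_int_t N) { return (double)N * (double)N; }
static double demo_trsm(pastix_int_t M, pastix_int_t N) { return (double)M * (double)N; }
static double demo_blkupdate(pastix_int_t M, pastix_int_t N, pastix_int_t K)
{
    return 2.0 * (double)M * (double)N * (double)K;
}
static const demo_function_t demo_fn = { demo_diag, demo_trsm, demo_blkupdate };

/* Toy structure: column-block 0 has off-diagonal blocks of height 2 and 1. */
static const demo_blok_t demo_bloks[] = { {0, 1}, {2, 3}, {5, 5}, {2, 5} };
static const demo_cblk_t demo_cblks[] = { {0, 1, 0}, {2, 5, 3}, {6, 6, 4} };
static const demo_matrix_t demo_mtx = { 1, demo_cblks, demo_bloks };
```

For column-block 0 of the toy data, the two updates are `blkupdate(3, 2, 2)` and `blkupdate(1, 1, 2)`, which is where the per-block granularity comes from compared with the single accumulated update of the column-block approach.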
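static double sum2dext (const symbol_function_t *fptr, const symbol_matrix_t *symbmtx, pastix_int_t cblknum, double *blokcost) |
static
Template function to compute cost on a block-based approach that keeps the cost per block.
As opposed to sum2d(), the cost of each update per block is stored in the blokcost array. Apart from this storage, the function is identical to sum2d().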
[in] | fptr | The structure that contains the cost functions (diag, trsm and blkupdate are used) |
[in] | symbmtx | The symbolic matrix structure on which to compute the costs. |
[in] | cblknum | The index of the column-block for which the cost will be computed |
[in,out] | blokcost | An array whose size is the number of blocks in the cblknum column-block, in which the cost of each update is stored per block. |
Definition at line 202 of file symbol_cost.c.
References symbol_function_s::blkupdate, symbol_cblk_s::bloknum, symbol_matrix_s::bloktab, symbol_matrix_s::cblktab, symbol_function_s::diag, symbol_matrix_s::dof, symbol_cblk_s::fcolnum, symbol_blok_s::frownum, symbol_cblk_s::lcolnum, symbol_blok_s::lrownum, pastix_int_t, and symbol_function_s::trsm.
Referenced by pastixSymbolGetTimes().
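static double recursive_sum (pastix_int_t a, pastix_int_t b, double(*fval)(const symbol_function_t *, const symbol_matrix_t *, pastix_int_t), const symbol_function_t *fptr, const symbol_matrix_t *symbmtx) |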
static |
Recursive function to compute the cost of the full symbolic structure with either sum1d(), sum2d(), or sum2dext().
[in] | a | The first column-block index of the range to address |
[in] | b | The last column-block index of the range to address (inclusive) |
[in] | fval | The function to use to compute the cost. This can be sum1d(), sum2d(), or sum2dext() |
[in] | fptr | The set of functions that will be applied. These can be size functions, floating-point operation counts, performance models, etc. |
[in] | symbmtx | The symbol matrix on which to compute the wanted information. |
Definition at line 287 of file symbol_cost.c.
Referenced by pastixSymbolGetFlops().
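The recursion over an inclusive index range [a, b] can be sketched as a pairwise summation: the range is split in two halves until a single index remains, and the per-index costs are added on the way back up. The callback type `demo_cost_fn` and the `ctx` argument are hypothetical simplifications of the fval, fptr and symbmtx parameters. That the split is meant to limit floating-point error growth is an assumption, not something stated in this page.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical callback: cost of one index, given some opaque context. */
typedef double (*demo_cost_fn)(const void *ctx, long index);

/* Sum fval(ctx, i) for i in [a, b] (inclusive) by halving the range. */
static double demo_recursive_sum(long a, long b, demo_cost_fn fval, const void *ctx)
{
    long mid;

    assert(a <= b);
    if (a != b) {
        mid = a + (b - a) / 2;
        return demo_recursive_sum(a, mid, fval, ctx)
             + demo_recursive_sum(mid + 1, b, fval, ctx);
    }
    return fval(ctx, a);
}

/* Toy per-index cost used to exercise the recursion. */
static double demo_square(const void *ctx, long index)
{
    (void)ctx;
    return (double)index * (double)index;
}
```

Calling `demo_recursive_sum(0, 4, demo_square, NULL)` visits the indices 0 to 4 exactly once each.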
static double flops_zpotrf_diag (pastix_int_t N) |
inline static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
Cholesky complex case
Definition at line 40 of file symbol_cost_flops.c.
static double flops_dpotrf_diag (pastix_int_t N) |
inline static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
Cholesky real case
Definition at line 49 of file symbol_cost_flops.c.
static double flops_zgetrf_diag (pastix_int_t N) |
static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
LU complex case
Definition at line 58 of file symbol_cost_flops.c.
static double flops_dgetrf_diag (pastix_int_t N) |
inline static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
LU real case
Definition at line 67 of file symbol_cost_flops.c.
static double flops_zsytrf_diag (pastix_int_t N) |
inline static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
LDL^t complex case
Definition at line 76 of file symbol_cost_flops.c.
static double flops_dsytrf_diag (pastix_int_t N) |
inline static
Computes the flops of the diagonal blocks.
[in] | N | Size of the matrix block |
LDL^t real case
Definition at line 85 of file symbol_cost_flops.c.
static double flops_zpotrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
Cholesky complex case
Definition at line 97 of file symbol_cost_flops.c.
References PastixRight.
static double flops_dpotrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
Cholesky real case
Definition at line 106 of file symbol_cost_flops.c.
References PastixRight.
static double flops_zgetrf_trsm (pastix_int_t M, pastix_int_t N) |
static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LU complex case
Definition at line 115 of file symbol_cost_flops.c.
References PastixRight.
static double flops_dgetrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LU real case
Definition at line 124 of file symbol_cost_flops.c.
References PastixRight.
static double flops_zsytrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LDL^t complex case
Definition at line 133 of file symbol_cost_flops.c.
References PastixRight.
static double flops_dsytrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Computes the flops of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LDL^t real case
Definition at line 142 of file symbol_cost_flops.c.
References PastixRight.
static double flops_zpotrf_update (pastix_int_t K, pastix_int_t M) |
inline static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
Cholesky complex case
Definition at line 154 of file symbol_cost_flops.c.
static double flops_dpotrf_update (pastix_int_t K, pastix_int_t M) |
inline static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
Cholesky real case
Definition at line 163 of file symbol_cost_flops.c.
static double flops_zgetrf_update (pastix_int_t K, pastix_int_t M) |
static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
LU complex case
Definition at line 172 of file symbol_cost_flops.c.
static double flops_dgetrf_update (pastix_int_t K, pastix_int_t M) |
inline static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
LU real case
Definition at line 181 of file symbol_cost_flops.c.
static double flops_zsytrf_update (pastix_int_t K, pastix_int_t M) |
inline static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
LDL^t complex case
Definition at line 190 of file symbol_cost_flops.c.
static double flops_dsytrf_update (pastix_int_t K, pastix_int_t M) |
inline static
Computes the theoretical number of flops of the update step per column block (see sum1d())
[in] | K | Number of columns of A, and rows of B in the GEMM operation |
[in] | M | Size of all other dimensions in the GEMM |
LDL^t real case
Definition at line 199 of file symbol_cost_flops.c.
static double flops_zpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
Cholesky complex case
Definition at line 212 of file symbol_cost_flops.c.
static double flops_dpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
Cholesky real case
Definition at line 222 of file symbol_cost_flops.c.
static double flops_zgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LU complex case
Definition at line 232 of file symbol_cost_flops.c.
static double flops_dgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LU real case
Definition at line 243 of file symbol_cost_flops.c.
static double flops_zsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LDL^t complex case
Definition at line 254 of file symbol_cost_flops.c.
static double flops_dsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Computes the theoretical number of flops of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LDL^t real case
Definition at line 273 of file symbol_cost_flops.c.
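As a point of reference for the M, N and K parameters of the blkupdate functions, the textbook operation count of a plain GEMM update C ← C + AB, with A of size M×K and B of size K×N, is given below. It counts one multiplication and one addition per inner-product term, and six plus two real operations per complex multiply-add. This is the standard count, stated here as an assumption: the formulas in symbol_cost_flops.c may differ, in particular for the LDL^t variants.

```latex
\begin{aligned}
\mathrm{flops}_{\mathrm{real}}(M, N, K)    &= \underbrace{MNK}_{\text{multiplications}} + \underbrace{MNK}_{\text{additions}} = 2\,MNK, \\
\mathrm{flops}_{\mathrm{complex}}(M, N, K) &= 6\,MNK + 2\,MNK = 8\,MNK.
\end{aligned}
```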
static double perfs_zpotrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
Cholesky complex case
Definition at line 40 of file symbol_cost_perfs.c.
Referenced by perfs_zgetrf_diag().
static double perfs_dpotrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
Cholesky real case
Definition at line 51 of file symbol_cost_perfs.c.
Referenced by perfs_dgetrf_diag().
static double perfs_zgetrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
LU complex case
Definition at line 62 of file symbol_cost_perfs.c.
References perfs_zpotrf_diag().
static double perfs_dgetrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
LU real case
Definition at line 72 of file symbol_cost_perfs.c.
References perfs_dpotrf_diag().
static double perfs_zsytrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
LDL^t complex case
Definition at line 82 of file symbol_cost_perfs.c.
static double perfs_dsytrf_diag (pastix_int_t N) |
inline static
Time model of the computation of the diagonal block.
[in] | N | Size of the matrix block |
LDL^t real case
Definition at line 93 of file symbol_cost_perfs.c.
static double perfs_zpotrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
Cholesky complex case
Definition at line 107 of file symbol_cost_perfs.c.
static double perfs_dpotrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
Cholesky real case
Definition at line 118 of file symbol_cost_perfs.c.
static double perfs_zgetrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LU complex case
Definition at line 129 of file symbol_cost_perfs.c.
static double perfs_dgetrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LU real case
Definition at line 140 of file symbol_cost_perfs.c.
static double perfs_zsytrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LDL^t complex case
Definition at line 151 of file symbol_cost_perfs.c.
static double perfs_dsytrf_trsm (pastix_int_t M, pastix_int_t N) |
inline static
Time performance model of the solve step.
[in] | M | Number of rows of the B matrix in the TRSM, and size of the matrix A |
[in] | N | Number of columns of the B matrix in the TRSM |
LDL^t real case
Definition at line 163 of file symbol_cost_perfs.c.
static double perfs_zpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
Cholesky complex case
Definition at line 179 of file symbol_cost_perfs.c.
static double perfs_dpotrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
Cholesky real case
Definition at line 191 of file symbol_cost_perfs.c.
static double perfs_zgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LU complex case
Definition at line 203 of file symbol_cost_perfs.c.
static double perfs_dgetrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LU real case
Definition at line 216 of file symbol_cost_perfs.c.
static double perfs_zsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LDL^t complex case
Definition at line 229 of file symbol_cost_perfs.c.
static double perfs_dsytrf_blkupdate (pastix_int_t M, pastix_int_t N, pastix_int_t K) |
inline static
Time performance model of the update step per block (see sum2d())
[in] | M | Number of rows of the A and C matrices in the GEMM |
[in] | N | Number of columns of the B and C matrices in the GEMM |
[in] | K | Number of columns of the A matrix, and rows of the B matrix in the GEMM |
LDL^t real case
Definition at line 241 of file symbol_cost_perfs.c.
symbol_function_t flopstable[2][5] |
Array of pointers to the flops functions per factorization and arithmetic.
Definition at line 291 of file symbol_cost_flops.c.
Referenced by pastixSymbolGetFlops().
symbol_function_t perfstable[2][5] |
Array of pointers to the performance functions per factorization and arithmetic.
Definition at line 252 of file symbol_cost_perfs.c.
Referenced by pastixSymbolGetTimes().