HPL_pdmxswp - Man Page

swaps and broacast the pivot row.

Synopsis

#include "hpl.h"
 void HPL_pdmxswp( HPL_T_panel * PANEL, const int M, const int II, const int JJ, double * WORK );

Description

HPL_pdmxswp swaps  and  broadcasts  the  absolute value max row using bi-directional exchange.  The buffer is partially set by HPL_dlocmax.
 Bi-directional  exchange  is  used  to  perform  the  swap::broadcast operations  at once  for one column in the panel.  This  results in a lower number of slightly larger  messages than usual.  On P processes and assuming bi-directional links,  the running time of this function can be approximated by

  log_2( P ) * ( lat + ( 2 * N0 + 4 ) / bdwth )
 where  lat and bdwth are the latency and bandwidth of the network for double precision real elements.  Communication  only  occurs  in  one process  column. Mono-directional links  will cause the communication cost to double.

Arguments

PANEL   (local input/output)    HPL_T_panel *

On entry,  PANEL  points to the data structure containing the panel information.

M       (local input)           const int

On entry,  M specifies the local number of rows of the matrix column on which this function operates.

II      (local input)           const int

On entry, II  specifies the row offset where the column to be operated on starts with respect to the panel.

JJ      (local input)           const int

On entry, JJ  specifies the column offset where the column to be operated on starts with respect to the panel.

WORK    (local workspace)       double *

On entry, WORK  is a workarray of size at least 2 * (4+2*N0). It  is assumed that  HPL_dlocmax  was called  prior  to  this routine to  initialize  the first four entries of this array. On exit, the  N0  length max row is stored in WORK[4:4+N0-1]; Note that this is also the  JJth  row  (or column) of L1. The remaining part is used as a temporary array.

See Also

HPL_dlocmax (3), HPL_dlocswpN (3), HPL_dlocswpT (3), HPL_pdpancrN (3), HPL_pdpancrT (3), HPL_pdpanllN (3), HPL_pdpanllT (3), HPL_pdpanrlN (3), HPL_pdpanrlT (3), HPL_pdrpancrN (3), HPL_pdrpancrT (3), HPL_pdrpanllN (3), HPL_pdrpanllT (3), HPL_pdrpanrlN (3), HPL_pdrpanrlT (3), HPL_pdfact (3).

Referenced By

HPL_dlocmax(3), HPL_dlocswpN(3), HPL_dlocswpT(3), HPL_pdfact(3), HPL_pdpancrN(3), HPL_pdpancrT(3), HPL_pdpanllN(3), HPL_pdpanllT(3), HPL_pdpanrlN(3), HPL_pdpanrlT(3), HPL_pdrpancrN(3), HPL_pdrpancrT(3), HPL_pdrpanllN(3), HPL_pdrpanllT(3), HPL_pdrpanrlN(3), HPL_pdrpanrlT(3).

February 24, 2016 HPL 2.2 HPL Library Functions