@@ -43,7 +43,7 @@ memory footprint per MPI task exceeds the amount of main memory
43
43
available to the compiled code. To help the user to make the correct
44
44
decision in terms of resources a simple formula for **Block distribution ** of matrix can be used ::
45
45
46
- sizeof(float_type) *[n * m + (n + m + max(m,n) ) * block + 2 * N *
46
+ sizeof(float_type) *[n * m + (n + m) * block + 2 * N *
47
47
block + 1 + 5*n + 2*pow(n,2) + 2 * nb]/(1024^3) GigaByte
48
48
49
49
where ``n `` and ``m `` are fractions of ``N `` which depend on the size
@@ -61,7 +61,7 @@ For ChASE with **Block-Cyclic distribution** of matrix, addtional memory of
61
61
size ``sizeof(float_type) * N * block `` is required for managing the internal reshuffing
62
62
for block-cyclic data layout. Thus the total memory required is::
63
63
64
- sizeof(float_type) *[n * m + (n + m + max(m,n) ) * block + 3 * N *
64
+ sizeof(float_type) *[n * m + (n + m) * block + 3 * N *
65
65
block + 1 + 5*n + 2*pow(n,2) + 2 * nb]/(1024^3) GigaByte
66
66
67
67
@@ -97,8 +97,8 @@ in which ``${n}`` is the rank of matrix, ``${nev}`` is the number of eigenpairs
97
97
Data Layout: block
98
98
99
99
100
- Main memory usage per MPI-rank: 17.798 GB
101
- Total main memory usage (1824 ranks): 20503.089 GB
100
+ Main memory usage per MPI-rank: 17.546 GB
101
+ Total main memory usage (1824 ranks): 20213.410 GB
102
102
103
103
104
104
Using such a formula one can verify if the allocation of
@@ -136,8 +136,8 @@ the required MPI grid size. Moreover, the flag ``--layout`` should also be expli
136
136
Data Layout: block-cyclic
137
137
138
138
139
- Main memory usage per MPI-rank: 25.844 GB
140
- Total main memory usage (1152 ranks): 29772.803 GB
139
+ Main memory usage per MPI-rank: 25.593 GB
140
+ Total main memory usage (1152 ranks): 29483.125 GB
141
141
142
142
143
143
ChASE with multi-GPUs
0 commit comments