2025-12-09 16:07:23

1. Benchmark Execution Summary

1.1. Session

  • Hostname: gaya

  • User: gaya

  • Time Start: 2025-12-09T16:07:27+01:00

  • Time End: 2025-12-09T17:08:04+01:00

1.2. Cases

  • Total: 58

  • Failures: 7

  • Runs: 1

1.3. Parametrization

Status | Hash | resources.tasks | memory | mesh | discretization | solver | Total Time (s)
🟢 | 167af2e7 | 32 | 50 | M1 | P1 | gamg | 1960.511
🟢 | 176d3fed | 4 | 200 | M1 | P3 | gamg | 918.206
🟢 | 17975aa7 | 64 | 400 | M3 | P1 | gamg | 2446.620
🟢 | 23cd5463 | 128 | 200 | M1 | P2 | gamg | 2389.321
🟢 | 244b6f4f | 16 | 50 | M2 | P1 | gamg | 1936.263
🟢 | 27195a9a | 2 | 50 | M2 | P1 | gamg | 576.929
🔴 | 2e057b04 | 1 | 1200 | M2 | P3 | gamg | 1939.242
🟢 | 2fee09d5 | 64 | 50 | M2 | P1 | gamg | 2272.805
🟢 | 32beb7e3 | 1 | 200 | M2 | P2 | gamg | 1108.374
🟢 | 345604ad | 8 | 200 | M1 | P3 | gamg | 1356.560
🟢 | 39768c0a | 64 | 1200 | M2 | P3 | gamg | 2937.006
🟢 | 3f3de4ff | 8 | 50 | M1 | P1 | gamg | 1155.830
🟢 | 3fb44e10 | 4 | 200 | M1 | P2 | gamg | 808.169
🟢 | 45824f95 | 32 | 200 | M2 | P2 | gamg | 2130.902
🟢 | 4d73eb1b | 128 | 200 | M1 | P3 | gamg | 2425.567
🟢 | 4f0f94c0 | 16 | 200 | M1 | P3 | gamg | 1872.478
🟢 | 5349842d | 384 | 400 | M3 | P1 | gamg | 3523.946
🟢 | 5604fc20 | 256 | 400 | M3 | P1 | gamg | 2667.410
🟢 | 5a008286 | 32 | 1200 | M2 | P3 | gamg | 2429.467
🟢 | 5c01ce2b | 8 | 200 | M1 | P2 | gamg | 1192.065
🟢 | 62f13b88 | 128 | 50 | M1 | P1 | gamg | 2357.955
🟢 | 6311f485 | 32 | 400 | M3 | P1 | gamg | 2316.984
🟢 | 688bd837 | 4 | 50 | M2 | P1 | gamg | 1044.089
🟢 | 6892f9ed | 16 | 50 | M1 | P1 | gamg | 1780.830
🟢 | 68d6fab9 | 64 | 1200 | M3 | P2 | gamg | 3180.852
🟢 | 693cb2e9 | 128 | 400 | M3 | P1 | gamg | 2603.748
🟢 | 69498aae | 4 | 200 | M2 | P2 | gamg | 1356.684
🟢 | 6de50075 | 8 | 200 | M2 | P2 | gamg | 1760.831
🟢 | 6f8fe105 | 2 | 50 | M1 | P1 | gamg | 151.665
🟢 | 7785a035 | 128 | 1200 | M2 | P3 | gamg | 3228.412
🟢 | 7cd50510 | 1 | 200 | M1 | P3 | gamg | 339.805
🟢 | 850da4f0 | 1 | 50 | M1 | P1 | gamg | 55.409
🟢 | 8531d310 | 1 | 50 | M2 | P1 | gamg | 412.940
🟢 | 956d2d83 | 64 | 200 | M1 | P2 | gamg | 2203.876
🔴 | 9656e45f | 2 | 200 | M2 | P2 | gamg | 764.184
🔴 | 98df175a | 4 | 1200 | M2 | P3 | gamg | 1940.857
🟢 | 9c0c73a6 | 128 | 200 | M2 | P2 | gamg | 2493.441
🟢 | a2abc5fb | 32 | 50 | M2 | P1 | gamg | 2099.529
🟢 | a78765f5 | 16 | 200 | M2 | P2 | gamg | 2044.333
🟢 | a9eabded | 4 | 50 | M1 | P1 | gamg | 766.372
🟢 | af27eff5 | 256 | 1200 | M3 | P2 | gamg | 3468.604
🔴 | b02bfe9c | 8 | 1200 | M2 | P3 | gamg | 2173.113
🟢 | b4ac4784 | 1 | 200 | M1 | P2 | gamg | 128.517
🟢 | ba32da44 | 64 | 50 | M1 | P1 | gamg | 2172.588
🔴 | c19ec7ad | 2 | 1200 | M2 | P3 | gamg | 1500.521
🟢 | c8addc44 | 16 | 200 | M1 | P2 | gamg | 1817.054
🟢 | c91ae1d8 | 32 | 1200 | M3 | P2 | gamg | 2853.808
🟢 | ccdebbea | 128 | 50 | M2 | P1 | gamg | 2487.904
🟢 | d0631d6d | 64 | 200 | M1 | P3 | gamg | 2245.684
🟢 | d917cf05 | 128 | 1200 | M3 | P2 | gamg | 3372.610
🟢 | de8f4f1b | 32 | 200 | M1 | P2 | gamg | 1996.744
🔴 | e853ec5e | 2 | 200 | M1 | P3 | gamg | 334.734
🟢 | f1d5f83f | 384 | 1200 | M3 | P2 | gamg | 3634.077
🟢 | f29397bf | 2 | 200 | M1 | P2 | gamg | 224.814
🟢 | fa4f1061 | 64 | 200 | M2 | P2 | gamg | 2316.828
🟢 | fc15b253 | 8 | 50 | M2 | P1 | gamg | 1573.397
🔴 | fd0b630b | 16 | 1200 | M2 | P3 | gamg | 2296.614
🟢 | ff585e9d | 32 | 200 | M1 | P3 | gamg | 2038.535

(🟢 = successful case, 🔴 = failed case)

2. Benchmark: Elliptic linear PDE: Thermal Bridges

2.1. Description

The benchmark known as "thermal bridges" is an application example used to validate numerical simulation tools built with Feel++. We have developed tests based on the ISO 10211:2017 standard (ISO 10211:2017 - Thermal bridges in building construction — Heat flows and surface temperatures — Detailed calculations, 2017), which provides methodologies for evaluating thermal bridges in building construction.

Thermal bridges are areas within a building envelope where the heat flow differs from that of adjacent areas, often resulting in increased heat loss or unwanted condensation. The standard is intended to ensure that thermal bridge simulations are computed accurately. It provides reference values (with tolerances) for the temperature and the heat flux at several locations of the geometry.

At the mathematical level, this application requires computing the numerical solution of an elliptic linear PDE (the heat equation). We employ a finite element method based on continuous Lagrange finite elements of order 1, 2 and 3 (denoted P1, P2 and P3), and we analyze the execution time of the main components of the simulation.
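
For reference, a hedged sketch of the generic problem form in LaTeX notation; the material-wise conductivities and the Dirichlet/Robin boundary data follow the ISO 10211 test-case setup and are not reproduced here:

    % Steady heat conduction on the thermal-bridge domain \Omega with
    % piecewise-constant conductivity k (one value per material):
    \[
      -\nabla \cdot \left( k \,\nabla T \right) = 0 \quad \text{in } \Omega,
      \qquad
      T = T_D \ \text{on } \Gamma_D,
      \qquad
      -k\,\nabla T \cdot n = h \left( T - T_{\mathrm{ext}} \right) \ \text{on } \Gamma_R .
    \]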

Figure 4.3 shows the geometry of this benchmark and the domain decomposition by material.

Figure 1. Figure 4.3: Thermal Bridges benchmark - geometry and materials
Figure 2. Figure 4.3: Thermal Bridges benchmark - geometry and materials

2.2. Benchmarking Tools Used

The benchmark was performed on the gaya supercomputer (see Section 10.1). The performance tools integrated into the Feel++ toolboxes framework were used to measure the execution times. Moreover, we note that we used Apptainer with a Feel++ SIF image based on the Ubuntu Noble OS.

This benchmark was run using feelpp.benchmarking, version 4.0.0.

The measured metrics are the execution times of the main components of the simulation, enumerated below; a short illustrative sketch of aggregating such per-stage timings follows the list:

  • Init: load the mesh from the filesystem and initialize the heat toolbox (finite element context and algebraic data structures)

  • Assembly: compute and assemble the matrix and right-hand-side (RHS) entries obtained from the finite element method

  • Solve: solve the linear system using a preconditioned GMRES method

  • PostProcess: compute the validation measures (temperature at points and heat flux) and export the solution to the filesystem in a visualization format (EnsightGold)
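
The following minimal Python sketch is illustrative only: the dictionary layout is an assumption (not the Feel++ toolboxes timer API) and the timing values are hypothetical placeholders. It aggregates such per-stage times into a total and per-stage fractions:

    # Illustrative only: aggregate per-stage execution times into a total and
    # per-stage fractions. Stage names follow the list above; values are
    # hypothetical placeholders.
    def stage_breakdown(timings):
        """timings: {stage: seconds} -> {stage: fraction of total time}."""
        total = sum(timings.values())
        return {stage: t / total for stage, t in timings.items()}

    timings = {"Init": 10.0, "Assembly": 25.0, "Solve": 60.0, "PostProcess": 5.0}
    print("total (s):", sum(timings.values()))
    for stage, frac in stage_breakdown(timings).items():
        print(f"{stage}: {frac:.1%}")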

2.3. Input/Output Dataset Description

2.3.1. Input Data

  • Meshes: We have generated three levels of mesh, called M1, M2 and M3, stored in Gmsh format; their statistics can be found in Table 4.6. We have also prepared, for each mesh level, a collection of partitioned meshes, using an in-house Feel++ mesh format based on JSON+HDF5 files. The Gmsh meshes and the partitioned meshes are available in our Girder data management platform, in the Feel++ collections.

  • Setup: the standard setup of the Feel++ toolboxes, corresponding to a cfg file and a JSON file. These configuration files are available in the Feel++ GitHub repository.

  • SIF image: feelpp:v0.111.0-preview.10-noble-sif (stored in the GitHub registry of Feel++)

Table 4.6: Mesh statistics and number of degrees of freedom (DoF) per discretization

Tag | # points | # edges | # faces | # elements | # DoF P1 | # DoF P2 | # DoF P3
M1 | 1.94E+05 | 1.30E+06 | 2.46E+06 | 1.06E+06 | 1.94E+05 | 1.49E+06 | 4.96E+06
M2 | 1.40E+06 | 9.78E+06 | 1.66E+07 | 1.66E+07 | 1.40E+06 | 1.12E+07 | 3.75E+07
M3 | 1.06E+07 | 7.53E+07 | 1.29E+08 | 1.29E+08 | 1.06E+07 | 8.59E+07 | 2.90E+08
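
To connect the mesh sizes above with the core counts used in Section 1.3, the following small Python sketch (illustrative only) computes the average number of DoF per MPI rank; the DoF counts are taken from Table 4.6 and the core counts are examples taken from the case list.

    # Illustrative only: average DoF per MPI rank for a few (mesh, order)
    # combinations. DoF counts come from Table 4.6; core counts are examples
    # of the largest runs listed in Section 1.3.
    DOFS = {
        ("M1", "P1"): 1.94e5,
        ("M2", "P2"): 1.12e7,
        ("M3", "P2"): 8.59e7,
    }

    def dofs_per_rank(mesh, order, cores):
        """Average number of degrees of freedom handled by each MPI rank."""
        return DOFS[(mesh, order)] / cores

    for (mesh, order), cores in [(("M1", "P1"), 128), (("M2", "P2"), 128), (("M3", "P2"), 384)]:
        print(f"{mesh}/{order} on {cores} cores: {dofs_per_rank(mesh, order, cores):,.0f} DoF per rank")
    # M1/P1 on 128 cores leaves only about 1,500 DoF per rank, which helps
    # explain the poor strong scaling observed on the coarse mesh (Section 2.4).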

2.3.2. Output Data

The output includes the computed values of the validation measures in CSV format, exported visualization files (mesh, partitioning, temperature), and the time taken to perform each simulation step.

2.4. Results Summary

We start by showing in Figure 4.4 an example of the numerical solution and of the mesh partitioning obtained in the simulation pipeline. The partitioning is considered an offline process here, but it requires non-negligible time and memory; this should be described explicitly in future work. With Figure 4.5, we validated the simulation runs by comparing the computed measures to the reference values.

Figure 3. Figure 4.4: Thermal bridges benchmark - temperature solution
Figure 4. Figure 4.4: Thermal bridges benchmark - partitioning example

The benchmark performance results are summarized in Figure 4.6, Figure 4.7 and Figure 4.8, which correspond to the meshes M1, M2 and M3, respectively. For each mesh, we experimented with several finite element discretizations, called P1, P2 and P3, and for each order of finite element approximation we selected a set of numbers of CPU cores. Concerning the mesh M1, considered a coarse mesh, we note that the scalability is poor, especially at low order. This is simply because the problem is too small for so many HPC resources: MPI communications and I/O effects are non-negligible. For the meshes M2 and M3, the results are better (though not ideal), and the limit of the scalability test is reached quickly. Finally, the finest mesh, M3, shows the best scalability in this benchmarking experiment: computational cost decreases as computational resources increase. However, because the executions are short, the scaling limit is reached quickly.
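
For reference, the strong-scaling quantities underlying these statements are the speedup S(p) = T(p_ref)/T(p) and the parallel efficiency E(p) = S(p)·p_ref/p with respect to a reference core count p_ref. A minimal Python sketch follows; the timings in the example are hypothetical placeholders, not values taken from this report.

    # Illustrative sketch: strong-scaling speedup and parallel efficiency
    # relative to a reference core count p_ref. Timings below are hypothetical.
    def speedup_and_efficiency(times, p_ref=None):
        """times: {cores: wall_time_s} -> {cores: (speedup, efficiency)}."""
        p_ref = p_ref if p_ref is not None else min(times)
        t_ref = times[p_ref]
        return {p: (t_ref / t, (t_ref / t) * p_ref / p) for p, t in sorted(times.items())}

    times = {32: 100.0, 64: 55.0, 128: 32.0}  # hypothetical solve times (s)
    for p, (s, e) in speedup_and_efficiency(times).items():
        print(f"p={p:4d}  speedup={s:5.2f}  efficiency={e:6.1%}")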

With these benchmarking experiments, we also observed some variability in the performance measures. Some aspects, such as filesystem and network load, are not under our control and can explain part of this variability, especially when the local computational time is small.

2.5. Challenges Identified

Several challenges were encountered during the benchmarking process:

  • Memory Usage: Reduce the memory footprint

  • Parallelization Inefficiencies: Understand and improve performance when MPI communication and filesystem I/O become dominant

To conclude, we have carried out HPC performance tests of the benchmark called thermal bridges. We successfully executed several simulations on significant resources and demonstrated the validation of the Feel++ framework in the elliptic PDE context. We also validated the deployment of Feel++ with container support. We now need to provide more refined measurements to detect and analyze the causes of performance degradation, and to compare against other software installations, such as one based on Spack.

2.6. Results

2.6.1. Convergence of validation measures

Heat Flux Convergence

The validation plot for heat flux demonstrates solution convergence across increasing mesh levels (refinement) for different finite element discretizations (P1, P2, P3). Consistency in heat flow values across methods validates the numerical model.
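
The acceptance criterion behind these validation plots can be summarized as a check of each computed measure against its reference value and tolerance from ISO 10211. A minimal Python sketch follows; the numbers in the example are hypothetical placeholders, not the standard's actual reference values.

    # Illustrative only: ISO-10211-style acceptance check of a computed
    # measure against a reference value and tolerance. Placeholder numbers.
    def within_tolerance(computed, reference, tol):
        """True if the computed value lies within +/- tol of the reference."""
        return abs(computed - reference) <= tol

    print(within_tolerance(computed=59.98, reference=60.0, tol=0.1))  # True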

Temperature Validation

2.6.2. P1 Discretization Performance

2.6.3. P2 Discretization Performance

2.6.4. P3 Discretization Performance

2.6.5. Solver Metrics

To understand the parallel scalability, two key solver metrics are presented: the absolute execution time for the algebraic solve step and the number of iterations required by the GMRES solver. Stable or decreasing iteration counts with mesh refinement and strong scaling in solve time are essential indicators of an efficient preconditioner and solver configuration.
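
For context, the gamg/GMRES combination reported above corresponds to a GMRES Krylov solver preconditioned with PETSc's algebraic multigrid (GAMG). The following minimal petsc4py sketch, on a toy 1D Laplacian and not the Feel++ toolbox code path, shows how such a solve is configured and how the iteration count can be retrieved:

    # Minimal petsc4py sketch (toy problem): GMRES preconditioned with GAMG,
    # retrieving the iteration count after the solve. Not the Feel++ code path.
    from petsc4py import PETSc

    n = 1000
    A = PETSc.Mat().createAIJ([n, n], nnz=3)   # 1D Laplacian (tridiagonal)
    for i in range(n):
        A.setValue(i, i, 2.0)
        if i > 0:
            A.setValue(i, i - 1, -1.0)
        if i < n - 1:
            A.setValue(i, i + 1, -1.0)
    A.assemble()

    b = A.createVecRight()
    b.set(1.0)
    x = A.createVecLeft()

    ksp = PETSc.KSP().create()
    ksp.setOperators(A)
    ksp.setType("gmres")          # Krylov method
    ksp.getPC().setType("gamg")   # algebraic multigrid preconditioner
    ksp.setTolerances(rtol=1e-8)
    ksp.solve(b, x)

    print("GMRES iterations:", ksp.getIterationNumber())
    print("converged reason:", ksp.getConvergedReason())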