Memory leak issue running GCHP14.3.1 on AWS #466
Labels
category: Question
Further information is requested
topic: Performance
Related to GCHP model speed and/or memory
Your name
Yanshun Li
Your affiliation
WashU
Please provide a clear and concise description of your question or discussion topic.
Hi Team,
I'm running a GCHP 14.3.1 C180 simulation on AWS and have run into a memory leak. Memory usage climbs until the run aborts, within about 24 hours of model time:
AGCM Date: 2020/12/31 Time: 23:10:00 Throughput(days/day)[Avg Tot Run]: 0.7 0.7 23.7 TimeRemaining(Est) 135:06:43 73.9% : 64.3% Mem Comm:Used
Mem/Swap Used (MB) at MAPL_Cap:TimeLoop= 1.248E+05 0.000E+00
...
AGCM Date: 2021/01/01 Time: 23:00:00 Throughput(days/day)[Avg Tot Run]: 40.6 83.2 137.2 TimeRemaining(Est) 001:46:22 108.3% : 98.1% Mem Comm:Used
Mem/Swap Used (MB) at MAPL_Cap:TimeLoop= 1.848E+05 0.000E+00
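To put a number on the growth, the two excerpts above can be parsed with a short script. This is only a minimal sketch: it assumes each `Mem/Swap Used (MB)` line follows the `AGCM Date` line it belongs to, as in the excerpt, and the helper names `parse_memory` and `growth_mb_per_model_hour` are made up for illustration and are not part of GCHP or MAPL.

```python
import re
from datetime import datetime

# Sample lines copied from the log excerpt above (the elided "..." is omitted).
LOG = """\
AGCM Date: 2020/12/31 Time: 23:10:00 Throughput(days/day)[Avg Tot Run]: 0.7 0.7 23.7 TimeRemaining(Est) 135:06:43 73.9% : 64.3% Mem Comm:Used
Mem/Swap Used (MB) at MAPL_Cap:TimeLoop= 1.248E+05 0.000E+00
AGCM Date: 2021/01/01 Time: 23:00:00 Throughput(days/day)[Avg Tot Run]: 40.6 83.2 137.2 TimeRemaining(Est) 001:46:22 108.3% : 98.1% Mem Comm:Used
Mem/Swap Used (MB) at MAPL_Cap:TimeLoop= 1.848E+05 0.000E+00
"""

DATE_RE = re.compile(r"AGCM Date:\s*(\d{4}/\d{2}/\d{2})\s+Time:\s*(\d{2}:\d{2}:\d{2})")
MEM_RE = re.compile(r"Mem/Swap Used \(MB\) at MAPL_Cap:TimeLoop=\s*(\S+)\s+(\S+)")


def parse_memory(text):
    """Return a list of (model_time, mem_used_mb) pairs (hypothetical helper)."""
    samples = []
    current = None  # model time from the most recent "AGCM Date" line
    for line in text.splitlines():
        m = DATE_RE.search(line)
        if m:
            current = datetime.strptime(
                f"{m.group(1)} {m.group(2)}", "%Y/%m/%d %H:%M:%S"
            )
            continue
        m = MEM_RE.search(line)
        if m and current is not None:
            # First number is memory used (MB); the second is swap used (MB).
            samples.append((current, float(m.group(1))))
    return samples


def growth_mb_per_model_hour(samples):
    """Average memory growth between the first and last sample (hypothetical helper)."""
    (t0, m0), (t1, m1) = samples[0], samples[-1]
    hours = (t1 - t0).total_seconds() / 3600.0
    return (m1 - m0) / hours


samples = parse_memory(LOG)
rate = growth_mb_per_model_hour(samples)
print(f"{rate:.0f} MB per model hour")
```

For the two samples shown, this works out to roughly 2.5 GB of additional memory per simulated hour (60,000 MB over 23 h 50 min of model time). Running the same script over the full log would show whether the growth is steady or jumps at particular times, for example when checkpoints are written.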
I was using hourly anthropogenic emission inventories and had the option to write checkpoints enabled. When I use only monthly emission inventories and turn off checkpoint writing, the simulation can run for one month.
My intuition is that the memory leak is related to netCDF reading and writing, but I had no such issue when running on NASA Pleiades or WashU compute1. I have attached the log from building the GCHP executable to provide more information about the Linux environment.
I would appreciate any suggestions or solutions.
ecbuild.log
Yanshun