Replace Analysis class with JEDI class #2789

DavidNew-NOAA · 2024-07-24T16:50:27Z

Description

This PR replaces the PyGFS Analysis class with a more general JEDI class. It is a child class of the wxflow Task class, like the Analysis, but it's intended for any Global Workflow job that runs one or more JEDI applications. At this point, it has only been applied to the atmospheric variational and ensemble analysis jobs, so the Analysis class still remains part of the code base, but can be fully replaced after this PR using the atmospheric analysis jobs as a template for the other analysis jobs (snow, aerosol, etc).

In order to make things more generic, the input YAML will now always be named after the JEDI application being run. So, for example, if running gdas.x, the name of the input YAML is gdas.yaml, rather than, say, gdas.t12z.atmvar.yaml. In the case of the atmospheric variational analysis, the finalize() method of the AtmAnalysis class will ensure that gdas.yaml is saved as gdas.t12z.atmvar.yaml in the COMROT directory anyway, so no need to give it such a descriptive name in the run directory.

The AtmAnalysis and AtmEnsAnalysis classes, now subclasses of JEDI, now take care of staging observations and bias corrections in their own initialize() methods, rather than in the initialize() method of JEDI, but the JEDI initalize() method continues to render the input Jinja2-YAMLS/JCB templates and link the JEDI executable file. The JEDI class initalize() method now also saves the final YAML in the run directory, rather than having the initalize() method of the AtmAnalysis and AtmEnsAnalysis classes do that (since all JEDI applications require an input YAML file saved to disk.

A generalized execute() method now exists in JEDI which takes the APRUN command as input, and optionally a list of additional arguments that might be passed to gdas.x (eg. ['fv3jedi', 'variational']).

Type of change

Maintenance (code refactor, clean-up, new CI test, etc.)

Change characteristics

Is this a breaking change (a change in existing functionality)? YES
Does this change require a documentation update? NO

How has this been tested?

Build on Hera
Run cycling experiment

Checklist

Any dependent changes have been merged and published
My code follows the style guidelines of this project
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
My changes generate no new warnings
New and existing tests pass with my changes
I have made corresponding changes to the documentation if necessary

… S2SWA (NOAA-EMC#2757)" This reverts commit fc668aa.

…with app S2SWA (NOAA-EMC#2757)"" This reverts commit a3928d2.

This reverts commit 16eaef3.

guillaumevernieres

Thanks for doing this @DavidNew-NOAA . Just a few thoughts/comments.

guillaumevernieres · 2024-07-31T13:06:35Z

ush/python/pygfs/task/jedi.py

+logger = getLogger(__name__.split('.')[-1])
+
+
+class JEDI(Task):


This looks like a class hard-wired for the analysis step, which I think is fine but the name should probably reflect this. JEDIAnalysis? or just keep Analysis?
I think @CoryMartin-NOAA (or maybe that was you @DavidNew-NOAA ?) started a b-matrix base class as well. We probably want to keep these separate.

The intention is for this class to be used anytime a JEDI application is run in general, not just for running an analysis

guillaumevernieres · 2024-07-31T13:13:39Z

ush/python/pygfs/task/jedi.py

+        self.link_jedi_exe()
+
+    @logit(logger)
+    def execute(self, aprun_cmd: str, jedi_args: Optional[List] = None) -> None:


Running only one executable wouldn't work for the "B-matrix" or "prep obs" jobs for example, thus the suggestion above to rename to something descriptive of the analysis job.

guillaumevernieres · 2024-07-31T13:14:33Z

ush/python/pygfs/task/jedi.py

+        save_as_yaml(self.task_config.jedi_config, self.task_config.jedi_yaml)
+        logger.info(f"Wrote YAML to: {self.task_config.jedi_yaml}")
+
+    def link_jedi_exe(self) -> None:


we probably should have this in a common utility module

guillaumevernieres · 2024-07-31T13:15:23Z

ush/python/pygfs/task/jedi.py

+
+
+@logit(logger)
+def find_value_in_nested_dict(nested_dict: Dict, target_key: str) -> Any:


same as above, probably belongs to wxflow or some other common utility module

DavidNew-NOAA · 2024-07-31T13:24:03Z

I'm closing the pull request until I make some modifications. Rahul and I discussed it and agree, based on previous conversations with Dan and Cory, that the JEDI class should not be inherited from the Task class, but rather any child class of Task (eg AtmAnalysis, AtmEnsAnalysis, etc) should have the JEDI class as a member.

DavidNew-NOAA added 17 commits July 18, 2024 13:27

Update GDAS hash

16eaef3

Revert "Address issues in creating XML for GFS forecast-only with app…

a3928d2

… S2SWA (NOAA-EMC#2757)" This reverts commit fc668aa.

Revert "Revert "Address issues in creating XML for GFS forecast-only …

b6c4316

…with app S2SWA (NOAA-EMC#2757)"" This reverts commit a3928d2.

Revert "Update GDAS hash"

0fdd889

This reverts commit 16eaef3.

Initial commit

1146df1

Now do the ensemble

24c75d9

Forgotten changes

1af2d1f

Minor update

945becc

Fixing some bugs

3bb4f64

Forgot to add JEDI class code

3887c65

Fixed bug

5b85c73

Final fixes

754dd8c

Remove FV3 increment child classes

a1ebc3f

Merge branch 'develop' into feature/jedi_class

88a1949

Coding norms

87b9c47

Update comments

638344a

Merge branch 'develop' into feature/jedi_class

4c3b5f7

guillaumevernieres reviewed Jul 31, 2024

View reviewed changes

DavidNew-NOAA closed this Jul 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace Analysis class with JEDI class #2789

Replace Analysis class with JEDI class #2789

DavidNew-NOAA commented Jul 24, 2024 •

edited

Loading

guillaumevernieres left a comment

guillaumevernieres Jul 31, 2024

DavidNew-NOAA Jul 31, 2024

guillaumevernieres Jul 31, 2024

guillaumevernieres Jul 31, 2024

DavidNew-NOAA Jul 31, 2024

guillaumevernieres Jul 31, 2024

DavidNew-NOAA commented Jul 31, 2024

		logger = getLogger(__name__.split('.')[-1])


		class JEDI(Task):



		@logit(logger)
		def find_value_in_nested_dict(nested_dict: Dict, target_key: str) -> Any:

Replace Analysis class with JEDI class #2789

Replace Analysis class with JEDI class #2789

Conversation

DavidNew-NOAA commented Jul 24, 2024 • edited Loading

Description

Type of change

Change characteristics

How has this been tested?

Checklist

guillaumevernieres left a comment

Choose a reason for hiding this comment

guillaumevernieres Jul 31, 2024

Choose a reason for hiding this comment

DavidNew-NOAA Jul 31, 2024

Choose a reason for hiding this comment

guillaumevernieres Jul 31, 2024

Choose a reason for hiding this comment

guillaumevernieres Jul 31, 2024

Choose a reason for hiding this comment

DavidNew-NOAA Jul 31, 2024

Choose a reason for hiding this comment

guillaumevernieres Jul 31, 2024

Choose a reason for hiding this comment

DavidNew-NOAA commented Jul 31, 2024

DavidNew-NOAA commented Jul 24, 2024 •

edited

Loading