-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Refactor Graph de/serialization #2612
base: develop
Are you sure you want to change the base?
Conversation
a5a016e
to
b3117de
Compare
Add a new test suite for graph template loading.
Rewrite `nodeFactory` to reduce cognitive complexity, while preserving the current behavior.
* API Instead of having a single `load` function that exposes in its API some elements only applicable to initializing a graph from a templates, split it into 2 distinct functions: `load` and `initFromTemplate`. Apply those changes to users of the API (UI, CLI), and simplify Graph wrapper classes to better align with those concepts. * Deserialization Reduce the cognitive complexity of the deserizalization process by splitting it into more atomic functions, while maintaining the current behavior.
Extract the logic of importing the content of a graph within a graph instance from the graph loading logic. Add `Graph.importGraphContent` and `Graph.importGraphContentFromFile` methods. Use the deserialization API to load the content in another temporary graph instance, to handle the renaming of nodes using the Graph API, rather than manipulating entries in a raw dictionnary.
…es for unknown File attributes When creating a compatibility description for an unknown attribute, do not consider a link expression as the default value for a File attribute. This is breaking how the link expression solving system works, as it's resetting the attribute to its default value after applying the link. If that expression is kept as the default value, it can be re-evaluated several times incorrectly. Added a test case that was failing before that change to illustrate the issue.
Move Graph.IO internal class to its own module, and rename it to `GraphIO`. This avoid nested classes within the core Graph class, and starts decoupling the management of graph's IO from the logic of the graph itself.
Move the serialization logic to dedicated serializer classes. Implement both `GraphSerializer` and `TemplateGraphSerializer` to cover for the existing serialization use-cases.
Add a new serializer class to manage partial graph serialization logic, ensuring to remove link expressions on attributes refering to nodes that are not in the subset of nodes to serialize.
Only perform uid check when we have both a serialized and a computed UID. If the node has not been serialized with a UID, it means that it does not expect to match a specific value on deserialization.
Re-implement node pasting by relying on the graph partial serializer, to serialize only the subset of selected nodes. On pasting, use standard graph deserialization and import the content of the serialized graph in the active graph instance. Simplify the positioning of pasted nodes to only consider mouse position or center of the graph, which works well for the major variety of use-cases. Compute the offset to apply to imported nodes by using the de-serialized graph content's bounding box.
Add a new method to create a copy of a graph instance, relying on chaining serialization and deserialization operations. Add test suite to validate its behavior, and the underlying serialization processes.
Factorize the logic of replacing a node with another one and re-creating output edges into `Graph.replaceNode` and `Graph._restoreOutEdges`.
At the end of the deserialization process, solve node uid conflicts iteratively by node depths, and only replace the conflicting nodes with a CompatibilityNode. Add new test suite for testing uid conflict handling.
… UidConflict Only set the expectedUid when undoing the upgrade of a uid conflicting node. Otherwise, let the other type of conflicts take precedence.
6477e00
to
d33d0c5
Compare
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #2612 +/- ##
===========================================
+ Coverage 69.85% 73.32% +3.46%
===========================================
Files 120 125 +5
Lines 7067 7449 +382
===========================================
+ Hits 4937 5462 +525
+ Misses 2130 1987 -143 ☔ View full report in Codecov by Sentry. |
d33d0c5
to
dbafe84
Compare
name (str): (optional) the node's name | ||
template (bool): (optional) true if the node is part of a template, false otherwise | ||
uidConflict (bool): (optional) true if a UID conflict has been detected externally on that node | ||
nodeDict: The serialized Node data. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nodeDict has been renamed into nodeData
name: (optional) The node's name. | ||
inTemplate: (optional) True if the node is created as part of a graph template. | ||
expectedUid: (optional) The expected UID of the node within the context of a Graph. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not introduced by the PR, but I think we should remove the "(optional)" from the docstrings as it's already automatically explicit in the generated doc.
meshroom/ui/commands.py
Outdated
@@ -432,11 +432,12 @@ def redoImpl(self): | |||
|
|||
def undoImpl(self): | |||
# delete upgraded node | |||
expectedUid = self.graph.node(self.nodeName)._uid | |||
self.graph.removeNode(self.nodeName) | |||
# recreate compatibility node | |||
with GraphModification(self.graph): | |||
# We come back from an upgrade, so we enforce uidConflict=True as there was a uid conflict before |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fix comment:
# We come back from an upgrade, so we're doing the UID check as there could have been an identifier conflict before the upgrade.
meshroom/core/graph.py
Outdated
self._deserialize(Graph._loadGraphData(filepath)) | ||
|
||
if not isinstance(graphData, dict): | ||
raise RuntimeError('loadGraph error: Graph is not a dict. File: {}'.format(filepath)) | ||
if not publishOutputs: | ||
for node in [node for node in self.nodes if node.nodeType == "Publish"]: | ||
self.removeNode(node.name) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't we use with GraphModification(self):
here?
|
||
for attrName in inputKeys: | ||
attribute = node.attribute(attrName) | ||
# check that attribute is not a link for choice attributes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't understand this comment. Same for internalAttributes. Why are we talking about Choice attributes here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point - I've just moved that template serialization code within this class as-is, but it's true that I did not rework the internals (comments included).
I'll try to understand what it means, improve the comment and write a test around that.
@@ -82,25 +82,18 @@ Item { | |||
|
|||
/// Paste content of clipboard to graph editor and create new node if valid | |||
function pasteNodes() { | |||
var finalPosition = undefined | |||
var centerPosition = false | |||
let finalPosition = undefined; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In many other places, we don't use ";".
Args: | ||
outEdges: a dictionary containing the outgoing edges removed by a call to "removeNode". | ||
{dstAttr.getFullNameToNode(), srcAttr.getFullNameToNode()} | ||
outListAttributes: a dictionary containing the values, indices and keys of attributes that were connected |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not introduced by this PR, but it may be good to create a structured type instead of this dictionnary.
@@ -643,7 +643,7 @@ def _createUniqueNodeName(self, inputName: str, existingNames: Optional[set[str] | |||
def node(self, nodeName): | |||
return self._nodes.get(nodeName) | |||
|
|||
def upgradeNode(self, nodeName): | |||
def upgradeNode(self, nodeName) -> Node: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's much better!
Description
Refactoring graph de/serialization for code-base simplification, fixing issues and better testability.
This PR introduces a major refactoring of the graph serialization/deserialization logic (aka graph IO).
With those changes, we address some of the known issues in this area, most notably when copy/pasting nodes.
Overall, the main concept is that any operation that needs to operate on serialized Graph data uses deserialization first, to work with Graph API, instead of manipulating a raw dictionary of serialized data.
It also creates a clear distinction in the API between the loading of a Meshroom file, the initialization of a Graph from a template and importing the content of another Graph.
Features
Implementation details
The goal of this PR is to move away from a monolothic Graph class that mixes a lot of responsibilities, to make the code more manageable and testable.
NodeFactory
The nodeFactory function has been refactored to rely on an underlying class that manages the compatibility checks. While preserving the existing behavior, this class helps understanding the different steps that occurs during that that process.
Graph
Deserialization
The current implementation exposes a unique entry point, as a single
load
function with several booleans to manage the different cases, to deal with:This makes it complex to manage the possible combinations of options, as all of them are not necessarily meaningful.
In the new approach, those are decoupled into as many entry points as required to handle the distinct logic those operations have:
UID conflict check
The current implementation creates all nodes twice to deal with UID compatibility issues, which increases load times and may keep false-positives depending on the order in which nodes are handled.
In the new approach, we only consider nodes that actually have a UID conflict, and solve them by depth ordering.
Indeed, UID conflicts are contagious: an upstream node can impact the downstream ones, as UID flow through connections.
By replacing upstream nodes first with CompatibilityNodes, we restore their serialized UID, which may solve the "false-positive" conflicts downstream, that were only due to the contagion by this node.
Importing another graph content
This area benefits the most from the refactoring, which solves errors we may currently encounter in the copy-pasting of nodes.
Instead of manipulating raw dictionaries to solve conflicting node names w.r.t the current content of the Graph, it now relies on:
Serialization
Introducing new GraphSerializer classes, following a Template Method pattern.
This makes it easier to define several node serialization logic, while inheriting from the standard serialization structure.