Can causal_forest() be used to explore moderation of an observational relationship with unmeasured confounding? #1474
Labels
question
requires research
An issue that needs additional thought and experimentation before it can be implemented.
Hello, thank you for this great package.
I am planning an exploratory observational study on the relationship between W and Y using intensive longitudinal data. Each participant contributes multiple observations over time, with both W and Y measured repeatedly (data are clustered within participants). We also have multiple situational variables (X) measured at each timepoint.
Our research goal is to understand how the W-Y relationship varies across different situations. We do not have any concrete hypotheses for individual moderators, we are at a quite exploratory point in this line of research. Specifically, we want to:
Robustly identify which situational variables (if any) most strongly moderate the W-Y relationship
Visualize how the strength of the W-Y relationship varies across these situational variables
Account for the nested data structure (observations within participants)
The causal_forest() approach is appealing because it:
Focuses on how one key relationship varies across other variables
Provides variable importance measures for moderators
Handles clustering through the clusters parameter
Offers useful visualization and analysis tools
However, my key concern is that we cannot assume unconfoundedness. There are almost certainly unmeasured variables (e.g., personality traits) that affect both W and Y.
I am wondering if causal_forest() can be validly used to study patterns of moderation in our case? We do not need to make causal claims, we just want to explore the heterogeneity of the W-Y relationship across X.
If not, are there alternative approaches you would recommend that could provide similar insights about moderation while accounting for our clustered data structure?
The text was updated successfully, but these errors were encountered: