-
Notifications
You must be signed in to change notification settings - Fork 254
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Questions about causal forests #1294
Comments
Hi @marclet, yes, a continuous W implies regression_forest estimates a generalized propensity score. If you know what the generalized propensity score is, then sure, you can supply it, though I don't know what it would be in your setting (in a RCT, setting W.hat=p make sense if all units have P[W=1]=p and you know p). Using different X's for Y.hat, W.hat, and causal forest can also be perfectly reasonable, intuitively the first X's are those you think matter for doing a regression adjustment while the X's causal forest use on the residuals are the ones you might think matter for heterogeneity (the beginning of this overview have a brief walkthrough of how you can think of orthogonalization as non-parametric regression adjustment, it's the same partially linear model that is estimated when W is continuous) |
Dear Erik, that's perfectly clear. Thank you very much!! |
Dear Erik/grf Team, my apologies if I bother you again for some remaining doubts regarding the above application:
Thank you very much in advance for your help guys, that's the last time I bother you. |
Hi guys,
and thank you for your amazing work.
I am writing to ask a couple of key questions on the use of causal forests in a setting with panel data, binary outcome, and a continuous, plausibly exogenous treatment. Here they are:
I understand orthogonalization is key. But in my case, the treatment is a natural disaster which is plausibly exogenous and unrelated to unit-specific characteristics. Should I orthogonalize only the outcome in this case, or just put W.hat = 0.5 as it is done with RCT data? If I have to apply orthogonalization despite the plausibile treatment exogeneity, given that the treatment is continuous, would the regression forest provide me with a generalized propensity score à la Hirano-Imbens (2004)?
As mentioned above, I have panel data. However, I am not interested in getting CATEs varying depending on unit and time fixed effects. I only want to 'filter' my outcome and treatment variables from these fixed effects in the orthogonalization step and then work on the residuals. Is there any theoretical reason which should prevent me from using different sets of Xs in the two stages (namely, a larger set including unit and time FEs in orthogonalization, and a smaller one - including only covariates for which I suspect HTE) in the causal forest analysis?
Thank you very much in advance.
The text was updated successfully, but these errors were encountered: