Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about Algorithm 1 in acic18 #1313

Open
RamirezAmayaS opened this issue Jun 21, 2023 · 1 comment
Open

Question about Algorithm 1 in acic18 #1313

RamirezAmayaS opened this issue Jun 21, 2023 · 1 comment
Labels

Comments

@RamirezAmayaS
Copy link

RamirezAmayaS commented Jun 21, 2023

I am revisiting the analysis of Athey and Wager (2019) (experiments/acic18).

In Algorithm 1, why are the pilot forest and the final causal forest trained over the full data? Isn't it relevant for the generalizability of the final predictions that the final causal forest is trained on data that was not used for feature selection in the pilot forest?

@erikcs
Copy link
Member

erikcs commented Jun 27, 2023

The way I read that algorithm is that they use CF purely as an exploratory tool to see if there is treatment effect heterogeneity along “some” observable covariates. If there are many covariates, then doing what they do is just a heuristic to narrow down that set of “some”. What would be problematic would be to do something like that with W.hat and Y.hat if you are not in an RCT: recall CFs can be thought of as a two-step estimator: adjust for confounding by estimating W.hat and Y.hat for orthogonalization, then with these residuals, try to detect HTE along some observable covariates of your choice.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants