Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Follow-up documentatin on TRSO #230

Open
cthoyt opened this issue Aug 19, 2024 · 2 comments
Open

Follow-up documentatin on TRSO #230

cthoyt opened this issue Aug 19, 2024 · 2 comments
Assignees

Comments

@cthoyt
Copy link
Member

cthoyt commented Aug 19, 2024

Follow-up to #149:

Add high level documentation to "transport" module

  1. What problem are we trying to solve here?
  2. What's the difference between surrogate outcomes and transportability?
  3. Real world example
@djinnome
Copy link
Contributor

djinnome commented Aug 19, 2024

  1. The problem we are trying to solve is to identify the causal effect of an intervention on an outcome when we have two datasets that came from the same underlying population. The first is an observational dataset where we have measurements of the intervention variables, the surrogate variables, and the outcome variables where people chose to get the intervention. The second is an experimental dataset that only contains measurements of the intervention and the surrogate variables (but not the outcome variables) where people were randomized to receive (or not receive the intervention). The goal is to estimate the causal effect of the intervention on the outcome using the information contained in both the observational data and the experimental data.

  2. Because the observational and experimental data sets were sampled from the same underlying population, the surrogate outcome problem is not technically a transportability problem, but we can transform the surrogate outcome problem into an equivalent transportability problem so that we can determine when the surrogate outcome problem is identifiable.

  3. A real-world example is where we want to know the causal effect of a vaccine on protection from infection, and all we have is measurements of people who chose to get the vaccine, their antibody levels and whether they subsequently got infected. We also have a dataset from the same population where we randomized on who got the vaccine but only measured their antibodies, not whether they got infected. The question is: can we estimate the vaccine efficacy by combining these two datasets?

@Aryan-Seth
Copy link

Could you highlight how it is different from this paper - https://ftp.cs.ucla.edu/pub/stat_ser/r419.pdf
I understand that it builds off of the idea from this paper, but algorithmically I did not find any major changes. Is it possible to use the TRSO code equivalently for this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants