Structured Tree Graphs
The following is a thread I created in 2020 as a basic introduction to structured tree graphs as a tool for defining causal effects. Note that I have edited the language from the original thread.
Let
A person starts at the center node on the left and proceeds down the branches of the tree until they reach the end bucket.
(
This tree graph is a lot to look at with all the individual labels, so I am going to simplify the graph to only indicate the variables, or columns. But remember that branch splits indicate the different values a variable can take.
The observed structured tree graph consists of individuals with all different trajectories. The black line indicates a single individual and their observed trajectory over time.
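To make the bucket idea concrete, here is a minimal sketch of an observed structured tree graph stored as plain data. The column names (L, A, Y), the binary coding, and the six rows are all assumptions made up for illustration; they are not taken from the original thread. Each row is one person's trajectory, and counting identical rows gives the number of people in each end bucket.

```python
from collections import Counter

# Hypothetical observed data: one tuple per person, columns in time order.
# The names (L, A, Y) and all values are assumptions for illustration only.
COLUMNS = ("L", "A", "Y")
observed = [
    (0, 0, 0),
    (0, 1, 0),
    (1, 1, 1),
    (1, 1, 1),
    (1, 0, 1),
    (0, 0, 0),
]

def bucket_counts(rows):
    """Count how many individuals end in each bucket (full trajectory)."""
    return Counter(rows)

def branch_counts(rows, depth):
    """Count how many individuals pass through each node at a given depth."""
    return Counter(row[:depth] for row in rows)

leaves = bucket_counts(observed)          # end buckets of the tree
first_split = branch_counts(observed, 1)  # people on each branch of the first split
```

Here a single trajectory such as (1, 1, 1) plays the role of the highlighted line in the graph: one path from the starting node to one bucket.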
Causal inference contrasts different branch trajectories based on an intervention for
We can also be interested in time-varying exposures, where both
Note that there are some paths we have no interest in with time-varying exposures!
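As a rough sketch of what ignoring some paths can look like, the snippet below enumerates every exposure path over two time points and keeps only the paths used in one common contrast, always exposed versus never exposed. The two-time-point setup, the binary exposure, and the choice of contrast are assumptions for illustration; other contrasts would keep a different subset of paths.

```python
from itertools import product

# Hypothetical setup: a binary exposure measured at two time points.
N_TIMES = 2

# Every possible exposure path through the tree: (0,0), (0,1), (1,0), (1,1).
all_paths = list(product((0, 1), repeat=N_TIMES))

# Assumed contrast: always exposed versus never exposed.
# Paths where the exposure switches are not part of this comparison.
of_interest = [path for path in all_paths if len(set(path)) == 1]
ignored = [path for path in all_paths if path not in of_interest]
```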
This data structure also has a direct connection to the parametric g-formula. The parametric g-formula uses the observed data to estimate the parameters of outcome models, then simulates individual trajectories for a large sample drawn with replacement. After simulating individuals, we can simply count up the number in each bucket.
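The steps above can be sketched in a few lines for a single time point. This is only an illustration under stated assumptions: the variables (L, A, Y), the data-generating probabilities, and the sample sizes are made up, and the outcome model is a saturated one (one estimated probability per stratum of A and L) standing in for the regression models a real analysis would typically use.

```python
import random
from collections import Counter

rng = random.Random(2020)  # fixed seed so the sketch is reproducible

def bernoulli(p):
    """Draw 1 with probability p, otherwise 0."""
    return 1 if rng.random() < p else 0

# Hypothetical observed data with structure L -> A -> Y (all values made up).
observed = []
for _ in range(2000):
    l = bernoulli(0.5)
    a = bernoulli(0.3 + 0.4 * l)
    y = bernoulli(0.1 + 0.3 * a + 0.3 * l)
    observed.append((l, a, y))

# Step 1: estimate the outcome model P(Y=1 | A, L) from the observed data.
# Assumption: a saturated model, i.e. one parameter per (l, a) stratum.
n_stratum = Counter((l, a) for l, a, _ in observed)
n_events = Counter((l, a) for l, a, y in observed if y == 1)
p_y = {key: n_events[key] / n for key, n in n_stratum.items()}

def simulate(set_a, n_sim=10000):
    """Simulate trajectories under the intervention 'set A to set_a'."""
    buckets = Counter()
    for _ in range(n_sim):
        l, _, _ = rng.choice(observed)   # Step 2: resample with replacement
        y = bernoulli(p_y[(l, set_a)])   # Step 3: draw Y from the fitted model
        buckets[(l, set_a, y)] += 1      # Step 4: place the person in a bucket
    return buckets

def risk(buckets):
    """Proportion of simulated individuals in buckets with Y = 1."""
    total = sum(buckets.values())
    events = sum(n for (_, _, y), n in buckets.items() if y == 1)
    return events / total

treated = simulate(set_a=1)
untreated = simulate(set_a=0)
risk_difference = risk(treated) - risk(untreated)
```

Each simulated person follows only the branches allowed by the intervention, so the bucket counts in `treated` and `untreated` are the counts along the trajectories being contrasted.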