-
Notifications
You must be signed in to change notification settings - Fork 3
02 Quick Preview
## Dataset
Data used in VT are modelized by
-
$Y$ is a binary outcome. In R,$Y$ is afactor
. Second level of this factor will be the desirable event. ($Y=1$ ) -
$T$ is treatment variable,$T=1$ means active treatement,$T=0$ means control treatment. In R,$T$ is numeric. -
$X_i$ is covariables,$X_i$ can be categorical, continous, binary.
NOTE: if you run VT with interactions, categorical covariables must be transformed into binary variables.
Type ?formatRCTDataset
for details.
Related functions/classes in aVirtualTwins package : VT.object()
, vt.data()
, formatRCTDataset
.
VT is a two steps method but with many possibilities
let
let
let
- Grow a random forest with data
$\left{Y, T, X \right}$ . - Grow a random forest with interaction treatement / covariable, i.e.
$\left{Y, T, X, XI(T_i=0), XI(T_i=1)\right}$ - Grow two random forests, one for each treatement:
- The first with data
$\left{Y, X \right}$ where$T_i = 0$ - The second with data
$\left{Y, X \right}$ where$T_i = 1$
- The first with data
- Build your own model
From one of these methods you can estimate
Related functions/classes in aVirtualTwins package : VT.difft()
, vt.forest()
.
Define
- Use regression tree to explain
$Z$ by covariables$X$ . Then subjects with predicted$Z_i$ greater than some threshold$c$ are considered to define a subgroup. - Use classification tree on new variable $Z^{}$ defined by $Z^{}_i=1$ if
$Z_i > c$ and$Z^{*}_i=0$ otherwise.
The idea is to identify which covariable from
Related function in aVirtualTwins package : vt.tree()
.