forked from child-growth/ki-growth-prediction
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathgrowth_prediction_shiny_proposal.Rmd
64 lines (33 loc) · 1.32 KB
/
growth_prediction_shiny_proposal.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
---
title: "R Notebook"
output: html_notebook
---
### "if a hypothetical intervention increased growth by X1 amount at age Y1, it would reduce wasting by X2 amount by age y2"
```{r, include=F}
source(paste0(here::here(), "/0-config.R"))
#Load data
d <- readRDS(paste0(ghapdata_dir, "ki-manuscript-dataset.rds")) %>% filter(whz >= -6 & whz <=6, !is.na(whz), agedays < 731)
dim(d)
```
1. ~30 studies, 61,3695 measures for longitudinal data. Alternatively could use nationally-representative cross-sectional survey data from DHS or MICS (millions of obs)
2. -different nutritional and health contexts. Mean growth varies a lot by country
3. -ages measured at different intervals and frequencies across studies
4. -South Asia overrepresented
5. -different covariates in each study
```{r}
p<-ggplot(d, aes(x=agedays)) + geom_histogram()
print(p)
```
```{r}
p<-ggplot(d, aes(x=agedays)) + facet_wrap(~studyid) + geom_histogram()
print(p)
```
```{r}
p<-ggplot(d, aes(x=agedays, y=whz)) + facet_wrap(~studyid) + #geom_point(alpha=0.1) +
geom_smooth(se=F)
print(p)
```
Shiny app off protected data? Either simple superlearner predictions or a TMLE shift analysis
Weighted SuperLearner?
Considerations in transportability/generalizability?
-Printed red flags based on # of children/studies used in predictions?