-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy path01-introduction.Rmd
executable file
·111 lines (90 loc) · 6.12 KB
/
01-introduction.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
---
#########################################
# options for knitting a single chapter #
#########################################
output:
bookdown::pdf_document2:
template: templates/brief_template.tex
citation_package: biblatex
bookdown::html_document2: default
bookdown::word_document2: default
documentclass: book
bibliography: references.bib
---
```{r echo = FALSE}
options(scipen = 100)
knitr::opts_chunk$set(
echo = FALSE,
warning = FALSE,
message = FALSE,
dpi = 320,
cache = TRUE,
out.width = "95%",
fig.align = 'centre'
)
```
# Introduction
\adjustmtc
\markboth{Introduction}{}
<!-- For PDF output, include these two LaTeX commands after unnumbered chapter headings, otherwise the mini table of contents and the running header will show the previous chapter -->
```{r}
custom_numbers_to_words <- function(number) {
powers <- c(3, 6, 9, 12)
if (number == 0) {
return("0")
}
sign <- ifelse(number < 0, -1, 1)
number <- abs(number)
for (power in powers) {
scaled_number <- number / 10^power
if (scaled_number >= 1 && scaled_number < 1000) {
suffix <- gsub("^one\\s", "", xfun::numbers_to_words(10^power))
return(paste0(sign * scaled_number, " ", suffix))
}
}
return("Number out of range.")
}
esa_plhiv2022 <- 20.8 * 1e6
wca_plhiv2022 <- 4.8 * 1e6
ssa_plhiv2022 <- esa_plhiv2022 + wca_plhiv2022
plhiv2022 <- 39 * 1e6
ssa_pop2022 <- 1.2 * 1e9 #' From the world bank
```
This thesis is about applied and methodological Bayesian statistics.
It is applied and methodological in that the primary concern is real-world questions and the means to answer them.
The statistical approach is Bayesian because probability theory is used to arrive at conclusions based on models for observed data.
The applied focus of this thesis is in obtaining the strategic information needed to plan the response to the HIV (human immunodeficiency virus) epidemic in sub-Saharan Africa (SSA).
Over 40 years since the beginning of the epidemic, HIV is the largest annual cause of disability adjusted life years (DALYs) among non-infants in SSA [@ihme2019; Figure \@ref(fig:gbd)].
Quantification of the epidemic using statistics is a crucial part of the public health response.
Effective implementation of HIV prevention and treatment requires strategic information.
However, producing suitable estimates of relevant indicators is complicated by a range of statistical challenges.
(ref:gbd) HIV is the largest cause of annual DALYs among individuals aged >1 year in SSA [@ihme2019]. One DALY represents the loss of the equivalent of one year of full health, and is calculated by the sum of years of life lost and years lost due to disability. Weights used to account for disability vary between 0 (full health) and 1 (death) depending on the severity of the condition.
```{r gbd, fig.cap="(ref:gbd)"}
knitr::include_graphics("figures/introduction/gbd.png")
```
The data used were gathered in national household surveys or routinely collected from healthcare facilities providing HIV services.
An important feature of these data are the location and time at which observations were recorded.
Spatio-temporal data have important recurring commonalities across a diverse range of application settings.
The work conducted in this thesis uses and aspires to contribute to techniques from spatio-temporal statistics.
Computation is an essential part of modern statistical practice.
Each project in this thesis, and the thesis itself, is accompanied by R [@r] code, hosted on GitHub at [`https://github.com/athowes`](https://github.com/athowes).
To facilitate reproducible research, the R package `orderly` [@orderly] was used to structure code repositories.
## Chapter overview
This thesis is structured as follows:
* Chapter \@ref(hiv-aids) provides an overview of the HIV/AIDS epidemic and describes the challenges faced by surveillance efforts.
* Chapter \@ref(bayes-st) introduces the statistical concepts and notation used throughout the thesis, focusing on Bayesian modelling and computation, spatio-temporal statistics, and survey methods.
* Chapter \@ref(beyond-borders): The prevailing model for spatial structure used in small-area estimation [@besag1991bayesian] was intended to analyse a grid of pixels.
In disease mapping, areas correspond to the administrative divisions of a country, which are typically not a grid.
I used simulation and survey data studies to evaluate the practical consequences of this concern.
* Chapter \@ref(multi-agyw): Adolescent girls and young women are a demographic group at disproportionate risk of HIV infection.
The Global AIDS Strategy recommends prioritising interventions on the basis of behaviour to prevent the most new infections using the limited available resources.
I estimated the size of behavioural risk groups across priority countries to enable implementation of this strategy.
Additionally, I assessed the potential benefits of the strategy in terms of numbers of new infections prevented.
This work [@howes2023spatio] was included in the UNAIDS (Joint United Nations Programme on HIV/AIDS) Global AIDS Update 2022 and 2023.
* Chapter \@ref(naomi-aghq): The Naomi small-area estimation model [@eaton2021naomi] is used by countries to estimate district-level HIV indicators.
First, to allow for compatibility with Naomi, I implemented the integrated nested Laplace approximations using automatic differentiation, opening the door to a new class of fast, flexible, and accurate Bayesian inference algorithms.
The implementation was using models for a clinical trial of an epilepsy drug, and for the prevalence of the parasitic worm Loa loa.
Second, I developed an approximate Bayesian inference method combining adaptive Gauss-Hermite quadrature with principal components analysis.
I applied these methods to data from Malawi, and analysed the consequences of the inference method choice for policy relevant outcomes.
* Chapter \@ref(conclusions): Finally, I discuss contributions of the research, avenues for future work, and some broader reflections.
Though chronological order is recommended, Chapters \@ref(beyond-borders), \@ref(multi-agyw) and \@ref(naomi-aghq) may be read in any order, or as stand-alone studies, if preferred.