OSA residuals #1

Open
catarinawor opened this issue Mar 15, 2024 · 6 comments

@catarinawor commented Mar 15, 2024

Discussion on OSA residuals
Here are a couple more links on OSA residuals:
Slides by Cole Monnahan on using OSA Residuals in stock assessment: Slide 1 (npfmc.org)
An R package for calculating OSA Residuals for compositions: https://github.com/fishfollower/compResidual

Papers:
Trijoulet et al. 2023
Li et al. 2024
Thygesen et al. 2017

@paul-vdb

I added the OSA example I wrote to the repo here: https://github.com/pbs-assess/renewassess/blob/main/code/OSA/OSA_Multinomial.R

Please feel free to edit it if it's wrong!

@seananderson

At nearly the same time(!) I added some notes in https://github.com/pbs-assess/renewassess/blob/main/osa-notes.Rmd on what's happening within the TMB machinery to calculate OSA oneStepGeneric residuals. It's pretty crazy to wrap your head around... data become parameters and you profile over them.
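
For anyone following along, the user-facing entry point for that machinery is TMB's oneStepPredict(). A minimal sketch of the call (the objective obj, the observation name "x", and the keep indicator here are placeholders, not necessarily what the repo's template uses):

```r
library(TMB)

# Sketch only: assumes the C++ template declares DATA_VECTOR(x) and
# DATA_VECTOR_INDICATOR(keep, x), and that obj is the fitted MakeADFun object.
osa <- oneStepPredict(
  obj,
  observation.name    = "x",     # name of the observation object in the template
  data.term.indicator = "keep",  # indicator used to drop observations one at a time
  method              = "oneStepGeneric",
  discrete            = TRUE     # counts are discrete, so the PIT is randomized
)
head(osa$residual)               # ~ N(0, 1) if the model is consistent with the data
```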

@paul-vdb

@seananderson I need to digest that a bit.
How does this sound as a hand-wavy understanding?
For a value x[i], remove x[(i+1):n] from the likelihood contribution and evaluate the likelihood over the domain (-Inf, x[i]], approximating that integral via a spline (or whatever their profile-likelihood machinery uses?). Then it's straightforward to transform the CDF to the standard normal scale, and that's the residual.
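
In code, that hand-wavy version looks something like the sketch below. This is purely the probability-integral-transform idea; F_cond is a stand-in for the conditional CDF under the fitted model, not anything from TMB:

```r
# Sketch of the (randomized) probability integral transform behind OSA residuals.
# F_cond(x, i) is a placeholder for the conditional CDF
# P(X_i <= x | x_1, ..., x_{i-1}) under the fitted model.
osa_residuals <- function(x, F_cond) {
  n <- length(x)
  z <- numeric(n)
  for (i in seq_len(n)) {
    Fx  <- F_cond(x[i], i)      # conditional CDF at the observation
    Fxm <- F_cond(x[i] - 1, i)  # conditional CDF just below it (discrete data)
    u   <- runif(1, Fxm, Fx)    # randomize within the jump of the discrete CDF
    z[i] <- qnorm(u)            # standard normal if the model is right
  }
  z
}
```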

@seananderson

Well, that was an enlightening exercise... I got your R script, @paul-vdb, to match the TMB OSA residuals exactly here. Partly I did the calculations the way TMB does so I could check mine as I went, and that solidified some of my understanding of how the observation likelihoods accumulate as you step through the data. I also edited your original script to vectorize the runif() call and set the seed, to rule that out as the source of any difference. At some point it would be good to write up that matching version more elegantly than how I left it.

@paul-vdb

Thanks @seananderson

@paul-vdb commented Mar 18, 2024

@seananderson I dug deeper. The way I wrote the original OSA was correct. It didn't match under the same set.seed because they (RTMB) simulate the runif values for the full set of observations and then subset, whereas I subset from the outset, so that difference in the length of the runif() call was what made the difference. Thank goodness, because when I went through the maths of your OSA2 file it was exactly the same as mine. Looking at the output from the oneStepPredict function, it was also calculating the same Fx and px values as I was! So in short, the original function is technically correct for the multinomial with no other complexity.
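
A tiny illustration of that seed mismatch (the lengths and indices are just for demonstration; this shows only the same-seed, different-stream-position effect, not the actual oneStepPredict internals):

```r
set.seed(123)
u_full <- runif(10)   # draw for the full set of observations...
u_a <- u_full[6:10]   # ...then subset

set.seed(123)
u_b <- runif(5)       # draw only for the subset directly

identical(u_a, u_b)   # FALSE: same seed, but different positions in the stream
```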

From Equation 8 in Thygesen et al. (2017),
$f_i(y) = \int f_{\bar{x}\, y_1^i}(y_{i-1}, y) \, d\bar{x}$

What makes life easy for the multinomial is that when you write it as conditional binomials, they are independent, and as a result $f(y_{i-1}, y) = f(y_{i-1}) \times f(y)$. Thus all of those nll_up_to_obs terms cancel out and the equation is simply
$F_x = F(y_i) = \text{pbinom}(y_i, n_i, p_i)$
and
$p_x = f(y_i)$.

Their way of calculating z,
u = runif(1)
z = qnorm(Fx - u * px),
is equivalent to how I had it:
z = qnorm(runif(1, F(x - 1), Fx)),
since for a discrete distribution F(x - 1) = Fx - px.
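
Putting that together, here is a minimal sketch of the conditional-binomial calculation for a single multinomial observation (the function and variable names are mine, not taken from OSA_Multinomial.R):

```r
# Sketch: randomized-quantile (OSA) residuals for one multinomial observation,
# written as a chain of conditional binomials. x = observed counts, p = fitted
# proportions. The last category is fully determined, so it gets no residual.
multinomial_osa <- function(x, p) {
  k <- length(x)
  N <- sum(x)
  z <- numeric(k - 1)
  for (i in seq_len(k - 1)) {
    n_i <- N - sum(x[seq_len(i - 1)])  # trials left after earlier categories
    p_i <- p[i] / sum(p[i:k])          # conditional probability of category i
    Fx  <- pbinom(x[i], n_i, p_i)      # F(y_i)
    px  <- dbinom(x[i], n_i, p_i)      # f(y_i)
    u   <- runif(1)
    z[i] <- qnorm(Fx - u * px)         # same as qnorm(runif(1, Fx - px, Fx))
  }
  z
}
```

For example, multinomial_osa(x = c(5, 3, 2), p = c(0.5, 0.3, 0.2)) returns two residuals that should look roughly standard normal if the fitted proportions are adequate.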
