-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Origin/reweighting #185
base: master
Are you sure you want to change the base?
Origin/reweighting #185
Conversation
…tistics by addStat
…for the generated cf objects, and invalidating cf_orig, introduces a new class property cfrw_boot, if this option is on, than resampling the correlation function should not be allowed
It's not totally clear to me how the reweighting is supposed to work here? I had thought that a function like Why is it not allowed to resample a reweighted cf? |
|
fixed most of the check problems. where can I find an example for this? I'm still not convinced all of this is needed...!? |
this is left:
which I don't understand yet. |
also, the data object will mean we can no longer install for R < 3.5.0
|
Yes, the reading is actually quite format dependent. In the beta12 project I analysed just the output of tmLQCD for the reweighting factors: (that looked like the following): 00 00000 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9715302949e+01 In the PLNG project I got the reweighting factor from Marco, entirely different format. |
> fixed most of the check problems.
>
> where can I find an example for this? I'm still not convinced all of this is needed...!?
Yes, the reading is actually quite format dependent. In the beta12 project I analysed just the output of tmLQCD for the reweighting factors: (that looked like the following):
##
00 00000 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9715302949e+01
00 00001 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9523762274e+01
00 00002 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9776317102e+01
00 00003 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9501797443e+01
00 00004 0.163251250000 0.163265000000 0.000000000000 0.000000000000 6.9453382954e+01
##
In the PLNG project I got the reweighting factor from Marco, entirely different format.
I had in mind an example of the whole thing working?
|
I understood this to originate from the fact that the normalisation needs to be recomputed (the average of the weights). In other words, the data and the reweighting factors both need to be resampled consistently and separately, such that for each bootstrap resample, the normalisation and the corresponding reweighted data can be generated. There are of course ways to handle this: reweighted data could be stored unnormalised:
which can be resampled any way one wants. However, when the reweighted data (and resampling thereof) is used, the corresponding normalisations need to be available and correctly applied to the central value and bootstrap samples. In other words, Does the above sound reasonable and describe correctly, why one can't "blindly" resample the reweighted data? |
Let me add another qualifying remark: we also don't deal with just a single reweighting factor, but sequences of factors which move us along in parameter space. For this, some sort of solution was required (such as supporting the multiplication of two sets of reweighting factors to form a third). |
I understood this to originate from the fact that the normalisation needs to be recomputed (the average of the weights). In other words, the data and the reweighting factors *both* need to be resampled consistently and separately, such that for each bootstrap resample, the normalisation and the corresponding reweighted data can be generated.
There are of course ways to handle this: reweighted data could be stored unnormalised:
```
d^{rw}_i = d_i * w_i
```
which can be resampled any way one wants. However, when the reweighted data (and resampling thereof) is used, the corresponding normalisations need to be available and correctly applied to the central value and bootstrap samples. In other words, `w_i` need to be resampled too, giving `boot.R` values for the normalisation factor. The normalisation factor for the central value is of course just `sum_i w_i`.
Does the above sound reasonable and describe correctly, why one can't "blindly" resample the reweighted data?
I fully agree here. So it's merely a safety feature that no one resamples "again"? Because you clearly want to resample to compute statistical errors.
And you want the object, which is the result of what I'd call `bootstrap_and_reweight` or so, to be a `cf` again, because that is convenient. Like it is for principal correlators, correct?
|
> Does the above sound reasonable and describe correctly, why one can't "blindly" resample the reweighted data?
Let me add another qualifying remark: we also don't deal with just a single reweighting factor, but sequences of factors which move us along in parameter space. For this, some sort of solution was required (such as supporting the multiplication of two sets of reweighting factors to form a third).
sure. But reweighting factors are always complex (if not real) valued vectors, aren't they? Of course, I agree, it's better to have the data type properly defined...
|
@pittlerf In other words, is there a rmarkdown file which explains how to use this? Are there some tests? |
Hi @urbach, I uploaded a how-to use cheat sheet in rmarkdown. |
thanks. |
…he gauge configuration indices for the x axis
thanks! |
The comment on |
hmm? |
ah, sorry I will do it now. |
Hi,
I started to add functionality to able to handle reweighting for correlation functions.
I use 'cfrw_boot' label for correlation functions that have been reweighted.
These correlation function should not be resampled, I removed the 'cf_orig' label from them.
Next thing is to make this consistent with the other functions in hadron.