switch callback rootfinding algorithm from `InternalFalsi` to `InternalITP` to Improve robustness #917

DaniGlez · 2023-07-18T16:00:15Z

This PR interleaves the bisection and the regula falsi iterations to be able to handle tricky callbacks without going into maxiters. See discussion in #916

ChrisRackauckas · 2023-07-18T17:15:03Z

As suggested in #916, can you try converting this over to itp and seeing if that solves it as well? That should be a very good way to have this mixture.

https://github.com/SciML/SimpleNonlinearSolve.jl/blob/main/src/itp.jl

DaniGlez · 2023-07-18T17:20:23Z

Sure, I'll take a look later (somehow missed the ITP file when eyeballing the SimpleNonlinearSolver repo)

ChrisRackauckas · 2023-07-18T17:21:45Z

somehow missed the ITP file when eyeballing the SimpleNonlinearSolver repo

It just merged a few minutes ago... 😅

ChrisRackauckas · 2023-07-18T17:27:59Z

If the algorithm is now non-differentiable (i.e. bisection isn't a differentiable algorithm) then the derivative needs to be directly computed. I added the derivative calculation handling.

ChrisRackauckas · 2023-07-18T21:35:35Z

It does look like this passes everything, though I'm curious what exactly the difference is between this an ITP. ITP to me seems like the same basic idea (mix Falsi with bisection) but with provable acceleration. Is there theory behind this version of the scheme?

DaniGlez · 2023-07-18T21:54:09Z

It does look like this passes everything, though I'm curious what exactly the difference is between this an ITP. ITP to me seems like the same basic idea (mix Falsi with bisection) but with provable acceleration. Is there theory behind this version of the scheme?

Not really, although I guess you could prove that it takes < min(2falsi_iters, 2bisection_iters) which gets rid of the worst-case issues. In any case I just swapped to the ITP method with the latest commit.

DaniGlez · 2023-07-18T22:00:42Z

So, the current status of the PR is that both of them are implemented and seem to work (at least for the local tests); I also added a couple of tests, including one for issue #916. There's a commented-out test in test/rootfinder.jl from stuff I lifted from the SimpleNonlinearSolve tests; I'm not sure this kind of derivative is needed for the internal rootfinder, so I need someone with better understanding of the sensitivity stuff to double-check if that's actually needed. I can try to compare performance tomorrow on the downstream callback tests, otherwise I'm totally fine with just using ITP and removing the old patched method. I would also need a second pair of eyes on the tolerance/termination implementation - the ITP from SimpleNonlinearSolve uses an abstol=1e-15 kwarg, but I reduced it to machine epsilon since that's what the callback rootfinder wants to guarantee for t.

DaniGlez · 2023-07-19T10:43:18Z

I ran the downstream callback examples with both methods, measuring the outcome in terms of Number of rootfind condition calls in sol.stats. I'm not sure what to make of them: ITP is 3-4x better on the first one, even on the second one, and 6-10x worse in the last couple ones.

Test	Interleaved	ITP
1. Spring	92417	25001
2. Chain	645	646
3. attactor	4000	43888
4. oscillator	167	1151

oscardssmith · 2023-07-19T12:18:04Z

it might be worth playing around with the itp parameters. I'm not sure if our current ones are good.

oscardssmith · 2023-07-19T12:21:25Z

it would also be good to add bisection as an option since itp is supposed to be ~strictly better than it, and for 1d bounded rootfinding bisection should be taking less than 100 steps.

ChrisRackauckas · 2023-07-19T12:40:33Z

And maybe @yash2798 can play around with some ITP parameters

DaniGlez · 2023-07-19T13:44:33Z

it would also be good to add bisection as an option since itp is supposed to be ~strictly better than it, and for 1d bounded rootfinding bisection should be taking less than 100 steps.

Yup, but just in case there is a misinterpretation: the numbers in each row are aggregated for the whole problem, not normalized per callback (there are likely multiple events within the solution)

oscardssmith · 2023-07-19T13:50:30Z

Ah that makes a lot more sense.

DaniGlez · 2023-07-21T10:58:23Z

Doing some research on the test failures, seems like this ITP version runs into a similar issue as the current Falsi-then-bisection on current master (the DDE test that fails passes with the interleaved version).

DaniGlez · 2023-07-21T12:31:18Z

Doing some research on the test failures, seems like this ITP version runs into a similar issue as the current Falsi-then-bisection on current master (the DDE test that fails passes with the interleaved version).

Found the issue (will fix it here soon): SciML/SimpleNonlinearSolve.jl#72

DaniGlez · 2023-07-21T12:40:58Z

ITP fixed; will re-run the benchmarks later, should be ready to launch the test suite again (in case the sign issue was the only root of the test failures)

DaniGlez · 2023-07-21T16:50:45Z

Alright, so this is what it will look like once I push the commit that handles the DDE test failure by improving the eps-level termination handling (possibly the AD ones?)

Test	Interleaved	Fixed ITP
1. Spring	92417	54610
2. Chain	645	601
3. attactor	4000	1950
4. oscillator	167	159

oscardssmith · 2023-07-21T17:03:35Z

It's a little bit weird that Spring got worse, but other than that, looks good.

DaniGlez · 2023-07-21T17:08:25Z

It's a little bit weird that Spring got worse, but other than that, looks good.

There's a chance the previous result is "wrong" in the sense of taking fewer iterations because some of the crossings were incorrectly solved.

DaniGlez · 2023-07-21T17:11:41Z

With the latest commit, the DDE test that failed should pass. I'm not sure regarding the AD tests (the result was actually fairly close, so maybe they were just one floating point from success?), and I think everything else was due to an Actions outage (except for the formatting which I don't know how to handle, not sure JuliaFormatter is picking up the toml file).

ChrisRackauckas · 2023-07-21T17:54:37Z

This downstream test failure from JumpProcesses seems like a real issue here: https://github.com/SciML/DiffEqBase.jl/actions/runs/5624812899/job/15243159120?pr=917#step:6:633 . Probably a floating point error. Is this returning the left end point by default correctly?

DaniGlez · 2023-07-21T18:43:21Z

Seems to be a numerical issue with loss of relative precision in the regula falsi formula. For example:

left = 5.642777923986901
right = 5.6427779240133455
fl = -4.665215073563809e-11
fr = 1.92987234680276e-16
(fr * left - fl * right) / (fr - fl) # = 5.642777924013346 which is nextfloat(right)

therefore it lands outside the [left, right] interval and screws up the next iteration, where exponentiation fails. Not sure if there is a more stable formula, but I guess we can always project into the interval to prevent this issue.

oscardssmith · 2023-07-21T19:18:15Z

Does reformulating as left+(right-left)*(fl/(fl-fr)) improve the numerics at all? Edit: yes it does. The old formula could only go out of bounds when left and right were very close together, and in this case left-right is computed exactly by my formula.

DaniGlez · 2023-07-21T19:28:19Z

It's one point better for this example (same as right), though I'm not sure if that generalizes. In any case, I think there should be no harm in just pushing it inside like I did (though I put the pushing after truncation and projection, just to be extra safe).

oscardssmith · 2023-07-21T19:41:09Z

With the updated version I am ~90% sure both checks are unnecessary. The "proof" of this is that when it matters, (right-left) is computed exactly, and fl/(fl-fr) is within [0, 1] as long as fl and fr have different signs.

DaniGlez · 2023-07-21T19:45:14Z

IMHO the equality checks are needed anyway; I think you're right about the new formula not being OOB but it should still be possible to match the edge values (I saw that when fixing the tests) - and, in that case, you want to push it inside; otherwise you'll get stuck and maxiter. So if we already have the equality tests, I don't think there's a drawback to them being equality-or-greater tests.

oscardssmith · 2023-07-21T19:55:47Z

I think you don't need the nudge since even if the falsi returns an endpoint, it should get nudged away from the endpoint due to the attraction to the center that happens afterwards, but I've been looking at it less than you so I trust you if you say it's necessary (I'm just trying to save cycles around the edges anyway 😄 )

DaniGlez · 2023-07-21T20:33:37Z

I think you don't need the nudge since even if the falsi returns an endpoint, it should get nudged away from the endpoint due to the attraction to the center that happens afterwards, but I've been looking at it less than you so I trust you if you say it's necessary

That's probably the case most of the time, it's just the edge cases where the attraction does nothing (in the example I was studying there was a 1e-19 or so bump, ie <<< eps(5.0)).

(I'm just trying to save cycles around the edges anyway smile )

Fair enough, but in terms of per-iteration overhead we're already doing a float^float with this algorithm :P

oscardssmith · 2023-07-21T21:02:04Z

yeah that's a good point. I do kind of want us to fix the power to 1.5 or 2 so it turns into a multiplication (or a multiplication and sqrt).

DaniGlez · 2023-07-21T21:40:55Z

Either of them is a valid choice, and FWIW the benchmarks in the ITP paper use 2, so we could indeed roll with that until we have a tuning strategy

ChrisRackauckas · 2023-07-22T12:22:52Z

Awesome! @DaniGlez can you make sure your fixes to ITP also end up in SimpleNonlinearSolve? That's the more externally used case. For dependency reasons we keep the two separate, but we should make sure the SimpleNonlinearSolve ITP is up to snuff. In fact, with this I'd say @yash2798 we should update the docs for interval nonlinear problems to recommend ITP (and tutorials)

DaniGlez · 2023-07-23T00:25:23Z

Absolutely, I'll get to it tomorrow

yash2798 · 2023-07-23T00:32:38Z

Sure, i'll open a pr for updating the docs soon.

DaniGlez added 2 commits July 18, 2023 17:54

Close #916

0093b49

Fix formatting

96900ca

Update internal_falsi.jl

3e68051

Fix naming bug

c3c1c21

Add ITP & tests

8a46691

DaniGlez mentioned this pull request Jul 19, 2023

InternalFalsi (callback rootfinder) fails to solve ill-conditioned problem #916

Closed

DaniGlez marked this pull request as draft July 21, 2023 08:48

Fix InternalITP (see SimpleNonlinearSolve.jl#72)

bcaf126

DaniGlez marked this pull request as ready for review July 21, 2023 12:39

Improve ITP epsilon-level handling

c096b64

Add numerical error safeguards in the ITP algo

a2fa3dc

Improve falsi formula and bounds pushing

21c96c0

ChrisRackauckas merged commit 035395c into SciML:master Jul 22, 2023

DaniGlez mentioned this pull request Jul 23, 2023

Backport ITP bugfixes from DiffEqBase in high precision contexts SciML/SimpleNonlinearSolve.jl#74

Merged

ChrisRackauckas mentioned this pull request Aug 8, 2023

explicit RK docstrings SciML/OrdinaryDiffEq.jl#1866

Merged

oscardssmith changed the title ~~Improve robustness of the InternalFalsi solver~~ switch callback rootfinding algorithm from InternalFalsi to InternalITP to Improve robustness Nov 20, 2023

DaniGlez mentioned this pull request Dec 16, 2023

ITP default values differ from Tuning Parameters SciML/SimpleNonlinearSolve.jl#108

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

switch callback rootfinding algorithm from `InternalFalsi` to `InternalITP` to Improve robustness #917

switch callback rootfinding algorithm from `InternalFalsi` to `InternalITP` to Improve robustness #917

DaniGlez commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

DaniGlez commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

DaniGlez commented Jul 18, 2023

DaniGlez commented Jul 18, 2023 •

edited

Loading

DaniGlez commented Jul 19, 2023 •

edited

Loading

oscardssmith commented Jul 19, 2023

oscardssmith commented Jul 19, 2023

ChrisRackauckas commented Jul 19, 2023 •

edited

Loading

DaniGlez commented Jul 19, 2023

oscardssmith commented Jul 19, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

ChrisRackauckas commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023 •

edited

Loading

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

ChrisRackauckas commented Jul 22, 2023

DaniGlez commented Jul 23, 2023

yash2798 commented Jul 23, 2023

switch callback rootfinding algorithm from InternalFalsi to InternalITP to Improve robustness #917

switch callback rootfinding algorithm from InternalFalsi to InternalITP to Improve robustness #917

Conversation

DaniGlez commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

DaniGlez commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

ChrisRackauckas commented Jul 18, 2023

DaniGlez commented Jul 18, 2023

DaniGlez commented Jul 18, 2023 • edited Loading

DaniGlez commented Jul 19, 2023 • edited Loading

oscardssmith commented Jul 19, 2023

oscardssmith commented Jul 19, 2023

ChrisRackauckas commented Jul 19, 2023 • edited Loading

DaniGlez commented Jul 19, 2023

oscardssmith commented Jul 19, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

ChrisRackauckas commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023 • edited Loading

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

oscardssmith commented Jul 21, 2023

DaniGlez commented Jul 21, 2023

ChrisRackauckas commented Jul 22, 2023

DaniGlez commented Jul 23, 2023

yash2798 commented Jul 23, 2023

switch callback rootfinding algorithm from `InternalFalsi` to `InternalITP` to Improve robustness #917

switch callback rootfinding algorithm from `InternalFalsi` to `InternalITP` to Improve robustness #917

DaniGlez commented Jul 18, 2023 •

edited

Loading

DaniGlez commented Jul 19, 2023 •

edited

Loading

ChrisRackauckas commented Jul 19, 2023 •

edited

Loading

oscardssmith commented Jul 21, 2023 •

edited

Loading