
Add qPosteriorStandardDeviation acquisition function #2634

Closed
wants to merge 14 commits

Conversation

slishak-PX
Contributor

@slishak-PX slishak-PX commented Nov 20, 2024

Motivation

This is a small collection of changes to improve support for optimisation with deterministic (posterior mean) and pure-exploration (posterior standard deviation) acquisition functions:

  1. Using PosteriorMeanModel with optimize_acqf is currently not supported as PosteriorMeanModel does not implement num_outputs or batch_shape.
  2. The PosteriorStandardDeviation acquisition function has no MC equivalent.

This PR addresses the points above and, consequently, also adds support for a constrained PSTD acquisition function.

Have you read the Contributing Guidelines on pull requests?

Yes

Test Plan

TODO: just submitting a draft for now, for discussion.

Related PRs

(If this PR adds or changes functionality, please take some time to update the docs at https://github.com/pytorch/botorch, and link to your PR here.)

@facebook-github-bot added the CLA Signed label Nov 20, 2024
@Balandat
Contributor

The PosteriorMean and PosteriorStandardDeviation acquisition functions have no MC equivalent.

So qSimpleRegret for q=1 is the MC-equivalent for PosteriorMean - can you just use that instead?
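
For reference, a minimal usage sketch of that suggestion (a toy model is constructed here just to make the snippet self-contained; not taken from this PR):

import torch
from botorch.acquisition import qSimpleRegret
from botorch.models import SingleTaskGP
from botorch.sampling import SobolQMCNormalSampler

# Toy data and model, purely for illustration
train_X = torch.rand(8, 2, dtype=torch.float64)
train_Y = torch.rand(8, 1, dtype=torch.float64)
model = SingleTaskGP(train_X, train_Y)

# With q = 1, qSimpleRegret averages posterior samples of the objective,
# i.e. it is an MC estimate of the posterior mean
acqf = qSimpleRegret(model, sampler=SobolQMCNormalSampler(sample_shape=torch.Size([128])))
val = acqf(torch.rand(1, 1, 2, dtype=torch.float64))  # input shape: batch x q x d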

Using PosteriorMeanModel with optimize_acqf is currently not supported as PosteriorMeanModel does not implement num_outputs or batch_shape.

Adding these properties to DeterministicModel makes sense to me

@slishak-PX
Contributor Author

slishak-PX commented Nov 20, 2024

So qSimpleRegret for q=1 is the MC-equivalent for PosteriorMean - can you just use that instead?

Sorry, missed that, you're right. Will remove qPosteriorMean, but qPosteriorStandardDeviation could still be helpful (e.g. for constrained active learning).

Adding these properties to DeterministicModel makes sense to me

DeterministicModel is still an abstract class, so I don't know how we'd set num_outputs there. GenericDeterministicModel handles it by requiring it to be passed to __init__, and AffineDeterministicModel computes it based on the weights, but PosteriorMeanModel (and FixedSingleSampleModel) can just grab it from the base model.
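
For illustration, a minimal sketch of what forwarding these properties from the wrapped model could look like (class and attribute names mirror PosteriorMeanModel, but this is not necessarily the PR's exact diff):

import torch


class WrappedModelPropertiesSketch:
    """Illustrative only: forwards shape metadata from the wrapped model."""

    def __init__(self, model):
        self.model = model

    @property
    def num_outputs(self) -> int:
        # delegate to the wrapped model
        return self.model.num_outputs

    @property
    def batch_shape(self) -> torch.Size:
        # delegate to the wrapped model
        return self.model.batch_shape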

@slishak-PX slishak-PX changed the title Fix PosteriorMeanModel and add qPosteriorMean/qPosteriorStandardDeviation acquisition functions Fix PosteriorMeanModel and add qPosteriorStandardDeviation acquisition function Nov 20, 2024
Comment on lines 954 to 955
mean = obj.mean(dim=0)
return (obj - mean).abs()
Contributor

This is not the empirical std. The sample_reduction is torch.mean() by default. If you do this then you'll get the posterior variance (without Bessel correction):

Suggested change:

- mean = obj.mean(dim=0)
- return (obj - mean).abs()
+ mean = obj.mean(dim=0)
+ return (obj - mean)**2

But if you actually want the standard deviation, you could pass a custom sample_reduction to the superclass init (this applies the Bessel correction, but that's probably not all that relevant if the number of samples is reasonably large):

def std_reduction(samples: Tensor, dim: int) -> Tensor:
    # `samples` here are the squared deviations; rescale by N / (N - 1) for the Bessel correction
    N = samples.shape[0]
    return torch.mean(N / (N - 1) * samples, dim=dim).sqrt()
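
For context, a self-contained sketch of plugging such a reduction into the SampleReducingMCAcquisitionFunction superclass (the class name and constructor signature here are illustrative, not the PR's implementation; a single sample dimension is assumed):

import torch
from torch import Tensor
from botorch.acquisition.monte_carlo import SampleReducingMCAcquisitionFunction


def std_reduction(samples: Tensor, dim) -> Tensor:
    # `samples` are squared deviations; apply the Bessel correction and take the root
    n = samples.shape[0]
    return torch.mean(n / (n - 1) * samples, dim=dim).sqrt()


class qStdSketch(SampleReducingMCAcquisitionFunction):
    # Hypothetical acquisition function using std_reduction as the sample_reduction
    def __init__(self, model, sampler=None, objective=None):
        super().__init__(
            model=model,
            sampler=sampler,
            objective=objective,
            sample_reduction=std_reduction,
        )

    def _sample_forward(self, obj: Tensor) -> Tensor:
        # squared deviation of each MC sample from the per-point sample mean
        return (obj - obj.mean(dim=0)) ** 2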

Contributor Author

@slishak-PX slishak-PX Nov 29, 2024

Thanks, I also just noticed the error and corrected it using the same estimation method as in qUCB, but changing the sample_reduction is a much more sensible approach!

Contributor Author

I think this results in different, and probably unwanted, behaviour when using constraints. The sample_reduction is applied after the constraints are applied, resulting in the feasibility indicator being square rooted. See the following comparison (green is with the std_reduction applied, yellow is the currently committed method):

(image: comparison of acquisition values; green is with std_reduction applied, yellow is the currently committed method)
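
A toy numerical illustration of the effect (simplified: assume the per-sample squared deviations are weighted by a smoothed feasibility indicator before the sample reduction, which is roughly what happens when the sqrt lives inside the sample_reduction):

import torch

torch.manual_seed(0)
sq_dev = torch.rand(64)    # squared deviations of 64 MC samples at one point
feas = torch.tensor(0.25)  # smoothed feasibility indicator at that point

sqrt_inside = (feas * sq_dev).mean().sqrt()  # std_reduction: indicator gets square-rooted
linear_feas = feas * sq_dev.mean().sqrt()    # committed method (roughly): indicator scales linearly
print(sqrt_inside, linear_feas)              # sqrt_inside / linear_feas == 1 / sqrt(feas)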

@slishak-PX
Contributor Author

At this point the PR can support constrained AL (I want to minimise uncertainty in the feasible region).

For a simple example, see the image below: the task is to minimise uncertainty in output 1, but only where output 2 is negative. The blue dashed line (PSTD) is analytical, green solid line is the MC equivalent (with no Bessel correction - will add this, thanks @Balandat), and orange solid line is the constrained PSTD.

(image: posterior of outputs 1 and 2 with training data, and PSTD / qPSTD / constrained qPSTD acquisition values)

I haven't closed the loop and run an actual optimisation yet; interested to see how it will perform.

Code used for testing is below:

Code

(Sorry the plotting code is messy!)

import torch
from botorch import acquisition
from botorch.fit import fit_gpytorch_mll
from botorch.models import SingleTaskGP
from gpytorch.mlls import ExactMarginalLogLikelihood
from botorch.sampling import SobolQMCNormalSampler
from botorch.acquisition.objective import ScalarizedPosteriorTransform, LinearMCObjective

from plotly.subplots import make_subplots

n_train = 10
device = torch.device("cuda:1")

torch.manual_seed(3)
train_x = torch.rand(n_train, 1, dtype=torch.float64, device=device)
train_y = torch.randn(n_train, 2, dtype=torch.float64, device=device)

model = SingleTaskGP(
    train_x,
    train_y,
)
mll = ExactMarginalLogLikelihood(model.likelihood, model)
_ = fit_gpytorch_mll(mll)

sampler = SobolQMCNormalSampler(torch.Size([64]))

# Objective weights
w = torch.tensor([1, 0], dtype=torch.float64, device=device)

pstd = acquisition.PosteriorStandardDeviation(
    model, 
    posterior_transform=ScalarizedPosteriorTransform(w),
)
qpstd = acquisition.qPosteriorStandardDeviation(
    model, 
    sampler=sampler, 
    objective=LinearMCObjective(w),
)
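# Constraint callables follow the MC acquisition convention that negative
# values imply feasibility, so this enforces output 2 <= 0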
qpstd_constr = acquisition.qPosteriorStandardDeviation(
    model, 
    sampler=sampler, 
    objective=LinearMCObjective(w), 
    constraints=[lambda samples: samples[..., 1]],
)

x = torch.linspace(0, 1, 100, device=device)
std = pstd(x[:, None, None])
qstd = qpstd(x[:, None, None])
qstd_c = qpstd_constr(x[:, None, None])

post = model.posterior(x[:, None, None])

x_np = x.cpu().detach().numpy()

fig = make_subplots(rows=3, cols=1, shared_xaxes=True)

# Rows 1-2: posterior mean, +/- 1 std band, and training data for each output
for i in [0, 1]:
    mean_i = post.mean[:, 0, i].cpu().detach().numpy()
    std_i = (post.variance[:, 0, i] ** 0.5).cpu().detach().numpy()
    fig.add_scatter(x=x_np, y=mean_i, mode="lines", line_color="black", row=i + 1, col=1, showlegend=False)
    fig.add_scatter(x=x_np, y=mean_i + std_i, line_color="grey", mode="lines", showlegend=False, row=i + 1, col=1)
    fig.add_scatter(x=x_np, y=mean_i - std_i, line_color="grey", mode="lines", showlegend=False, fill="tonexty", row=i + 1, col=1)
    fig.add_scatter(x=train_x[:, 0].cpu().detach().numpy(), y=train_y[:, i].cpu().detach().numpy(), mode="markers", marker_color="red", row=i + 1, col=1, showlegend=False)

fig.add_hline(y=0, row=2, line_dash="dash")

# Row 3: acquisition values
fig.add_scatter(x=x_np, y=qstd.cpu().detach().numpy(), row=3, col=1, name="qPSTD", line_color="orange")
fig.add_scatter(x=x_np, y=qstd_c.cpu().detach().numpy(), row=3, col=1, name="qPSTD (constrained y2<0)", line_color="green")
fig.add_scatter(x=x_np, y=std.cpu().detach().numpy(), row=3, col=1, name="PSTD", line_dash="dash", line_color="blue")

fig.update_yaxes(row=1, title_text="Output 1")
fig.update_yaxes(row=2, title_text="Output 2")
fig.update_yaxes(row=3, title_text="Acquisition value")
fig.update_xaxes(row=3, title_text="x")
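
To close the loop as mentioned above, something along these lines could be used with the qpstd_constr defined in the code (a sketch only, with illustrative optimizer settings; not run as part of this PR):

from botorch.optim import optimize_acqf

bounds = torch.tensor([[0.0], [1.0]], dtype=torch.float64, device=device)
candidate, acq_value = optimize_acqf(
    acq_function=qpstd_constr,
    bounds=bounds,
    q=1,
    num_restarts=10,
    raw_samples=128,
)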

@slishak-PX slishak-PX marked this pull request as ready for review December 3, 2024 17:09
@slishak-PX
Copy link
Contributor Author

Added unit tests, copied from the rest of the MC acquisition function tests.

This one seems incomplete as it relies on the posterior samples having some variance in order to return nonzero acquisition values, but MockPosterior seems like it's only set up to repeatedly return the same sample.

@Balandat
Contributor

Balandat commented Dec 8, 2024

This one seems incomplete as it relies on the posterior samples having some variance in order to return nonzero acquisition values, but MockPosterior seems like it's only set up to repeatedly return the same sample.

Yeah, it's quite possible that MockPosterior does not support everything we'd like it to at this point; I don't think we've touched that in a while. I think it should be pretty straightforward to modify this line here to only apply the expansion if self._samples is not already of the appropriate size; that would allow you to return different sample values.
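
A hypothetical sketch of that change (the actual MockPosterior internals in botorch/utils/testing.py may differ; the _samples attribute, base_sample_shape, and the expansion line are assumed from the discussion here):

def rsample(self, sample_shape=None):
    if sample_shape is None:
        sample_shape = torch.Size()
    extended_shape = sample_shape + self.base_sample_shape
    if self._samples.shape == extended_shape:
        # samples already have the appropriate size: return them as-is,
        # so tests can provide distinct values per sample
        return self._samples
    return self._samples.expand(extended_shape)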


codecov bot commented Dec 8, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.98%. Comparing base (851df1f) to head (3dbbe64).
Report is 7 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff            @@
##             main    #2634    +/-   ##
========================================
  Coverage   99.98%   99.98%            
========================================
  Files         200      202     +2     
  Lines       18365    18602   +237     
========================================
+ Hits        18363    18600   +237     
  Misses          2        2            


@slishak-PX
Contributor Author

slishak-PX commented Jan 7, 2025

Thanks for the pointer! Unfortunately it wasn't quite as simple as modifying that one line: the base_shape also needs to be provided so that MockPosterior.base_sample_shape returns the correct shape.

The other issue is that the reparameterisation trick for qUCB (which I'm using for qPSTD) only really makes sense for Gaussian posteriors; if a set of samples is manually constructed, then the standard deviation estimate is inaccurate. This might merit further thought, because although the UCB family of acquisition functions is most naturally suited to Gaussian posteriors (to my knowledge), it might be valuable for PSTD to also work for non-Gaussian posteriors.
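
As a concrete illustration of the qUCB-style estimator mentioned above (a standalone sketch, not the PR code): for a Gaussian sample Y, E|Y - mean| = sigma * sqrt(2/pi), so scaling the mean absolute deviation by sqrt(pi/2) recovers sigma; for non-Gaussian posteriors that scaling no longer holds.

import math
import torch

torch.manual_seed(0)
y = 3.0 + 2.0 * torch.randn(100_000)  # Gaussian samples with sigma = 2

# sqrt(pi/2) * mean absolute deviation is an estimator of sigma for Gaussian samples
mad_est = math.sqrt(math.pi / 2) * (y - y.mean()).abs().mean()
print(mad_est)  # close to 2.0, but biased for non-Gaussian distributions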

I've sampled from torch.randn in test_q_pstd, and am using self.assertAllClose to check that the estimated std is close to (within 2% of) the empirical uncorrected value from torch.std.

I haven't edited test_q_pstd_batch yet because for $q > 1$ it's not straightforward to give a reference value for qPSTD, so I've left it with the repeated samples (and zero acquisition value).

Contributor

@Balandat Balandat left a comment

Overall this lgtm. There are a couple of lines exposing new properties that aren't covered by the unit tests; could you please add those to the existing tests to get coverage up to 100%?
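
For illustration, a hypothetical sketch of the kind of property coverage being requested (data and assertions are made up here, not the PR's actual test code):

import torch
from botorch.models import SingleTaskGP
from botorch.models.deterministic import PosteriorMeanModel

gp = SingleTaskGP(
    torch.rand(8, 2, dtype=torch.float64),
    torch.rand(8, 1, dtype=torch.float64),
)
mean_model = PosteriorMeanModel(model=gp)

# The new properties should simply mirror the wrapped model's metadata
assert mean_model.num_outputs == gp.num_outputs
assert mean_model.batch_shape == gp.batch_shape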

botorch/utils/testing.py (review thread, outdated and resolved)
@slishak-PX
Contributor Author

Thanks @Balandat, should be all sorted now!

@Balandat Balandat changed the title Fix PosteriorMeanModel and add qPosteriorStandardDeviation acquisition function Add qPosteriorStandardDeviation acquisition function Jan 27, 2025
@facebook-github-bot
Contributor

@Balandat has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@Balandat
Contributor

@slishak-PX did you test this on a GPU? I'm running into some test failures of the following nature (could just be numerical differences on GPU vs CPU):

pytorch.botorch.test.test_cuda.TestBotorchCUDA
test: test_q_pstd (acquisition.test_monte_carlo.TestQPosteriorStandardDeviation)
error: Traceback (most recent call last):
  File "/data/users/balandat/fbsource/buck-out/v2/gen/fbcode/5973496158f44142/pytorch/botorch/__test_acquisition_cuda__/test_acquisition_cuda#link-tree/pytorch/botorch/test/acquisition/test_monte_carlo.py", line 1031, in test_q_pstd
    self.assertAllClose(res.item(), std, rtol=0.02, atol=0)
  File "/data/users/balandat/fbsource/buck-out/v2/gen/fbcode/5973496158f44142/pytorch/botorch/__test_acquisition_cuda__/test_acquisition_cuda#link-tree/botorch/utils/testing.py", line 109, in assertAllClose
    torch.testing.assert_close(
  File "/data/users/balandat/fbsource/buck-out/v2/gen/fbcode/5973496158f44142/pytorch/botorch/__test_acquisition_cuda__/test_acquisition_cuda#link-tree/torch/testing/_comparison.py", line 1519, in assert_close
    raise error_metas[0].to_error(msg)
AssertionError: Scalars are not close!

Expected 0.9553655982017517 but got 0.9280872344970703.
Absolute difference: 0.027278363704681396 (up to 0 allowed)
Relative difference: 0.028552800892167798 (up to 0.02 allowed)

  test_cuda: AssertionError: False is not true

Failures:

  1) pytorch.botorch.test.test_cuda.TestBotorchCUDA: test_cuda
    1) AssertionError: False is not true
      File "pytorch/botorch/test/test_cuda.py", line 28, in test_cuda
        self.assertTrue(run_cuda_tests(tests))
Imports took: 16.9s! Profile with --import-profiler.
Executed 1 example in 211.1s:
  Successful: 0
  Failed: 1
  Skipped: 0
  Not executed: 262
https://testslide.readthedocs.io/

@slishak-PX
Contributor Author

slishak-PX commented Jan 28, 2025

I only tested on CPU. I did notice that the estimation of the standard deviation needs quite a high number of samples to converge, so I kept the number of samples reasonably low and the tolerance wide. I checked, and qUpperConfidenceBound is similarly inaccurate.

However, using a Sobol sequence instead of torch.randn here converges better (just pushed that change); only tested on CPU though.
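
For reference, one way to draw quasi-random standard normal samples for such a test (an assumed approach; the actual change may differ):

import math
import torch
from torch.quasirandom import SobolEngine

engine = SobolEngine(dimension=1, scramble=True, seed=0)
u = engine.draw(256, dtype=torch.float64).clamp(1e-6, 1 - 1e-6)
# Inverse-CDF transform of the low-discrepancy points to N(0, 1)
z = torch.erfinv(2 * u - 1) * math.sqrt(2)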

@facebook-github-bot
Contributor

@Balandat has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@Balandat
Contributor

Thanks, I ran some stress tests on GPU; seems like this solved the flakiness issues.

@facebook-github-bot
Contributor

@Balandat merged this pull request in 0653bfc.
