
Fixed condition_on_observations in fully Bayesian models #2151

Closed
wants to merge 3 commits

Conversation

@hvarfner (Contributor)

Motivation

Conditioning on observations in fully Bayesian models; this enables fully Bayesian JES and (possibly) KG.

Have you read the Contributing Guidelines on pull requests?

Yes.

Test Plan

Tests are written to ensure functionality for both inferred and fixed noise. Note that the _aug_batch_shape attribute assignment was removed in condition_on_observations: in fully Bayesian GPs, this attribute could not be assigned (hence the removal). I could not find any use for it, and all tests passed with it removed.

Other changes are commented throughout; they were made to ensure that fully Bayesian GPs can keep a single set of training data throughout. However, conditioning on observations adds a batch dimension to the training data, which GPyTorch needs in order to infer the correct batch shape.
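A minimal sketch of the usage this PR enables (hedged: the exact shape contract is discussed further down in this thread; here the conditioning targets carry a leading batch dimension matching the number of MCMC hyperparameter sets):

    import torch

    from botorch.fit import fit_fully_bayesian_model_nuts
    from botorch.models.fully_bayesian import SaasFullyBayesianSingleTaskGP

    train_X = torch.rand(10, 4, dtype=torch.float64)
    train_Y = train_X.sin().sum(dim=-1, keepdim=True)

    model = SaasFullyBayesianSingleTaskGP(train_X, train_Y)
    # Tiny MCMC budget, purely for illustration.
    fit_fully_bayesian_model_nuts(
        model, warmup_steps=32, num_samples=16, thinning=4, disable_progbar=True
    )
    num_models = model.batch_shape[0]  # number of retained hyperparameter sets

    # Condition on new data; Y gets a leading batch dim of size num_models.
    new_X = torch.rand(3, 4, dtype=torch.float64)
    new_Y = new_X.sin().sum(dim=-1, keepdim=True).repeat(num_models, 1, 1)
    conditioned_model = model.condition_on_observations(new_X, new_Y)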

@facebook-github-bot added the CLA Signed label Dec 16, 2023
@hvarfner (Contributor, Author)

@dme65

@facebook-github-bot (Contributor)

@sdaulton has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@sdaulton (Contributor)

Thanks for putting this up @hvarfner!

@dme65 (Contributor) commented Dec 18, 2023

Thanks for fixing this @hvarfner! These changes generally look good to me.

@saitcakmak (Contributor) left a comment

Thanks for making these changes. I left some (mostly cosmetic) inline comments.
One thing I want to make sure is that the model batch shape remains compatible with the model itself. To this end, could you also update the batch_shape property of SaasFullyBayesianSingleTaskGP to reflect the correct shape before & after fantasizing?

Resolved inline comments (outdated) on botorch/models/fully_bayesian.py.
    raise ValueError(
        "Conditioning in fully Bayesian models must contain a batch dimension. "
        "Add a batch dimension (the leading dim) with length matching the "
        "number of hyperparameter sets to the conditioned data."
    )
@saitcakmak (Contributor)
Is the number of hyperparameter sets self.num_mcmc_samples or self.batch_shape (the two are the same unless the model is already batched / fantasized)? If I were to attempt fantasizing from an already fantasized model, would this still hold? It'd be good to print out the expected shape as part of the error message.
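A hedged sketch of that suggestion (illustrative only, not the code as merged; Y here denotes the conditioning targets):

    raise ValueError(
        "Conditioning in fully Bayesian models must contain a batch dimension. "
        "Add a batch dimension (the leading dim) with length matching the "
        f"number of hyperparameter sets. Expected Y with batch shape "
        f"{self.batch_shape}, got {Y.shape}."
    )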

@hvarfner (Contributor, Author)
Had a bit of an incorrect rushed comment before, which I deleted. =)

Resolved inline comments (outdated) on botorch/models/gpytorch.py and test/models/test_fully_bayesian.py.
    @@ -656,6 +666,77 @@ def test_custom_pyro_model(self):
                atol=5e-4,
            )

        def test_condition_on_observation(self):
@saitcakmak (Contributor)

Thanks for adding these tests. For completeness, can you add a test that calls model.fantasize and a test that evaluates some acquisition function (e.g., qLogEI) with the fantasized model? I just want to make sure everything works e2e, including the sampler-related code inside the acquisition functions.
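A hedged sketch of the requested end-to-end check (model, train_X, and train_Y as in the earlier sketch; qLogExpectedImprovement and SobolQMCNormalSampler are the standard BoTorch classes):

    import torch

    from botorch.acquisition.logei import qLogExpectedImprovement
    from botorch.sampling.normal import SobolQMCNormalSampler

    # Fantasize at a couple of candidate points.
    sampler = SobolQMCNormalSampler(sample_shape=torch.Size([4]))
    fantasy_model = model.fantasize(X=train_X[:2], sampler=sampler)

    # Evaluating an acquisition function with the fantasized model exercises
    # the sampler-related code paths end-to-end; as the discussion below
    # shows, this is exactly where batch-shape issues surface.
    acqf = qLogExpectedImprovement(model=fantasy_model, best_f=train_Y.max())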

@hvarfner force-pushed the main branch 3 times, most recently from 01f9e5e to b4040b5 on December 20, 2023 at 08:49
@hvarfner (Contributor, Author)

@saitcakmak thanks for the feedback!

I hadn't considered fantasizing before, so I had to rework things a little to add it. Now, the batch shapes are inferred in condition_on_observations for both the fantasizing case (X.shape = n x d, Y.shape = fantasy_dim x batch_dim x n x 1) and the no-batch-shape case (X.shape = n x d, Y.shape = n x 1). In both cases, the output batch shape will be non-empty, as seen in the added tests.
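Spelling out those two cases with concrete shapes (a hedged illustration; the dimension sizes are arbitrary):

    import torch

    n, d, num_models, fantasy_dim = 5, 4, 16, 4

    # No-batch case: plain data, broadcast across the num_models
    # MCMC hyperparameter sets.
    X = torch.rand(n, d)
    Y = torch.rand(n, 1)

    # Fantasizing case: Y carries leading fantasy and model-batch dims.
    Y_fantasy = torch.rand(fantasy_dim, num_models, n, 1)

    # Either way, model.condition_on_observations(X, Y) yields a model
    # whose batch shape is non-empty (at least num_models).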

@saitcakmak (Contributor)

Thanks for making the changes. I had not realized that your original changes did not affect the model batch size (as in, the output shape and the shape of the lengthscales do not change in the conditioning tests). But fantasizing (which is the use case of conditioning I am most familiar with) does change the batch shape by adding another dimension to it.

So, if I try to use the fantasized model with some acquisition functions, it errors out because the batch_shape property of the model is not updated. E.g., adding

            acqf = qNoisyExpectedImprovement(
                model=fantasy_model, X_baseline=train_X, prune_baseline=False
            )
            acqf(train_X)

to the end of the fantasize test raises AssertionError: Expected the output shape to match either the t-batch shape of X, or the model.batch_shape in the case of acquisition functions using batch models; but got output with shape torch.Size([19]) for X with shape torch.Size([1, 10, 4]). This is because the model batch shape becomes fantasy_size x num_models after fantasizing, but lines 427-431 define batch_shape as torch.Size([self.num_mcmc_samples]) (equal to num_models in the test).

One option here would be to update the definition of the batch_shape property to correctly return the batch shape of both the basic and fantasized models (a sketch follows below). Another would be to roll back the fantasize support, which seems to require some more work to get right (probably beyond the scope of this PR). Knowledge gradient is the most common use case for fantasy models that I know of, and it triggers additional errors when I add this bit to the fantasize test:

        acqf = qKnowledgeGradient(
            model=model, num_fantasies=4
        )
        acqf(train_X)
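For the first option, a hedged sketch of what an updated batch_shape property on SaasFullyBayesianSingleTaskGP might look like (an assumption for illustration, not the code as merged):

    @property
    def batch_shape(self) -> torch.Size:
        self._check_if_fitted()
        if self.train_targets.dim() > 1:
            # Conditioned/fantasized: the train targets carry explicit
            # batch dims, e.g. fantasy_size x num_models x n.
            return self.train_targets.shape[:-1]
        # Basic fitted model: one batch per MCMC hyperparameter sample.
        return torch.Size([self.num_mcmc_samples])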

@hvarfner (Contributor, Author)

@saitcakmak Yeah, there seems to be some nuance to this that I failed to consider. I think there are two things that need to be ironed out if we want consistency (and retain only one batch of training data by default):

  1. What should the shape of the FBGP posterior be? Currently, the input must be batch_shape * q * d, so that batch_shape * num_models * q * d is returned by unsqueezing exactly MCMC_DIM = -3 in the posterior (previously the forward) call; see the sketch after this list. For the aforementioned fantasy_model to work, two dims must be unsqueezed (or len(batch_shape) dims in general). Moreover, this makes model.posterior(train_X) rather awkward, since the shape of the posterior will have the first two dims swapped compared to the conventional call described above. acqf(train_X) does not give the intended shape for any fully Bayesian acquisition function AFAIK; the input has to be three-dimensional.

  2. When using fantasize, I think the user would be pretty surprised to receive a batch_shape * num_fantasies * num_models * q * d posterior (since the fantasy dim typically goes first, right?). However, this is more in line with the current FBGP structure, and it could be accommodated by updating batch_shape and unsqueezing len(batch_shape) dimensions in the posterior call. Ultimately, the model dim needs to trail the batch_shape dim due to this line, so I struggle to see a general solution that gives the consistent num_fantasies * batch_shape * num_models * q * d when fantasizing.

I am not quite sure how this can be solved in a consistent manner without reconsidering the FBGP altogether (i.e., making the model dim a leading dim), so I am for rolling back as of now!
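The sketch referenced in point 1, showing where the unsqueeze happens (hedged: MCMC_DIM = -3 comes from botorch.models.fully_bayesian; the rest is illustrative):

    import torch

    from botorch.models.fully_bayesian import MCMC_DIM  # == -3

    batch, q, d = 2, 3, 4
    X = torch.rand(batch, q, d)

    # The posterior call unsqueezes the input at MCMC_DIM so that it
    # broadcasts against the num_models hyperparameter sets:
    X_expanded = X.unsqueeze(MCMC_DIM)  # batch x 1 x q x d
    # The resulting posterior has batch shape batch x num_models,
    # e.g. a mean of shape batch x num_models x q x 1.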

@saitcakmak (Contributor)

That sounds good to me. We can revisit it down the line if there's a need for fantasize support. Happy to merge this in after you push the changes rolling it back.

@hvarfner force-pushed the main branch 2 times, most recently from c52c329 to 17fdee8 on December 27, 2023 at 08:34
@hvarfner (Contributor, Author)

@saitcakmak done!

I think the long-term fix (which I don't think would be too much work) would be to have MCMC_DIM be a leading dim, but the last of the leading dims, i.e. at index -4. I could give this a go going forward if you don't foresee any issues with it.

@facebook-github-bot (Contributor)

@saitcakmak has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@saitcakmak (Contributor) left a comment

Thanks! I haven't thought too deeply about this, but I think a proper way to support fantasize would start with adding support for batched inputs more generally (in __init__) and adding the MCMC dimension as the outermost dimension, as you suggest. We could also store the train_inputs/targets batched along the MCMC dimension to bring the model internals further in line with other batched models (e.g., SingleTaskGP). That being said, making these changes is a decent bit of work (it would also require updating other parts of the code base that have specialized logic for this model), so it might not be worth the effort unless there's a real motivator.

@facebook-github-bot (Contributor)

@saitcakmak merged this pull request in 0c37aac.
