
Document how to implement custom models #2474

Closed · wants to merge 14 commits

Conversation

jakee417 (Contributor)

Motivation

Issue #2306

Have you read the Contributing Guidelines on pull requests?

Yes. Added a tutorial which can be used for smoke tests.

Test Plan

Probabilistic linear regression, Bayesian linear regression, and ensemble linear regression all yield optimization results close to (0, 0), which is the ground-truth answer.

Random forest doesn't seem to reach the ground-truth answer, likely due to its inability to incorporate gradient information into the optimization of the acquisition function.

Related PRs

N/A

@facebook-github-bot added the CLA Signed label on Aug 20, 2024
codecov bot commented Aug 21, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.98%. Comparing base (dd6448b) to head (0087151).
Report is 2 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #2474   +/-   ##
=======================================
  Coverage   99.98%   99.98%           
=======================================
  Files         191      191           
  Lines       16856    16864    +8     
=======================================
+ Hits        16854    16862    +8     
  Misses          2        2           


@saitcakmak (Contributor) left a comment


This is a great tutorial, thank you very much for the contribution! I just have a minor point about the sampler registration. Once it is ready, you can import the PR to fbcode. Landing it in fbcode will sync the changes to GitHub and close the PR.

tutorials/custom_model.ipynb: review comments (outdated, resolved)
@saitcakmak linked an issue Aug 21, 2024 that may be closed by this pull request
@Balandat (Contributor) left a comment


This is an awesome tutorial, thanks a lot for the contribution!

Some additional linear algebraic comments:

        # Inverse of the gram matrix.
        self.V = torch.linalg.inv(x.T @ x)

It's generally inadvisable to compute the inverse of a matrix. Rather than doing that, you'd typically compute a matrix decomposition of x.T @ x and use that for solves down the line using forward-backward substitutions. Could you modify the code to that end? Happy to provide more specifics / code changes if that would be useful.

Also, some of this can get hairy in 32bit precision if the gram matrix is ill-conditioned. In general we recommend folks use botorch with 64bit precision (except when running on larger data sets on a GPU where perf really counts). Could you modify the tutorial to use torch.float64? Either by choosing the dtype explicitly or by setting the default torch dtype at the beginning of the tutorial.
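For illustration, here is a minimal sketch (not code from this PR) of the failure mode being described: with nearly collinear features the Gram matrix becomes ill-conditioned, and a float32 solve loses essentially all accuracy while float64 stays tight.

    import torch

    torch.manual_seed(0)
    # Nearly collinear columns make the Gram matrix x.T @ x ill-conditioned.
    x = torch.randn(100, 2, dtype=torch.float64)
    x = torch.cat([x, x[:, :1] + 1e-4 * torch.randn(100, 1, dtype=torch.float64)], dim=-1)
    gram = x.T @ x
    print(torch.linalg.cond(gram).item())  # on the order of 1e8

    b = torch.randn(3, 1, dtype=torch.float64)
    v64 = torch.linalg.solve(gram, b)
    v32 = torch.linalg.solve(gram.to(torch.float32), b.to(torch.float32))

    print((gram @ v64 - b).norm().item())  # tiny residual in float64
    # Relative error of the float32 solution vs. the float64 one: very large.
    print(((v64 - v32.to(torch.float64)).norm() / v64.norm()).item())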

jakee417 (Contributor, Author) commented Aug 21, 2024

> It's generally inadvisable to compute the inverse of a matrix. Rather than doing that, you'd typically compute a matrix decomposition of x.T @ x and use that for solves down the line using forward-backward substitutions. [...]
>
> [...] In general we recommend folks use botorch with 64bit precision [...] Could you modify the tutorial to use torch.float64?

torch.double seems to improve the results; I see why this is recommended now. Switched torch.linalg.inv out for torch.linalg.cholesky followed by torch.cholesky_inverse. I am not sure whether torch.cholesky_inverse does forward-backward substitution under the hood, but it gives the same result as torch.linalg.inv.
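A quick way to sanity-check that claim (a sketch, not the tutorial's code):

    import torch

    torch.manual_seed(0)
    x = torch.randn(20, 5, dtype=torch.float64)
    gram = x.T @ x
    L = torch.linalg.cholesky(gram)
    # cholesky_inverse(L) reconstructs the same matrix as linalg.inv(gram).
    print(torch.allclose(torch.cholesky_inverse(L), torch.linalg.inv(gram)))  # True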

Balandat (Contributor) commented Aug 21, 2024

> torch.double seems to improve the results; I see why this is recommended now.

Great

> Switched torch.linalg.inv out for torch.linalg.cholesky followed by torch.cholesky_inverse. I am not sure whether torch.cholesky_inverse does forward-backward substitution under the hood, but it gives the same result as torch.linalg.inv.

torch.cholesky_inverse still computes the inverse explicitly. It's better than torch.linalg.inv since you're using the knowledge that the matrix is positive definite, but instead of computing the inverse explicitly and then doing matmuls, you should use torch.cholesky_solve (which does forward-backward substitution). So if you have A x = b, you want x = torch.cholesky_solve(b, torch.linalg.cholesky(A)) instead of x = torch.cholesky_inverse(torch.linalg.cholesky(A)) @ b.

You can assign self.L = torch.linalg.cholesky(A) instead of self.V = torch.cholesky_inverse(L), and use it both to compute the least-squares estimate and in the posterior call (swapping out self.V @ X.squeeze().T for the respective cholesky_solve() call).
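A small sketch of the suggested pattern, with made-up data rather than the tutorial's (the names A, L, w are illustrative):

    import torch

    torch.set_default_dtype(torch.float64)
    torch.manual_seed(0)
    x = torch.randn(50, 3)
    y = torch.randn(50, 1)

    A = x.T @ x                   # Gram matrix, positive definite
    L = torch.linalg.cholesky(A)  # cache this once, e.g. self.L = L

    # Least-squares estimate: solves A w = x.T @ y by forward-backward substitution.
    w = torch.cholesky_solve(x.T @ y, L)

    # Same result as materializing the inverse, but more stable and cheaper.
    w_inv = torch.cholesky_inverse(L) @ (x.T @ y)
    print(torch.allclose(w, w_inv))  # True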

jakee417 (Contributor, Author) commented Aug 21, 2024

Okay, now I got it. Replaced the V @ x with torch.cholesky_solve(b, L) in the appropriate places.
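For reference, the posterior-side swap looks roughly like this (variable names hypothetical; the cached factor plays the role of self.L):

    import torch

    torch.set_default_dtype(torch.float64)
    torch.manual_seed(0)
    x_train = torch.randn(50, 3)
    L = torch.linalg.cholesky(x_train.T @ x_train)  # cached at fit time

    X = torch.randn(10, 3)  # test points
    # Before: X @ self.V @ X.T, with self.V the explicit inverse.
    # After: one cholesky_solve per posterior call, no inverse stored.
    cov = X @ torch.cholesky_solve(X.T, L)
    print(cov.shape)  # torch.Size([10, 10])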

@facebook-github-bot

@jakee417 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@@ -153,4 +157,4 @@
"title": "Composite Bayesian Optimization with Multi-Task Gaussian Processes"
}
]
}
}
Member

What happened here, a whitespace change?

@esantorella (Member) left a comment


Awesome, this is a great tutorial and I may refer to it myself in the future. I left a couple of minor suggestions, but this looks pretty good to me as-is.

@facebook-github-bot

@jakee417 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.


@facebook-github-bot

@jakee417 merged this pull request in 81f2a88.
