
Add SphericalLinear kernel #2742

Open
colmont wants to merge 3 commits into cornellius-gp:main from colmont:spherical-linear

Conversation

colmont commented Mar 30, 2026

No description provided.

@gpleiss (Member) left a comment

Generally good; a few changes

stereographic projection onto a unit sphere, and :math:`(b_0, b_1)` are learned
mixture weights (via softmax, so :math:`b_0 + b_1 = 1`).

This kernel was proposed in `We Still Don't Understand High-Dimensional Bayesian Optimization`.

Member
Nit: make this into a link, rather than having the arxiv link separate.

Author
Done
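
For context, a minimal standalone sketch of the mixture those docstring lines describe (not the PR's actual forward pass; `project` stands in for the inverse stereographic projection helper, e.g. project_onto_unit_sphere from the tests below):

import torch

def spherical_linear_sketch(x1, x2, raw_coeffs, project):
    # b0 + b1 = 1 via softmax, as the docstring states.
    b0, b1 = torch.softmax(raw_coeffs, dim=-1)
    # Map both inputs onto the unit sphere before taking the linear term.
    z1, z2 = project(x1), project(x2)
    # Constant component weighted by b0, linear (dot-product) component by b1.
    return b0 + b1 * (z1 @ z2.transpose(-2, -1))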

self.bounds = bounds

# Learned mixture coefficients: softmax([raw_coeffs]) -> [constant, linear]
self.raw_coeffs = nn.Parameter(torch.zeros(*self.batch_shape, 2))

Member
We should have getters and setters for the non-raw values, so that they can be initialized appropriately (e.g. self.coeffs = <blah>)

Author
Done
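
For reference, one way the requested getter/setter could look under the softmax parameterization shown above (a sketch only; the PR may implement it differently, e.g. via GPyTorch constraints). These would be methods on the kernel class, using gpytorch.Module.initialize:

@property
def coeffs(self):
    # Non-raw mixture weights [constant, linear]; always sum to 1.
    return torch.softmax(self.raw_coeffs, dim=-1)

@coeffs.setter
def coeffs(self, value):
    if not torch.is_tensor(value):
        value = torch.as_tensor(value).to(self.raw_coeffs)
    # softmax(log(v)) == v / v.sum(), so weights that already sum to 1
    # round-trip exactly through the raw parameter.
    self.initialize(raw_coeffs=value.log())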

self.raw_coeffs = nn.Parameter(torch.zeros(*self.batch_shape, 2))

# Global lengthscale: sigmoid(raw_glob_ls) * max_sq_norm
self.raw_glob_ls = nn.Parameter(torch.zeros(*self.batch_shape, 1))

Member
Same thing here.

Author
Done
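
Analogously, a sketch for the global lengthscale, assuming the kernel stores a max_sq_norm attribute as the comment above suggests (names are illustrative):

@property
def glob_ls(self):
    # Non-raw global lengthscale, constrained to (0, max_sq_norm) by the sigmoid.
    return torch.sigmoid(self.raw_glob_ls) * self.max_sq_norm

@glob_ls.setter
def glob_ls(self, value):
    if not torch.is_tensor(value):
        value = torch.as_tensor(value).to(self.raw_glob_ls)
    # torch.logit inverts the sigmoid, so the value round-trips exactly.
    self.initialize(raw_glob_ls=torch.logit(value / self.max_sq_norm))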

ard_num_dims: int | None = None,
lengthscale_prior: Prior | None = None,
lengthscale_constraint: Interval | None = None,
normalize_lengthscale: bool = False,

Member
Maybe we should set it to True? And we can add a comment that it was set to False in the original paper.

Author
Done

return torch.ones(x1.shape[:-1], dtype=x1.dtype, device=x1.device)

if self.normalize_lengthscale: # Enforce L2 norm = 1
lengthscale = torch.softmax(self.lengthscale, dim=-1).sqrt()

Member
Perhaps it would be better if we just divided the lengthscale by its norm, so as not to distort the geometry of the lengthscale.

Author
I tried this on a small example (SVM benchmark in high-dim BO): dividing the lengthscale by its norm is about 10x slower than using the softmax, and performance is not necessarily better. This was just a small local test with a few iterations (20-30); I have not yet double-checked this on the cluster across more problems/seeds/iterations. What do you think: is it worth investigating further, or do you already have a strong preference for which reparameterization to use?
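
For reference, the two reparameterizations being compared, as a standalone sketch (variable names are illustrative):

import torch

lengthscale = torch.rand(6) + 0.1  # stand-in for self.lengthscale

# (a) As in the snippet above: softmax then sqrt, so the squared entries sum
#     to 1 and the L2 norm is exactly 1.
ls_softmax = torch.softmax(lengthscale, dim=-1).sqrt()

# (b) The suggested alternative: divide by the norm, which also yields unit
#     L2 norm but preserves the ratios between the entries.
ls_normed = lengthscale / lengthscale.norm(dim=-1, keepdim=True)

assert torch.allclose(ls_softmax.norm(dim=-1), torch.tensor(1.0))
assert torch.allclose(ls_normed.norm(dim=-1), torch.tensor(1.0))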

projected = project_onto_unit_sphere(x)
self.assertEqual(projected.shape, torch.Size([10, 6]))
norms = projected.norm(dim=-1)
self.assertAllClose(norms, torch.ones(10), rtol=1e-5, atol=1e-5)

Member
Can you add a test that inputs that are on the unit sphere get an effectively identity mapping, to ensure we did the inverse stereographic projection correctly?

Author
Done
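
The added test could look roughly like this (a sketch; input shapes follow the existing test above and may need adjusting to however the projection handles dimensionality):

def test_projection_is_identity_on_unit_sphere(self):
    """Inputs already on the unit sphere should map (approximately) to themselves."""
    x = torch.randn(10, 6)
    x = x / x.norm(dim=-1, keepdim=True)  # place inputs exactly on the unit sphere
    projected = project_onto_unit_sphere(x)
    self.assertAllClose(projected, x, rtol=1e-5, atol=1e-5)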

kernel = SphericalLinearKernel(bounds=UNIT_BOUNDS_3D, lengthscale_prior=NormalPrior(0, 1))
pickle.loads(pickle.dumps(kernel))

def test_consistency_square_vs_rectangular(self):

Member
Do we need this test?

Author
I got rid of it

def test_pickle_with_prior(self):
"""Kernel with prior should survive pickle round-trip."""
kernel = SphericalLinearKernel(bounds=UNIT_BOUNDS_3D, lengthscale_prior=NormalPrior(0, 1))
pickle.loads(pickle.dumps(kernel))

Member
Do we need this test?

Author
I got rid of it

bounds = torch.tensor([[0.0, 0.0], [1.0, 1.0]])
kernel = SphericalLinearKernel(bounds=bounds)
loaded = pickle.loads(pickle.dumps(kernel))
self.assertAllClose(loaded.bounds, bounds)

Member
Do we need this test?

Author
I got rid of it

"""Should accept valid priors and reject invalid ones."""
SphericalLinearKernel(bounds=UNIT_BOUNDS_3D, lengthscale_prior=None)
SphericalLinearKernel(bounds=UNIT_BOUNDS_3D, lengthscale_prior=NormalPrior(0, 1))
self.assertRaises(TypeError, SphericalLinearKernel, UNIT_BOUNDS_3D, lengthscale_prior=1)

Member
Do we need this test?

Author
I got rid of it


colmont commented Apr 25, 2026

I tried to address all of your comments as best I could, and left a longer reply for the reparameterization question. When playing around with this model on the mujoco-humanoid problem in high-dim BO, my jobs instantly OOMed, even before acquiring the first point. I believe I found a bug in GPyTorch: when d >= n, linear kernels should be routed to DefaultPredictionStrategy instead of LinearPredictionStrategy. I tried to fix this as best I could in a second commit; I hope it's reasonably clear!
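
A sketch of the routing described above, written against the kernel-level prediction_strategy hook. DefaultPredictionStrategy is the existing GPyTorch class; LinearPredictionStrategy and the d >= n threshold are taken from the comment, and the exact override point and import are assumptions, not the PR's actual diff:

from gpytorch.models.exact_prediction_strategies import DefaultPredictionStrategy

def prediction_strategy(self, train_inputs, train_prior_dist, train_labels, likelihood):
    # train_inputs may arrive as a tuple of tensors; grab the first entry.
    x = train_inputs[0] if isinstance(train_inputs, (tuple, list)) else train_inputs
    n, d = x.shape[-2:]
    if d >= n:
        # With at least as many features as training points, the low-rank
        # shortcut no longer saves memory: fall back to the default strategy.
        return DefaultPredictionStrategy(train_inputs, train_prior_dist, train_labels, likelihood)
    # LinearPredictionStrategy: the strategy named in the comment above (import not shown).
    return LinearPredictionStrategy(train_inputs, train_prior_dist, train_labels, likelihood)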
