inducer
diff --git a/‎cleared-demos/optimization/Convergence of Steepest Descent.ipynb‎
Lines changed: 247 additions & 0 deletions b/‎cleared-demos/optimization/Convergence of Steepest Descent.ipynb‎
Lines changed: 247 additions & 0 deletions
@@ -0,0 +1,247 @@
+{
+  "cells": [
+    {
+      "cell_type": "markdown",
+      "id": "e6e37e33-e07a-47ac-9cc7-62ba940f7bf3",
+      "metadata": {},
+      "source": [
+        "# Convergence of Steepest Descent\n",
+        "\n",
+        "Copyright (C) 2026 Andreas Kloeckner\n",
+        "\n",
+        "<details>\n",
+        "<summary>MIT License</summary>\n",
+        "Permission is hereby granted, free of charge, to any person obtaining a copy\n",
+        "of this software and associated documentation files (the \"Software\"), to deal\n",
+        "in the Software without restriction, including without limitation the rights\n",
+        "to use, copy, modify, merge, publish, distribute, sublicense, and/or sell\n",
+        "copies of the Software, and to permit persons to whom the Software is\n",
+        "furnished to do so, subject to the following conditions:\n",
+        "\n",
+        "The above copyright notice and this permission notice shall be included in\n",
+        "all copies or substantial portions of the Software.\n",
+        "\n",
+        "THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\n",
+        "IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\n",
+        "FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\n",
+        "AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\n",
+        "LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\n",
+        "OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN\n",
+        "THE SOFTWARE.\n",
+        "</details>"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 1,
+      "id": "43ee5b85-2fab-4e97-9747-3723d3911082",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "import sympy as sp"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 115,
+      "id": "a90e5f57-b44e-4f49-8432-1a280cebf422",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "lam1, lam2 = sp.symbols(\"lambda_1, lambda_2\", positive=True)\n",
+        "u, v = sp.symbols(\"u,v\", positive=True)\n",
+        "\n",
+        "def grad(expr, vec):\n",
+        "    return sp.Matrix([expr.diff(vec[0]), expr.diff(vec[1])]) "
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 116,
+      "id": "31f507ac-2875-4982-b5de-f9e5b199b4df",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "A = sp.Matrix([[lam1, 0], [0, lam2]])\n",
+        "A"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 117,
+      "id": "1d93038c-90ea-4fe9-8b74-16a8e1f0cb9d",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "x = sp.Matrix([u,v])\n",
+        "x"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 118,
+      "id": "79115a9c-d9d0-4def-97c1-f4d6e939535e",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "objective = (sp.Rational(1, 2) * x.T @ A @ x)[0]\n",
+        "objective"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "1bb41284-6ebe-4a14-a3e9-5350a2d61a30",
+      "metadata": {},
+      "source": [
+        "**Question:** What is the minimizer we're looking for?"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 119,
+      "id": "c7636016-95b6-4822-b8c4-467272741cd3",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "grad_objective = grad(objective, x)\n",
+        "grad_objective"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "fb0a76a9-9f64-4ec5-bd10-900a4a484b81",
+      "metadata": {},
+      "source": [
+        "**Question:** What does this coincide with? Can you prove it?\n",
+        "\n",
+        "Set up the line search as `line` and `line_objective`:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 120,
+      "id": "14b51b3d-3d97-4971-b076-ba3e0d7ffae1",
+      "metadata": {},
+      "outputs": [],
+      "source": []
+    },
+    {
+      "cell_type": "markdown",
+      "id": "1e3b7b6c-bd0b-40af-8193-9046c3bbd855",
+      "metadata": {},
+      "source": [
+        "And find the optimal $\\alpha$:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 121,
+      "id": "f349e3a5-9283-4673-88f7-b57b73830f42",
+      "metadata": {},
+      "outputs": [],
+      "source": []
+    },
+    {
+      "cell_type": "markdown",
+      "id": "815c073b-b571-47d6-b313-2fa671d436f0",
+      "metadata": {},
+      "source": [
+        "**Question:** What is this in general? Can you prove it?\n",
+        "\n",
+        "Next, find the next iterate:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 122,
+      "id": "9415b4f1-265a-47d0-b2ad-b3f587a2f71b",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "x_next = line.subs(alpha, alpha_opt)\n",
+        "x_next"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "d8c2cb68-894e-4ae6-a745-dc42bd9512e1",
+      "metadata": {},
+      "source": [
+        "Next, consider the decrease in energy error:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 140,
+      "id": "ef221751-72eb-49a5-a59a-7fd6888bf1e9",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "def energy_error_squared(vec):\n",
+        "    return (vec.T @ A @ vec)[0]\n",
+        "\n",
+        "ratio = sp.factor(energy_error_squared(x_next)/energy_error_squared(x))\n",
+        "ratio"
+      ]
+    },
+    {
+      "cell_type": "markdown",
+      "id": "8074511f-b3aa-496b-8965-e461421291a7",
+      "metadata": {},
+      "source": [
+        "Take gradient, find critical point:"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 145,
+      "id": "2bf92061-339d-4620-a19a-30b4a4e4e488",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "bad_soln, = sp.solve(grad(ratio, x), x)\n",
+        "x_bad = sp.Matrix(bad_soln)\n",
+        "x_bad"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": 147,
+      "id": "c803a626-7c11-49c7-9cce-8e9117c096be",
+      "metadata": {},
+      "outputs": [],
+      "source": [
+        "sp.factor(ratio.subs(u, x_bad[0]).subs(v, x_bad[1]))"
+      ]
+    },
+    {
+      "cell_type": "code",
+      "execution_count": null,
+      "id": "196f51cf-8e3d-4a38-bde5-fe6ab81e8b98",
+      "metadata": {},
+      "outputs": [],
+      "source": []
+    }
+  ],
+  "metadata": {
+    "kernelspec": {
+      "display_name": "Python 3 (ipykernel)",
+      "language": "python",
+      "name": "python3"
+    },
+    "language_info": {
+      "codemirror_mode": {
+        "name": "ipython",
+        "version": 3
+      },
+      "file_extension": ".py",
+      "mimetype": "text/x-python",
+      "name": "python",
+      "nbconvert_exporter": "python",
+      "pygments_lexer": "ipython3",
+      "version": "3.14.3"
+    }
+  },
+  "nbformat": 4,
+  "nbformat_minor": 5
+}