
Add Tensorization Example Applied to Battery Thermal Analysis #19

Open
jonahweiss wants to merge 1 commit into matlab-deep-learning:main from jonahweiss:feature/tfno_example

Conversation

@jonahweiss

The example is a live script, tensorizedFourierNeuralOperatorForBatteryCoolingAnalysis.m, which applies the method from the paper Multi-Grid Tensorized Fourier Neural Operator for High-Resolution PDEs to the Battery Heat Analysis example.

Once the support files containing pregenerated simulation data are live, the URL variable pregeneratedSimulationDataURL in the example will need to be set; the function downloadSimulationData.m can then download and unzip the data from that URL.

The tfno/ folder includes the implementation of the TFNO 3D model.

The lossFunctions/ folder includes the implementation of the relative H1 loss.

The trainingPartitions.m and createBatteryModuleGeometry.m functions are taken from the existing Battery Heat Analysis example.

@@ -0,0 +1,166 @@
function H1 = h1Norm(X, params)

How are these functions called, given they're in a sub-directory? You'd either have to change directory or call addpath, right?

Personally I prefer using a namespace +lossFunctions so you can call everything like lossFunctions.h1Norm from the base directory of this example. Maybe something more standard is to just put everything in the base directory of the example - that might be more or less what doc examples do when you use the openExample command.

% X = randn(B,C,S1,S2);
% H1 = h1Norm(X);
%
% Copyright 2026 The MathWorks, Inc.

We tend to separate the copyright from the m-help so it isn't displayed in help(h1Norm).

Comment on lines +35 to +37
% Input X must be a numeric array of size [B, C, S1, S2, ..., SD]
% where B is batch size, C is number of channels, and S1...SD are
% spatial dimensions.

Why BC(S..S)? That seems more like PyTorch's layout, whereas dlarray defaults to "SSCB" ordering when using labels.
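For illustration, the two layouts differ only by an axis permutation; a NumPy sketch (shapes here are made up) of going from the PyTorch-style B,C,S1,S2 ordering to the dlarray-style S1,S2,C,B ordering:

```python
import numpy as np

# Hypothetical shapes: PyTorch-style layout is (B, C, S1, S2);
# dlarray's default "SSCB" label ordering puts them as (S1, S2, C, B).
B, C, S1, S2 = 2, 3, 4, 5
x_bcss = np.zeros((B, C, S1, S2))
x_sscb = np.transpose(x_bcss, (2, 3, 1, 0))  # reorder axes to S1, S2, C, B
```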

params.Spacings = ones(1, D);
else
if numel(params.Spacings) ~= D
error('params.Spacings must have length equal to the number of spatial dimensions (D).');

We probably wouldn't include params here - it's a variable name in the implementation, not something the user is aware of without looking into the implementation.

dm = 1 + d; % Dimension index of this spatial axis in reshaped X.

% Central difference with wrap.
fd = (circshift(X, -1, dm) - circshift(X, 1, dm)) / (2 * delta);

Just a warning that circshift isn't a dlarray method, so the way it supports dlarray functionality like dlgradient and dlaccelerate is that we trace the dlarrays through the circshift implementation - if that implementation happens to use only dlarray-compatible methods and patterns, things should work out.

I expect you need dlgradient for a loss function, and dlaccelerate would be beneficial. A couple of reasons to be cautious with functions that aren't explicitly dlarray methods but work through this "tracing" approach:

  1. There are many codepaths underlying circshift and other functions - you'd need to verify that all of those are dlarray compatible code, or ensure that you only ever go down codepaths that are.

  2. Since circshift isn't a dlarray method, there's no reason it couldn't be replaced in a future release by a C/C++ built-in which would not support dlgradient or dlaccelerate - I wouldn't expect us to have internal tests that would catch this, because circshift isn't a dlarray method and we can't reasonably guarantee that every function in MATLAB that supports dlarray through tracing will always support it.
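As a quick sanity check of the numerical pattern itself (not of the dlarray tracing concern), the same periodic central difference can be written with np.roll, whose shift directions match circshift's:

```python
import numpy as np

# Periodic central difference, mirroring
# fd = (circshift(X, -1, dm) - circshift(X, 1, dm)) / (2 * delta):
def central_diff_periodic(x, delta, axis):
    return (np.roll(x, -1, axis=axis) - np.roll(x, 1, axis=axis)) / (2 * delta)

n = 256
delta = 2 * np.pi / n
t = np.arange(n) * delta
df = central_diff_periodic(np.sin(t), delta, axis=0)
max_err = np.max(np.abs(df - np.cos(t)))  # second-order accurate, O(delta^2)
```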

Dim = finddim(X, dim);
permuteOrder = [Dim setdiff(1:ndims(X), Dim, 'stable')];
X = permute(stripdims(X), permuteOrder);
X = dlarray(X, fmt);

Does it matter if the format still makes sense here - e.g. x = dlarray(rand(5,4),"CB"); y = permuteDimFirst(x,"B") will re-label x's batch dim as y's channel dim.

I think if you need the dimensions in a particular layout, it's probably best to just work without format labels for as long as that's needed, since the dlarray label auto-permutes are always going to fight back against non-default layouts. If you still need dlarray methods when you don't have format labels, most methods that require labelled data should also have something like a DataFormat name-value pair.

SquareRoot=params.SquareRoot, ...
Periodic=params.Periodic);

loss = num./(den + eps);

We sometimes make this eps settable, e.g. layernorm has an Epsilon name-value pair - I suppose because eps can still be very small and num./(den + eps) is only bounded above by about num*4.5e15, since eps is about 2.2e-16.
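A small numeric illustration of why a settable epsilon helps (the name Epsilon below is borrowed from layernorm's name-value pair; the values are made up):

```python
import numpy as np

# Machine eps is about 2.2e-16, so num / (den + eps) is only bounded
# by num / eps, roughly num * 4.5e15, when den vanishes.
eps = np.finfo(np.float64).eps
num, den = 1.0, 0.0
ratio = num / (den + eps)        # explodes when den is zero

# A user-settable Epsilon gives a much tighter, controllable bound.
epsilon = 1e-6
bounded = num / (den + epsilon)  # at most num / epsilon = 1e6
```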

@@ -0,0 +1,73 @@
classdef depthwiseConv3dLayer < nnet.layer.Layer & ...

Could this be convolution3dLayer(1,numChannels) and convolution3dLayer(1,numChannels,BiasLearnRateFactor=0) when UseBias==false?

assertValidNumConvolutionDimensions(3, hasTimeDimension, numSpatialDimensions);

% Check the input data has a channel dimension
assertInputHasChannelDim(1, cdim);

assertInputHasChannelDim(3,cdim)


% Same initialization as convolution2Dlayer, from
% /matlab/toolbox/nnet/cnn/+nnet/+internal/+cnn/+layer/+learnable/+initializer/Normal.m
layer.Weight = dlarray(randn(weightSize), layout.Format) * 0.01;

This is the "narrow-normal" weight initializer, but the default for conv layers is Glorot/Xavier initialization which uses uniform random + a scale factor.
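For reference, a sketch of the Glorot/Xavier uniform rule next to the narrow-normal one in the diff (channel counts are made up; for a 1x1x1 conv, fan-in/fan-out are effectively the channel counts):

```python
import numpy as np

rng = np.random.default_rng(0)
fan_in, fan_out = 64, 128

# Glorot/Xavier uniform: U(-b, b) with b = sqrt(6 / (fan_in + fan_out)),
# which gives weight variance 2 / (fan_in + fan_out).
bound = np.sqrt(6.0 / (fan_in + fan_out))
w_glorot = rng.uniform(-bound, bound, size=(fan_out, fan_in))

# Narrow-normal, as in the diff: zero-mean Gaussian with std 0.01.
w_narrow = 0.01 * rng.standard_normal((fan_out, fan_in))
```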

% /matlab/toolbox/nnet/cnn/+nnet/+internal/+cnn/+layer/+learnable/+initializer/Normal.m
layer.Weight = dlarray(randn(weightSize), layout.Format) * 0.01;
if layer.UseBias
layer.Bias = dlarray(zeros(weightSize), layout.Format);

Most built-in layers initialize weights as single, since most dlnetwork work happens in single by default.

layerNormalizationLayer(Name="ln1"), ...
additionLayer(2, Name="add1"), ...
geluLayer(Name="gelu1"), ...
convolution3dLayer(1, latentChannelSize * args.MLPExpansion, Name="channelMLP1"), ...

You should use something like ceil(latentChannelSize * args.MLPExpansion) or floor or round here.

net = connectLayers(net, "channelSkip", "add2/in2");
else
net = connectLayers(net, "in", "add2/in2");
end

Could LinearFNOSkip and ChannelMLPSkip be merged into a SkipConnectionMode = ["identity","linear"]? That would miss the option of using "linear" for just one of the skips, but I don't expect that's common.

@@ -0,0 +1,30 @@
function [pos,neg] = iPositiveAndNegativeFrequencies(N)

The i prefix convention is for internal functions, i.e. local functions defined inside another function/class file.
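On the function's behavior itself, a cross-check of the positive/negative frequency split against NumPy's FFT bin ordering (illustration only; the helper's exact convention for the Nyquist term may differ):

```python
import numpy as np

# For even N, FFT bin frequencies are ordered [0, 1, ..., N/2-1, -N/2, ..., -1].
N = 8
k = np.fft.fftfreq(N, d=1.0 / N)  # integer-valued frequencies for this spacing
pos = k[k >= 0]
neg = k[k < 0]
```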

@@ -0,0 +1,57 @@
classdef spatialEmbeddingLayer3D < nnet.layer.Layer & ...
nnet.layer.Formattable & nnet.layer.Acceleratable %#codegen
%SPATIALEMBEDDINGLAYER3D - 3D spatial embedding layer.

I've seen this called grid embedding elsewhere - I'd probably include that phrase somewhere.

function layer = spatialEmbeddingLayer3D(spatialLimits, args)
arguments
spatialLimits (3, 2) double
args.Name (1, 1) string = "depthwiseConv"

The default name should be something else.

S3 = linspace(layer.SpatialLimits(3, 1), ...
layer.SpatialLimits(3, 2), sSize(3));

[embedding1, embedding2, embedding3] = meshgrid(S1, S2, S3);

I think all this should be done once at data preprocessing/feature engineering rather than on every iteration.

You could also compute the embedding once at initialize time and store it as a property to be returned from predict. The repmat over batch dimension would have to happen in predict though.
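A sketch of the precompute-once idea in NumPy terms (limits and sizes are made up): build the coordinate grid a single time, then broadcast over the batch dimension instead of rebuilding it every iteration.

```python
import numpy as np

# Build the coordinate grid once, outside the training loop.
limits = [(0.0, 1.0), (0.0, 2.0), (0.0, 3.0)]   # assumed spatial limits
sizes = (4, 5, 6)
axes = [np.linspace(lo, hi, n) for (lo, hi), n in zip(limits, sizes)]
g1, g2, g3 = np.meshgrid(*axes, indexing="ij")  # "ij" matches MATLAB's ndgrid
grid = np.stack([g1, g2, g3], axis=-1)          # (S1, S2, S3, 3), computed once

# Per batch, broadcasting is a cheap view rather than a repmat-style copy.
batch_size = 8
batched = np.broadcast_to(grid, (batch_size,) + grid.shape)
```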

% creates a spectral convolution 3d layer. outChannels
% specifies the number of channels in the layer output.
% numModes specifies the number of modes which are combined
% in Fourier space for each of the 2 spatial dimensions.

3 spatial dimensions

@@ -0,0 +1,63 @@
# Tensorized Fourier Neural Operator for 3D Battery Heat Analysis

This example builds on the [Fourier Neural Operator for 3D Battery Heat Analysis](https://github.com/matlab-deep-learning/SciML-and-Physics-Informed-Machine-Learning-Examples/tree/main/battery-module-cooling-analysis-with-fourier-neural-operator) example, applying a Tensorized Fourier Neural Operator (TFNO) [1, 2] to heat analysis of a 3D battery module. The TFNO compresses the standard Fourier Neural Operator using tensorization, achieving a 14.3x parameter reduction while maintaining accuracy.

Might be able to link the example with a relative path in the repo.

Comment on lines +5 to +6
![](./images/prediction_vs_gt.png)
![](./images/absolute_error.png)

It's worth adding alt-text descriptions for these.

@@ -0,0 +1,100 @@
function [geomModule, domainIDs, boundaryIDs, volume, boundaryArea, ReferencePoint] = createBatteryModuleGeometry(numCellsInModule, cellWidth,cellThickness,tabThickness,tabWidth,cellHeight,tabHeight, connectorHeight )

There might already be an open discussion or issue to handle shared helpers on this repo, so we don't have duplicate implementations of things like this. It's probably best to keep the example self contained as it is currently for now, since we haven't fixed on one solution.

Comment on lines +53 to +54
convolution3dLayer(1, liftingChannels, Name="lifting1"), ...
convolution3dLayer(1, latentChannelSize, Name="lifting2")];

Should there be a nonlinearity between these 2? It's usually a bit odd to have 2 consecutive linear layers (since you could just multiply the two weight matrices together and make it 1 linear layer) and I don't see "lifting1" connected to anything else that would require it to be split up like this.

Xout = zeros([N1,N2,N3,this.OutputSize,size(X,5)],like=X);
Xout(xFreq,yFreq,zFreq,:,:) = X;

% Make Xout conjugate symmetric.

We could add a bit more detail to this comment and say it's so the ifft output is real valued, and reference the Algorithms section of the ifftn doc page.
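To make the point concrete, a NumPy sanity check (not the example's code): the inverse FFT is real-valued, up to round-off, exactly when the spectrum is conjugate symmetric, i.e. Y[k] equals conj(Y[-k mod N]) in every dimension.

```python
import numpy as np

rng = np.random.default_rng(1)
x = rng.standard_normal((4, 4, 4))

# The FFT of real data is conjugate symmetric, so the round trip is real.
Y = np.fft.fftn(x)
max_imag = np.max(np.abs(np.fft.ifftn(Y).imag))   # ~ machine precision

# Breaking the symmetry in a single bin makes the inverse complex.
Y[1, 0, 0] += 1j
broken_imag = np.max(np.abs(np.fft.ifftn(Y).imag))
```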
