Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
55f26f2
Switched to new GPU Codegen.
philip-paul-mueller May 6, 2026
574ed87
Was this the error.
philip-paul-mueller May 6, 2026
7a96e6b
This should be the thing.
philip-paul-mueller May 6, 2026
48a8f1c
Let's try this fix.
philip-paul-mueller May 7, 2026
6d52e24
Let's try this fix.
philip-paul-mueller May 7, 2026
246cc24
Merge remote-tracking branch 'origin/main' into dace_new_gpu_codegen
philip-paul-mueller May 7, 2026
cc69d9a
Updated DaCe dependency.
philip-paul-mueller May 11, 2026
1d69a81
Merge remote-tracking branch 'gt4py/main' into dace_new_gpu_codegen
philip-paul-mueller May 26, 2026
038d47a
Updated the DaCe version.
philip-paul-mueller May 26, 2026
815006e
Updated the DaCe version again.
philip-paul-mueller May 26, 2026
446e5cf
This should handle the issue.
philip-paul-mueller May 27, 2026
21e39f7
Not always a transient.
philip-paul-mueller May 27, 2026
e599d19
Realized that I have not yet used the new GPU code generator.
philip-paul-mueller May 27, 2026
597d4dc
Updated DaCe
philip-paul-mueller May 27, 2026
9d3138d
Fixed some double references in subsets.
philip-paul-mueller May 27, 2026
3261016
Fixed some more double references in subsets.
edopao May 27, 2026
2cf71af
Merge branch 'main' into dace_new_gpu_codegen
edopao May 27, 2026
ad06ab1
Merge branch 'main' into dace_new_gpu_codegen
edopao May 27, 2026
670d254
Removed dublicate Memlet.
philip-paul-mueller May 28, 2026
b4357bf
Merge branch 'main' into dace_new_gpu_codegen
edopao May 28, 2026
77d999d
Merge branch 'main' into dace_new_gpu_codegen
edopao May 28, 2026
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 4 additions & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -100,7 +100,7 @@ dependencies = [
'click>=8.0.0',
'cmake>=3.22',
'cytoolz>=1.0.1',
'dace>=2.0.0a3',
'dace==2.3.7',
'deepdiff>=8.1.0',
'devtools>=0.6',
'factory-boy>=3.3.3',
Expand Down Expand Up @@ -476,6 +476,9 @@ url = 'https://gridtools.github.io/pypi/'
# dace = {index = "gridtools"}
[tool.uv.sources]
atlas4py = {index = "test.pypi"}
dace = [
{git = "https://github.com/philip-paul-mueller/dace", branch = "phimuell__new-gpu-codegen-dev"}
]

# -- versioningit --
[tool.versioningit]
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -109,6 +109,9 @@ def set_dace_config(
# This setting allows to throw an exception if any implicit Copy-Map slips thorugh.
dace.Config.set("compiler.cuda.allow_implicit_memlet_to_map", value=False)

# Use the new GPU code generator
dace.Config.set("compiler.cuda.implementation", value="experimental")

# In some stencils, for example `apply_diffusion_to_w`, the cuda codegen messes
# up with the cuda streams, i.e. it allocates N streams but uses N+1. The first
# idea was to use just one stream. However, even in that case the generator
Expand Down
7 changes: 3 additions & 4 deletions uv.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.