[Cpp API Compatibility] Fix `CUDAContext.h` to align with Pytorch by youge325 · Pull Request #78584 · PaddlePaddle/Paddle

youge325 · 2026-04-04T05:56:52Z

PR Category

Execute Infrastructure

PR Types

Bug fixes

Description

修复 #include <ATen/cuda/CUDAContext.h> 时没有 #include <c10/cuda/CUDAStream.h> 的错误，用于 DeepEP 的编译

对齐 getStreamFromPool 函数，详细兼容架构文档已在 PFCCLab/PaddleCppAPITest#59 中更新

是否引起精度变化

否

paddle-bot · 2026-04-04T05:56:59Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copilot

Pull request overview

This PR updates the ATen CUDA compat header(s) to better match PyTorch’s include surface so that #include <ATen/cuda/CUDAContext.h> also brings in the CUDA stream definitions needed by downstream builds (e.g., DeepEP).

Changes:

Add missing transitive includes (ATen/cuda/Exceptions.h, c10/cuda/CUDAStream.h) to ATen/cuda/CUDAContext.h.
Update CUDAContext.cpp to include ATen/cuda/CUDAContext.h instead of CUDAContextLight.h.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

File	Description
paddle/phi/api/include/compat/ATen/cuda/CUDAContext.h	Adds missing includes for CUDA stream / exceptions, but changes conditional include behavior.
paddle/phi/api/include/compat/ATen/cuda/CUDAContext.cpp	Switches to including the main CUDAContext header.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-04T06:00:57Z

paddle/phi/api/include/compat/ATen/cuda/CUDAContext.h

 #include <ATen/cuda/CUDAContextLight.h>
-#endif
+
+#include <ATen/cuda/Exceptions.h>
+#include <c10/cuda/CUDAStream.h>


CUDAContext.h used to only include CUDAContextLight.h when PADDLE_WITH_CUDA/PADDLE_WITH_HIP was enabled. After this change the header unconditionally includes CUDAContextLight.h and c10/cuda/CUDAStream.h, which pulls in CUDA-only APIs (e.g., cudaError_t / cudaStream_t) and will break compilation in non-CUDA/HIP builds that still include this header transitively. Please restore a build-flag guard around the CUDA-specific includes (or provide CPU stubs matching the previous behavior) while still adding c10/cuda/CUDAStream.h for CUDA/HIP builds.

Please restore a build-flag guard around the CUDA-specific includes (or provide CPU stubs matching the previous behavior) while still adding c10/cuda/CUDAStream.h for CUDA/HIP builds.

已看到最新提交在 ATen/cuda/CUDAContext.h 恢复了 PADDLE_WITH_CUDA/HIP 外层 guard，这个兼容性问题已经修复。我这边再等当前这轮 CI 跑完做最终确认。

ShigureNyako

这次 PR 的主目标我理解是：让 #include <ATen/cuda/CUDAContext.h> 的 include surface 更接近 PyTorch，同时补齐 c10::cuda::CUDAStream 相关实现，解决 DeepEP 侧的编译接入问题。

我这边先卡一个兼容性问题：当前改动把 ATen/cuda/CUDAContext.h 改成了无条件包含 c10/cuda/CUDAStream.h，但 c10/cuda/CUDAStream.h 在非 CUDA/HIP 构建下并没有完整兜住 cudaStream_t 等类型暴露，CPU-only/非 GPU 构建存在头文件编译回退风险。这个问题建议先修掉，再继续看后续是否还有需要补的地方。

CI 方面我看到目前还有若干 job 在运行；Linux-IXUCA 已失败，日志看起来是下游补丁未能 apply，未必是这组改动直接引起，但当前也还不是全绿状态。

ShigureNyako · 2026-04-04T10:28:06Z

paddle/phi/api/include/compat/c10/cuda/CUDAStream.h

+#if defined(PADDLE_WITH_CUDA)
+#include <cuda_runtime_api.h>
+#elif defined(PADDLE_WITH_HIP)
+#include <hip/hip_runtime.h>
+#endif
+
 #include <ostream>
-#include <tuple>

-#include "paddle/phi/api/include/context_pool.h"
 #include "paddle/phi/backends/gpu/gpu_info.h"
 #include "paddle/phi/common/place.h"


这里有一个兼容性回退：这一版把 paddle/phi/core/cuda_stream.h 去掉后，cudaStream_t 只会在 PADDLE_WITH_CUDA/HIP 下通过 runtime header 引入；但同一个头里 CUDAStream 类、make_cuda_stream、getStreamFromExternal 等声明仍然会在所有构建里直接暴露 cudaStream_t。再叠加 ATen/cuda/CUDAContext.h 现在无条件 #include <c10/cuda/CUDAStream.h>，非 CUDA/HIP 构建一旦传递包含到这里，就会遇到未声明类型的编译错误。

建议至少二选一处理：

恢复 ATen/cuda/CUDAContext.h 的外层 build flag guard；

或者保留旧实现那种可在非 CUDA/HIP 下成立的前置声明 / stub，使 c10/cuda/CUDAStream.h 本身仍然是“可被安全包含”的。

youge325 · 2026-04-04T13:07:32Z

/re-run all-failed

ShigureNyako

我重点复核了上一轮卡住的 blocker：ATen/cuda/CUDAContext.h 现在已经恢复 PADDLE_WITH_CUDA/HIP 外层 guard，非 CUDA/HIP 构建不会再被动包含 c10/cuda/CUDAStream.h，之前提到的头文件可包含性回退已经解除。

另外我继续看了这轮后续补丁：CUDAStream 相关接口形状仍然是在向 PyTorch 对齐，当前没有再看到新的 BREAKING include 风险或“半迁移”状态；最新的 fix dcu again 主要是在 CUDAException.h 里把 runtime API 依赖显式补齐，属于构建兼容性收尾。CI 方面，Linux-DCU / Build 已恢复通过；Linux-IXUCA 仍然失败，但结合前面日志更像下游 patch/apply 或环境侧问题，不像这组代码本身引入的新 blocker。其余还有少量 job 在跑，不过按当前代码状态我这边可以通过。

LGTMeow
LGTMeow 🎰+🐾

codecov-commenter · 2026-04-04T17:00:58Z

Codecov Report

❌ Patch coverage is 94.44444% with 4 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (develop@2202902). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
...dle/phi/api/include/compat/c10/cuda/CUDAStream.cpp	94.44%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             develop   #78584   +/-   ##
==========================================
  Coverage           ?   94.44%           
==========================================
  Files              ?        1           
  Lines              ?       72           
  Branches           ?        0           
==========================================
  Hits               ?       68           
  Misses             ?        4           
  Partials           ?        0

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

[Cpp API Compatibility] Fix CUDAContext.h to align with Pytorch

043933d

Copilot AI review requested due to automatic review settings April 4, 2026 05:56

paddle-bot bot added the contributor External developers label Apr 4, 2026

Copilot started reviewing on behalf of youge325 April 4, 2026 05:57 View session

youge325 mentioned this pull request Apr 4, 2026

Use clean at::cuda::CUDAStream but not wrapped with std::optional PFCCLab/DeepEP#11

Open

Copilot AI reviewed Apr 4, 2026

View reviewed changes

fix

a68b7d4

youge325 force-pushed the cRefactor branch from c65ab5f to a68b7d4 Compare April 4, 2026 08:15

youge325 added 2 commits April 4, 2026 16:33

simplify getStreamFromPool

ffd2b86

implement getStreamFromPool and getStreamFromExternal in CUDAStream.cpp

6d71323

ShigureNyako suggested changes Apr 4, 2026

View reviewed changes

youge325 added 2 commits April 4, 2026 19:30

fix

dafd7e8

try to fix dcu

2e6b1b7

fix dcu again

ecb94cf

ShigureNyako approved these changes Apr 4, 2026

View reviewed changes

SigureMo approved these changes Apr 4, 2026

View reviewed changes

SigureMo merged commit 6533c69 into PaddlePaddle:develop Apr 4, 2026
84 of 85 checks passed

ShigureNyako mentioned this pull request Apr 4, 2026

[Cpp API Compatibility] Normalize adapted-from-PyTorch declarations #78590

Merged

youge325 deleted the cRefactor branch April 5, 2026 00:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Cpp API Compatibility] Fix `CUDAContext.h` to align with Pytorch#78584

[Cpp API Compatibility] Fix `CUDAContext.h` to align with Pytorch#78584
SigureMo merged 7 commits intoPaddlePaddle:developfrom
youge325:cRefactor

youge325 commented Apr 4, 2026 •

edited

Loading

Uh oh!

paddle-bot bot commented Apr 4, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 4, 2026

Uh oh!

ShigureNyako Apr 4, 2026

Uh oh!

ShigureNyako left a comment

Uh oh!

ShigureNyako Apr 4, 2026

Uh oh!

youge325 commented Apr 4, 2026

Uh oh!

ShigureNyako left a comment

Uh oh!

codecov-commenter commented Apr 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

youge325 commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

是否引起精度变化

Uh oh!

paddle-bot bot commented Apr 4, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

ShigureNyako Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

ShigureNyako left a comment

Choose a reason for hiding this comment

Uh oh!

ShigureNyako Apr 4, 2026

Choose a reason for hiding this comment

Uh oh!

youge325 commented Apr 4, 2026

Uh oh!

ShigureNyako left a comment

Choose a reason for hiding this comment

Uh oh!

codecov-commenter commented Apr 4, 2026

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

youge325 commented Apr 4, 2026 •

edited

Loading