Reduce the diff with the pytorch:main branch #3
Conversation
Pull request overview
This PR aims to reduce divergence from the upstream PyTorch torchcodec main branch by aligning several C++ core behaviors and operator registrations.
Changes:
- Implemented previously stubbed/unimplemented paths for audio resampler flush (`maybeFlushSwrBuffers`) and tensor-backed AVIO output (`AVIOToTensorContext::getOutputTensor`).
- Adjusted custom op registration blocks (including adding a `BackendSelect` registration block) and minor include/formatting alignment.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| src/torchcodec/_core/SingleStreamDecoder.cpp | Implements returning flushed resampled audio samples instead of throwing. |
| src/torchcodec/_core/custom_ops.cpp | Reorders includes and changes TORCH_LIBRARY_IMPL registrations (adds BackendSelect). |
| src/torchcodec/_core/AVIOTensorContext.cpp | Implements returning the encoded output tensor slice instead of throwing. |
```cpp
@@ -1419,11 +1419,8 @@ std::optional<torch::Tensor> SingleStreamDecoder::maybeFlushSwrBuffers() {
  auto actualNumRemainingSamples = swr_convert(
      swrContext_.get(), outputBuffers.data(), numRemainingSamples, nullptr, 0);
```
`swr_convert()` can return a negative value on error. Passing that directly as the length to `lastSamples.narrow(...)` will throw an unrelated/cryptic error (or potentially create an invalid slice). Add an explicit check for `actualNumRemainingSamples < 0` and surface a clear failure (e.g., `TORCH_CHECK(actualNumRemainingSamples >= 0, ...)`) before narrowing; optionally also handle the `== 0` case explicitly.
Suggested change:

```cpp
      swrContext_.get(), outputBuffers.data(), numRemainingSamples, nullptr, 0);
  TORCH_CHECK(
      actualNumRemainingSamples >= 0,
      "swr_convert() failed while flushing buffered audio samples: ",
      actualNumRemainingSamples);
  if (actualNumRemainingSamples == 0) {
    return std::nullopt;
  }
```
```cpp
TORCH_LIBRARY_IMPL(torchcodec_ns, BackendSelect, m) {
  m.impl("create_from_file", &create_from_file);
  m.impl("create_from_tensor", &create_from_tensor);
  m.impl("_create_from_file_like", &_create_from_file_like);
```
Registering `create_from_tensor` under `BackendSelect` is risky because it has a Tensor argument and the implementation assumes host-accessible memory (no `video_tensor.is_cpu()` check; it is passed into `AVIOFromTensorContext`, which reads via `data_ptr()`/`memcpy`). With this dispatch key change, CUDA tensors may no longer fail dispatch and could reach this CPU-only code path. Consider registering `create_from_tensor` under `CPU` (and only keeping Tensor-free factory ops like `create_from_file` / `_get_json_ffmpeg_library_versions` under `BackendSelect`), or add a strict CPU device check for `video_tensor` if `BackendSelect` is intentional.
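The device-check half of this suggestion can be sketched without libtorch. In the stand-alone example below, `FakeTensor` and the `Device` enum are illustrative stand-ins (not torchcodec's or PyTorch's types), showing the guard a Tensor-taking factory op would need before reading host memory:

```cpp
#include <cstdint>
#include <stdexcept>
#include <vector>

// Illustrative stand-in for a tensor: the real op receives a torch::Tensor
// whose storage may live on a CUDA device, where data_ptr()/memcpy on the
// host would be invalid.
enum class Device { CPU, CUDA };

struct FakeTensor {
  Device device;
  std::vector<uint8_t> bytes;  // host storage; empty for the CUDA case here
  bool is_cpu() const { return device == Device::CPU; }
};

// Sketch of the strict CPU check the review suggests if create_from_tensor
// stays registered under BackendSelect. Returns the byte count just so the
// happy path is observable.
size_t create_from_tensor_checked(const FakeTensor& video_tensor) {
  if (!video_tensor.is_cpu()) {
    throw std::invalid_argument(
        "create_from_tensor expects a CPU tensor; got a non-CPU device");
  }
  // Safe to read host memory from here on.
  return video_tensor.bytes.size();
}

// Helper so the rejection path can be checked with a plain assertion.
bool rejects_non_cpu() {
  try {
    create_from_tensor_checked(FakeTensor{Device::CUDA, {}});
    return false;
  } catch (const std::invalid_argument&) {
    return true;
  }
}
```

The alternative in the review (registering the op under the `CPU` dispatch key instead) makes the dispatcher itself reject non-CPU tensors, so no in-body check is needed; the guard above is the fallback when `BackendSelect` registration is intentional.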
Reduce the diff with the PyTorch main branch
Related links
- `c10::DispatchKey` and replace `dispatch_key_to_string` with `c10::toString` PaddlePaddle/Paddle#78525
- `std::optional` in error messages PaddlePaddle/Paddle#78521
- `BackendSelect` dispatch lookup in torch compat PaddlePaddle/Paddle#78591