[WIP][CORE] Pass Spark task attempt id and pool name from Java to native runtime/memory-manager by taiyang-li · Pull Request #12376 · apache/gluten

taiyang-li · 2026-06-26T04:02:16Z

What changes were proposed in this pull request?

This patch extends the JNI bridge between the Java/Scala layer and the native C++ layer to propagate Spark task-level identification information:

RuntimeJniWrapper.createRuntime: Added long taskAttemptId parameter
NativeMemoryManagerJniWrapper.create: Added String name (pool name) parameter
NativeMemoryManagerJniWrapper.hold: Added String name and long taskAttemptId parameters
NativeMemoryManagerJniWrapper.release: Added long taskAttemptId parameter

On the native C++ side:

Runtime gains a taskAttemptId_ member with getter/setter
MemoryManager gains a name_ member with getter/setter

The values are set via setters after object creation, so no changes are needed for existing backend subclasses (Velox, ClickHouse). The Factory typedefs and constructor signatures remain unchanged.

This makes per-task diagnostics and per-pool logging possible, and unblocks future backends (e.g. Bolt) that need task-level identification at the native layer.

How was this patch tested?

Static verification: All JNI method signatures match between Java and C++ (parameter count and types verified).
No changes to Velox/CH backend code - all existing subclasses compile unchanged.
The patch is purely additive (new members + getters/setters), with no behavioral changes for existing backends.
Full build verification is pending; this is a minimal infrastructure patch.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude claude-sonnet-4-6

…native runtime/memory-manager This patch extends RuntimeJniWrapper.createRuntime and the NativeMemoryManagerJniWrapper create/hold/release JNI methods with extra parameters that propagate the Spark task attempt id (and a memory-pool name) from the JVM down to the native side. The native Runtime and NativeMemoryManager just store the values and expose simple getters; no behavior change for existing Velox/CH backends. This makes per-task diagnostics and per-pool logging possible, and unblocks future backends (e.g. Bolt) that need task-level identification at the native layer. Generated-by: Claude claude-sonnet-4-6 Co-Authored-By: Aime <aime@bytedance.com> Change-Id: Iff1124ec0e8946f46dbbed096f9197ff45c9f433

github-actions Bot added the VELOX label Jun 26, 2026

taiyang-li changed the title ~~[CORE] Pass Spark task attempt id and pool name from Java to native runtime/memory-manager~~ [WIP][CORE] Pass Spark task attempt id and pool name from Java to native runtime/memory-manager Jun 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP][CORE] Pass Spark task attempt id and pool name from Java to native runtime/memory-manager#12376

[WIP][CORE] Pass Spark task attempt id and pool name from Java to native runtime/memory-manager#12376
taiyang-li wants to merge 1 commit into
apache:mainfrom
taiyang-li:extract/jni-pass-task-attempt-id

taiyang-li commented Jun 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

taiyang-li commented Jun 26, 2026

What changes were proposed in this pull request?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant