[server] When applying projection pushdown, return empty records instead of skipping them to ensure offset movement in the client. #2370

loserwang1024 · 2026-01-14T02:53:49Z

Purpose

Linked issue: the detailed analysis is in #2369.

Brief change log

Do not filter out empty log records when applying projection pushdown; return them so the client can advance its offset.
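For reviewers who want the failure mode in miniature, here is a rough sketch of why skipping empty batches can stall a reader. It is a toy model, not Fluss code: `EmptyBatchOffsetSketch`, `Batch`, `fetch`, `consume`, and the `maxBatches` limit (standing in for the per-bucket fetch size limit) are all hypothetical names made up for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Toy model only; none of these types exist in Fluss. */
final class EmptyBatchOffsetSketch {

    /** A batch ending at nextOffset; recordCount == 0 models an empty batch. */
    static final class Batch {
        final long nextOffset;
        final int recordCount;

        Batch(long nextOffset, int recordCount) {
            this.nextOffset = nextOffset;
            this.recordCount = recordCount;
        }
    }

    /** One batch per offset: a data batch, two empty batches, another data batch. */
    static List<Batch> sampleLog() {
        return Arrays.asList(
                new Batch(1, 1), new Batch(2, 0), new Batch(3, 0), new Batch(4, 1));
    }

    /** "Server": scans at most maxBatches batches after fetchOffset. */
    static List<Batch> fetch(
            List<Batch> log, long fetchOffset, int maxBatches, boolean skipEmpty) {
        List<Batch> response = new ArrayList<>();
        int scanned = 0;
        for (Batch batch : log) {
            if (batch.nextOffset <= fetchOffset) {
                continue; // already consumed by the client
            }
            if (scanned == maxBatches) {
                break; // fetch size limit reached
            }
            scanned++;
            if (skipEmpty && batch.recordCount == 0) {
                continue; // old behavior in this model: drop the empty batch
            }
            response.add(batch);
        }
        return response;
    }

    /** "Client": advances its offset only from batches it actually receives. */
    static long consume(List<Batch> log, int maxBatches, boolean skipEmpty, int maxPolls) {
        long offset = 0;
        long endOffset = log.get(log.size() - 1).nextOffset;
        for (int poll = 0; poll < maxPolls && offset < endOffset; poll++) {
            for (Batch batch : fetch(log, offset, maxBatches, skipEmpty)) {
                offset = batch.nextOffset;
            }
        }
        return offset;
    }
}
```

In this model, `consume(sampleLog(), 2, true, 100)` stops at offset 1 because every later fetch window contains only empty batches and the response never carries a new offset, while `consume(sampleLog(), 2, false, 100)` reaches the end offset 4.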

Tests

org.apache.fluss.client.table.FlussTableITCase#testFirstRowMergeEngine

API and Format

Documentation

wuchong

@loserwang1024, I left some comments on optimizing the test.

wuchong · 2026-01-20T08:59:57Z

fluss-server/src/test/java/org/apache/fluss/server/kv/KvTabletTest.java

                DEFAULT_COMPRESSION);
    }

+    private MemoryLogRecords logRecords(


not used, remove

wuchong · 2026-01-20T09:23:06Z

fluss-client/src/test/java/org/apache/fluss/client/table/FlussTableITCase.java

+        // With this setup, if empty batches are skipped, the read gets stuck forever.
+        conf.set(ConfigOptions.REMOTE_LOG_TASK_INTERVAL_DURATION, Duration.ZERO);
+        conf.set(
+                ConfigOptions.LOG_SEGMENT_FILE_SIZE,
+                new MemorySize(5 * V0_RECORD_BATCH_HEADER_SIZE));
+        conf.set(
+                ConfigOptions.CLIENT_SCANNER_LOG_FETCH_MAX_BYTES_FOR_BUCKET,
+                new MemorySize(5 * V0_RECORD_BATCH_HEADER_SIZE));
+        final FlussClusterExtension flussClusterExtension =
+                FlussClusterExtension.builder()
+                        .setNumOfTabletServers(3)
+                        .setClusterConf(conf)
+                        .build();
+        flussClusterExtension.start();


In my local environment, the original test takes about 15 seconds, but it increases to 55 seconds after applying this PR. I think we should optimize it. Here are several optimization opportunities:

The test starts a new Fluss cluster in addition to the existing one from the test base class, which adds significant overhead.

The issue only occurs when projection pushdown is enabled (doProjection = true), so there’s no need to test doProjection = false.

The table bucket count can be reduced to 1, as it’s sufficient for reproducing the problem.

The number of empty record batches can be reduced from 10 to 2. This should still reliably reproduce the issue while avoiding 10 unnecessary RPC round-trips that dominate the runtime.

I suggest introducing a dedicated test class like CustomFlussClusterITCase (without extending ClientToServerITCaseBase) for tests that require manual cluster management, and adding a focused test method such as testProjectionPushdownWithEmptyBatches that incorporates all these optimizations.

[server] When applying projection pushdown, return empty records inst…

072b0f2

…ead of skipping them to ensure offset movement in the client.

loserwang1024 requested review from swuferhong and wuchong January 14, 2026 07:09

wuchong reviewed Jan 20, 2026

wuchong linked an issue Jan 20, 2026 that may be closed by this pull request

FirstRowMergeEngine: Read Stuck for Hours Due to Empty Log Entries in Projection Pushdown Queries #2369

Open

2 tasks
