[spark] Fix empty projection causing Invalid metadata length for COUNT(*)/COUNT(1) by Kaixuan-Duan · Pull Request #3227 · apache/fluss

Kaixuan-Duan · 2026-04-28T13:54:13Z

Purpose

Linked issue: close #2724

When Spark pushes down an empty column projection for COUNT(*)/COUNT(1) queries, the Fluss server fails with IllegalStateException("Invalid metadata length") in FileLogProjection.project(), causing the client to retry indefinitely and the query to hang.

This PR fixes the issue from two sides:

Server side: reject empty projection early with a clear InvalidColumnProjectionException instead of crashing with an internal error.
Spark connector side: fall back to projecting the first column when Spark pushes down an empty projection, so the row count is preserved without fetching unnecessary data.
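The connector-side fallback described above can be sketched as follows. This is a Java illustration only, not the PR's Scala code: `ProjectionFallbackSketch` and `effectiveProjection` are hypothetical names, and the sketch assumes the table has at least one column so that index 0 is valid.

```java
import java.util.Arrays;

// Illustrative sketch only; names are hypothetical and not taken from the Fluss code base.
final class ProjectionFallbackSketch {

    // Returns the column indexes to request from the server.
    // An empty pushed-down projection (as for COUNT(*)/COUNT(1)) falls back to the
    // first column, so one value per row is still returned and the row count is preserved.
    static int[] effectiveProjection(int[] pushedDown) {
        if (pushedDown.length == 0) {
            return new int[] {0};
        }
        return pushedDown.clone();
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(effectiveProjection(new int[0])));        // prints [0]
        System.out.println(Arrays.toString(effectiveProjection(new int[] {2, 0}))); // prints [2, 0]
    }
}
```

Projecting a single column rather than the full schema keeps the number of returned rows unchanged while limiting the data that is read.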

Brief change log

FileLogProjection#setCurrentProjection: add a guard that throws InvalidColumnProjectionException when selectedFieldPositions is empty.
FileLogProjectionTest: add testEmptyProjectionRejectsWithClearError to verify the server-side guard.
FlussBatch#projection / FlussMicroBatchStream#projection: when readSchema yields an empty projection, fall back to Array(0) (first column).
SparkLogTableReadTest: add COUNT(*) and COUNT(1) end-to-end tests for log tables.
SparkPrimaryKeyTableReadTest: add COUNT(*) end-to-end test for primary key tables.
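The server-side guard from the change log might look roughly like the sketch below. It is a minimal, self-contained approximation: the nested `InvalidColumnProjectionException` is a local stand-in for the Fluss exception of the same name, and `validateProjection` and the error message are hypothetical, not the actual body of `FileLogProjection#setCurrentProjection`.

```java
// Illustrative sketch only; not the actual FileLogProjection implementation.
final class EmptyProjectionGuardSketch {

    // Local stand-in for Fluss's InvalidColumnProjectionException (assumption).
    static final class InvalidColumnProjectionException extends RuntimeException {
        InvalidColumnProjectionException(String message) {
            super(message);
        }
    }

    // Hypothetical helper: reject an empty projection up front with a clear error,
    // instead of failing later with an internal "Invalid metadata length" error.
    static void validateProjection(int[] selectedFieldPositions) {
        if (selectedFieldPositions.length == 0) {
            throw new InvalidColumnProjectionException(
                    "Empty column projection is not supported.");
        }
    }

    public static void main(String[] args) {
        validateProjection(new int[] {0, 2}); // non-empty projection passes
        try {
            validateProjection(new int[0]);
        } catch (InvalidColumnProjectionException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

Failing fast with a dedicated exception type gives the client a specific, descriptive error to act on, in place of the internal error that led to the indefinite retries described in the Purpose section.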

Tests

./mvnw -pl fluss-common -DskipTests=false -Dtest=FileLogProjectionTest#testEmptyProjectionRejectsWithClearError test

./mvnw -pl fluss-spark/fluss-spark-ut -am install -DskipTests
./mvnw -pl fluss-spark/fluss-spark-ut -Dsuites='org.apache.fluss.spark.SparkLogTableReadTest' test
./mvnw -pl fluss-spark/fluss-spark-ut -Dsuites='org.apache.fluss.spark.SparkPrimaryKeyTableReadTest' test

API and Format

Documentation


Yohahaha · 2026-04-30T02:54:02Z

            SchemaGetter schemaGetter,
            ArrowCompressionInfo compressionInfo,
            int[] selectedFieldPositions) {
+        // Empty projection (selectedFieldPositions.length == 0) is currently not supported on the


It would be good to also verify this behavior and the fix in the Flink connector if needed.

luoyuxia · 2026-04-30T03:00:55Z

@Kaixuan-Duan Hi, this seems to be the same as #2725. cc @beryllw

Yohahaha · 2026-04-30T03:06:10Z

    }
  }

+  test("Spark Read: COUNT(*) without filter") {


What happens when there is a filter?

[spark] Fix empty projection causing Invalid metadata length for COUNT(*)/COUNT(1) · 450c493

Kaixuan-Duan wants to merge 1 commit into apache:main from Kaixuan-Duan:issue-2724-empty-projection
