[SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values #54267

miland-db · 2026-02-11T11:30:49Z

What changes were proposed in this pull request?

Added a null-safe findPivotIndex lookup method that returns -1 for null keys on the TreeMap path, since null can never be a valid TreeMap key. The HashMap path (atomic types) is unchanged, as it handles null keys safely and allows null as a valid pivot value.

Why are the changes needed?

When a PIVOT query uses a non-atomic pivot column (struct, array), PivotFirst stores pivot values in a TreeMap with a comparison-based ordering. If the pivot column contains null values (e.g., from a GROUP BY null group), the TreeMap.getOrElse lookup calls compare(null, existingKey), which throws a NullPointerException.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Added unit test in DataFramePivotSuite.scala.

Was this patch authored or co-authored using generative AI tooling?

No.
Please refer to the ASF Generative Tooling Guidance for details.
-->

Init commit

f6975af

miland-db changed the title ~~Init commit~~ [SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values Feb 11, 2026

cloud-fan approved these changes Feb 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values #54267

[SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values #54267

miland-db commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values #54267

Are you sure you want to change the base?

[SPARK-55483] Fix NPE in PivotFirst when pivot column is a non-atomic type with null values #54267

Conversation

miland-db commented Feb 11, 2026

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants