[core] introduce Placeholder for Blob File Format by steFaiz · Pull Request #7889 · apache/paimon

steFaiz · 2026-05-18T11:16:56Z

Purpose

This is the first part of #7881
Including:

Bump Blob File Format to V2, introducing a PlaceHolder Blob.
Introduce a fallbackReader for blob to skip placeholders. This is a two-level abstraction:
a. At first, all data files will be divided according to max_seq_num
b. within each group, create a sequential reader to logically concat files and fill missing gaps. For example: If the full row range of normal files is [0, 100], but some group only have one file with range [20, 80], the output is: [0, 19] -> filled with placeholders; [20, 80] -> records from files; [81, 100] -> filled with placeholders.
c. create readers for each group, and read the blob from the max group whose value is NOT a placeholder.

The mechanism can be illustrated as below:

Tests

ITCase and Unit tests

JingsongLi · 2026-05-20T09:34:41Z

+     * The placeholder blob, mainly for blob update in data-evolution. It should never be exposed to
+     * users.
+     */
+    Blob PLACE_HOLDER =


This is strange, maybe just use NULL as place holder?

Thanks for your advise! But in #7125 we supports storing nulls in blob file. I'm not clear how to distinguish placeholders and native NULLs if so.

From the semantics, NULLs are exposed to users, users know that they store some nulls. But placeholders are fully internal used, users should never be aware about them. If users set some rows as nulls, we may fallback those rows to earlier versions, this is not expected in our design.

Could you please give me some advise?

Perhaps you can consider using row number in blob to determine how to merge? You can just return valid blobs with row number.

The row number is actually the primary key.

I understand that you not only need this class for reading, but also for writing. If you skip these elements, the changes will be significant.

I thin you can just introduce a BlobPlaceHolder implements Blob, Serializable for this, use instance of is better.

Thanks! I'll modify my code!

JingsongLi

Review Comments for PR #7889

Overall this is a well-designed change with thorough tests. The sequence-group fallback mechanism for blob placeholders is a solid approach. A few observations:

Issues

1. Typo in comment (DataEvolutionSplitRead.java)

} else if (bunch instanceof BlobFileBunch) {
    // for blob funch, fallback on placeholders

→ "funch" should be "bunch"

2. Potential resource leak in BlobFallbackRecordReader.readBatch()

for (int i = 0; i < groupReaders.size(); i++) {
    RecordIterator<InternalRow> iterator = groupReaders.get(i).readBatch();
    if (iterator == null) {
        return null;  // ← iterators[0..i-1] are not released
    }
    iterators[i] = iterator;
}

If the k-th reader returns null, the already-obtained iterators [0, k-1] will never have releaseBatch() called. Should release them before returning null.

3. Memory pressure from ForceSingleBatchReader wrapping all group readers

Each sequence group is fully materialized into memory via ForceSingleBatchReader. When there are many sequence groups with large row ranges, this could cause significant memory pressure, especially when blob-as-descriptor is disabled and actual blob data is loaded. The TODO comment acknowledges this - just want to confirm this is acceptable for the first iteration.

4. Singleton placeholder row reuse in BlobSequenceGroupRecordReader

private InternalRow placeHolderRow() {
    if (placeholderRow == null) {
        GenericRow row = new GenericRow(readRowType.getFieldCount());
        row.setField(blobIndex, BlobPlaceholder.INSTANCE);
        placeholderRow = row;
    }
    return placeholderRow;
}

This returns the same mutable GenericRow instance for all placeholder positions. It works here because ForceSingleBatchReader copies the data, but it's fragile - a future caller that holds references to returned rows would see aliased data. Consider adding a comment noting this intentional reuse.

5. BlobFileBunch doesn't validate schemaId across files (by design?)

VectorFileBunch.add() enforces file.schemaId() == files.get(0).schemaId(), but BlobFileBunch.add() only checks writeCols. This seems intentional since blob files from different sequences naturally have different schemas, but worth confirming.

6. DataEvolutionFileReader contract relaxation

- checkArgument(readers != null && readers.length > 1, "Readers should be more than 1");
+ checkArgument(readers != null && readers.length >= 1, "should not pass empty readers.");

This relaxes the precondition for all callers of DataEvolutionFileReader, not just the blob path. Is there a case where a single reader is passed from non-blob paths? If not, consider keeping > 1 for non-blob scenarios or documenting when single-reader is expected.

Minor / Style

In BlobFallbackRecordReaderTest, the ReadResult.add() method silently treats placeholder rows differently (counts them vs collecting rowIds). This is fine for testing but the test would be more explicit if assertions on placeholder count were always paired with total row count assertions.
The fixedBlobBytes helper in BlobUpdateTest allocates 2 * 1024 * 124 bytes (≈248KB). Was 2 * 1024 * 1024 (2MB) intended? Or is this intentionally smaller to keep tests fast?

Positive

The separation of SpecialFieldBunch into BlobFileBunch and VectorFileBunch is a good refactoring - they have fundamentally different semantics now.
The backward-compatible version 1 reading test (testReadLegacyVersionOneBlobFile) is a nice addition.
The BlobSequenceGroupRecordReader javadoc with ASCII art is very helpful for understanding the complex layout.
Python-side changes properly mirror the Java changes with appropriate error handling for unsupported placeholder in read_arrow_batch.

steFaiz marked this pull request as draft May 18, 2026 11:17

steFaiz changed the title ~~[core] introduce Placeholder for Blob File Format~~ [wip][core] introduce Placeholder for Blob File Format May 18, 2026

steFaiz marked this pull request as ready for review May 19, 2026 06:19

steFaiz changed the title ~~[wip][core] introduce Placeholder for Blob File Format~~ [core] introduce Placeholder for Blob File Format May 19, 2026

JingsongLi reviewed May 20, 2026

View reviewed changes

steFaiz force-pushed the placeholder_blob branch from 5b22d7d to 355a05b Compare May 22, 2026 15:25

[core] introduce Placeholder for Blob File Format

b2cc640

steFaiz force-pushed the placeholder_blob branch from 355a05b to b2cc640 Compare May 22, 2026 15:34

steFaiz added 2 commits May 22, 2026 23:50

fix tests

29781d7

fix tests

5fdb965

JingsongLi reviewed May 23, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] introduce Placeholder for Blob File Format#7889

[core] introduce Placeholder for Blob File Format#7889
steFaiz wants to merge 3 commits into
apache:masterfrom
steFaiz:placeholder_blob

steFaiz commented May 18, 2026 •

edited

Loading

Uh oh!

JingsongLi May 20, 2026

Uh oh!

steFaiz May 20, 2026

Uh oh!

JingsongLi May 20, 2026

Uh oh!

JingsongLi May 20, 2026

Uh oh!

JingsongLi May 20, 2026

Uh oh!

steFaiz May 20, 2026

Uh oh!

JingsongLi left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

steFaiz commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Tests

Uh oh!

JingsongLi May 20, 2026

Choose a reason for hiding this comment

Uh oh!

steFaiz May 20, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi May 20, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi May 20, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi May 20, 2026

Choose a reason for hiding this comment

Uh oh!

steFaiz May 20, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi left a comment

Choose a reason for hiding this comment

Review Comments for PR #7889

Issues

Minor / Style

Positive

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

steFaiz commented May 18, 2026 •

edited

Loading