Skip to content

PEP: Redesign multi topic realtime ingestion support #18739

@rseetham

Description

@rseetham

Multitopic realtime ingestion as it currently exists has some issues. In this PR, @Jackie-Jiang said

The multi-topic partition id overloading existing partition id has caused so many issues:

We are missing checks here and there
Stream with custom partition id fails (See #17260)
I think it is the time to revisit this design decision and avoid the troubles going forward. I can see 2 possible solutions:

Make partition_id part within the table name x_y where x is the topic id and y is the partition id. For single topic name, it remains the same
Leverage logical table for multiple topics, each topic a separate physical table

This is the PEP request to make those changes. I've detailed both suggestions in this doc:
https://docs.google.com/document/d/1dY6ZvaG_-FbTAQ5e1lxB4uvi8VGbR_WfCfCl5zD76A4/

Metadata

Metadata

Assignees

No one assigned

    Labels

    PEP-RequestPinot Enhancement Proposal request to be reviewed.real-timeRelated to realtime table ingestion and serving

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions