HBASE-28814 Add OpenLineage reporting support for Spark connector#135
Open
ddebowczyk92 wants to merge 1 commit intoapache:masterfrom
Open
HBASE-28814 Add OpenLineage reporting support for Spark connector#135ddebowczyk92 wants to merge 1 commit intoapache:masterfrom
ddebowczyk92 wants to merge 1 commit intoapache:masterfrom
Conversation
|
💔 -1 overall
This message was automatically generated. |
164f9ac to
ba0f9b5
Compare
|
💔 -1 overall
This message was automatically generated. |
ba0f9b5 to
1c8b91a
Compare
|
🎊 +1 overall
This message was automatically generated. |
1c8b91a to
2d3690c
Compare
|
🎊 +1 overall
This message was automatically generated. |
|
@petersomogyi @ndimiduk @NihalJain hey, as HBase Committers active in this repository, could you find the time to take a look at this PR and provide any feedback? Thanks from another Apache committer 🙂 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR introduces OpenLineage support to the Spark HBase connector. The following changes and enhancements have been made:
Integration with OpenLineage: Implemented the
LineageRelationProviderandLineageRelationinterfaces in theDefaultSourceandHBaseRelationclasses, respectively, to provide input and output dataset identifiers.Metadata Enrichment: Enhanced the connector to publish detailed lineage information, including datasets and operation facets.
Compatibility: Ensured compatibility with existing Spark jobs using the connector, allowing seamless lineage tracking without requiring significant modifications.
Key Benefits:
Please review the changes and provide feedback. Your input is valuable in ensuring the robustness and utility of this integration.