-
Notifications
You must be signed in to change notification settings - Fork 180
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HBASE-28814 Add OpenLineage reporting support for Spark connector #135
base: master
Are you sure you want to change the base?
Conversation
💔 -1 overall
This message was automatically generated. |
164f9ac
to
ba0f9b5
Compare
💔 -1 overall
This message was automatically generated. |
ba0f9b5
to
1c8b91a
Compare
🎊 +1 overall
This message was automatically generated. |
1c8b91a
to
2d3690c
Compare
🎊 +1 overall
This message was automatically generated. |
@petersomogyi @ndimiduk @NihalJain hey, as HBase Committers active in this repository, could you find the time to take a look at this PR and provide any feedback? Thanks from another Apache committer 🙂 |
This PR introduces OpenLineage support to the Spark HBase connector. The following changes and enhancements have been made:
Integration with OpenLineage: Implemented the
LineageRelationProvider
andLineageRelation
interfaces in theDefaultSource
andHBaseRelation
classes, respectively, to provide input and output dataset identifiers.Metadata Enrichment: Enhanced the connector to publish detailed lineage information, including datasets and operation facets.
Compatibility: Ensured compatibility with existing Spark jobs using the connector, allowing seamless lineage tracking without requiring significant modifications.
Key Benefits:
Please review the changes and provide feedback. Your input is valuable in ensuring the robustness and utility of this integration.