-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[HUDI-2397] Add --enable-sync
parameter
#3608
[HUDI-2397] Add --enable-sync
parameter
#3608
Conversation
@pratyakshsharma is
=> Is it okay to replace this to |
@pratyakshsharma there are quite a few requests around the multi delta streamer. Are you still interested in helping improve it and be point person for all feature contribution? please let me know |
Big yes to that :) . Please let me know if I can help for anything. |
@djKooks Thank you for raising this. Will check this today. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is it okay to replace this to enableMetaSync?
Yes we can replace it with enableMetaSync here.
Small comments, rest looks good. :)
@@ -138,7 +139,7 @@ public void testMultiTableExecutionWithKafkaSource() throws IOException { | |||
testUtils.sendMessages(topicName1, Helpers.jsonifyRecords(dataGenerator.generateInsertsAsPerSchema("000", 5, HoodieTestDataGenerator.TRIP_SCHEMA))); | |||
testUtils.sendMessages(topicName2, Helpers.jsonifyRecords(dataGenerator.generateInsertsAsPerSchema("000", 10, HoodieTestDataGenerator.SHORT_TRIP_SCHEMA))); | |||
|
|||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(PROPS_FILENAME_TEST_SOURCE1, dfsBasePath + "/config", JsonKafkaSource.class.getName(), false); | |||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(PROPS_FILENAME_TEST_SOURCE1, dfsBasePath + "/config", JsonKafkaSource.class.getName(), false, true); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let us keep enableMetaSync as false where enableHiveSync is also false? Otherwise it might lead to confusion.
@@ -187,7 +188,7 @@ public void testMultiTableExecutionWithParquetSource() throws IOException { | |||
// add only common props. later we can add per table props | |||
String parquetPropsFile = populateCommonPropsAndWriteToFile(); | |||
|
|||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(parquetPropsFile, dfsBasePath + "/config", ParquetDFSSource.class.getName(), false, | |||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(parquetPropsFile, dfsBasePath + "/config", ParquetDFSSource.class.getName(), false, true, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ditto
@@ -218,7 +219,7 @@ public void testMultiTableExecutionWithParquetSource() throws IOException { | |||
|
|||
@Test | |||
public void testTableLevelProperties() throws IOException { | |||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(PROPS_FILENAME_TEST_SOURCE1, dfsBasePath + "/config", TestDataSource.class.getName(), false); | |||
HoodieMultiTableDeltaStreamer.Config cfg = TestHelpers.getConfig(PROPS_FILENAME_TEST_SOURCE1, dfsBasePath + "/config", TestDataSource.class.getName(), false, true); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ditto
@pratyakshsharma thanks for comment. Updated 🙏 |
LGTM. Will merge it once CI passes. Thank you for your contribution @djKooks |
@pratyakshsharma thanks for help~please share if there's other features needs to be improved/fixed 🙏 |
Definitely! You can search for jiras with newbie label and pick them up one by one :) |
@pratyakshsharma This broke master due to a failing test. Please be careful about merging. by ensuring the CI in azure actually passes before you land. #3654 I am going to land this soon, which I think fixes this. |
Tips
What is the purpose of the pull request
HoodieMultiTableDeltaStreamer is just a wrapper on top of HoodieDeltaStreamer and
--enable-sync
parameter needs to be introduced there so that both the streamer classes are in sync.Verify this pull request
enableMetaSync
config forHoodieMultiTableDeltaStreamer
, to sync withHoodieDeltaStreamer
Committer checklist
Has a corresponding JIRA in PR title & commit
Commit message is descriptive of the change
CI is green
Necessary doc changes done or have another open PR
For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.