[HUDI-5409] Avoid file index and use fs view cache in COW input format #7493

Merged (2 commits) on Dec 17, 2022

@codope (Member) commented on Dec 17, 2022

Change Logs

This PR falls back to the original code path, which uses the fs view cache as in 0.10.1 and earlier, instead of creating a file index.

Impact

Query engines that use the initial InputFormat-based integration will no longer use the file index. Instead, they fetch file statuses directly from the fs view cache.
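
To make the pattern concrete, here is a minimal sketch of a per-table view cache. This is not Hudi code: `FsViewCacheSketch`, `TableFsView`, `buildView`, and `listStatus` are hypothetical names, and the hard-coded file listing stands in for a real file system listing. It only illustrates the idea of building a view once per table base path and serving later file-status lookups from the cache.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

public class FsViewCacheSketch {

    /** Hypothetical stand-in for a table's file system view: partition -> base file paths. */
    static final class TableFsView {
        private final Map<String, List<String>> filesByPartition;

        TableFsView(Map<String, List<String>> filesByPartition) {
            this.filesByPartition = filesByPartition;
        }

        List<String> latestBaseFiles(String partition) {
            return filesByPartition.getOrDefault(partition, List.of());
        }
    }

    /** Counts how many times a view was built, to show the effect of caching. */
    public static final AtomicInteger viewBuilds = new AtomicInteger();

    /** One cached view per table base path (assumption: views are safe to share across calls). */
    private static final Map<String, TableFsView> CACHE = new ConcurrentHashMap<>();

    /** Hypothetical "expensive" step; a real implementation would list the table's files. */
    static TableFsView buildView(String basePath) {
        viewBuilds.incrementAndGet();
        Map<String, List<String>> files = new HashMap<>();
        files.put("2022/12/17", List.of(basePath + "/2022/12/17/f1.parquet"));
        return new TableFsView(files);
    }

    /** Returns file paths for the requested partitions, reusing the cached view if present. */
    public static List<String> listStatus(String basePath, List<String> partitions) {
        TableFsView view = CACHE.computeIfAbsent(basePath, FsViewCacheSketch::buildView);
        List<String> result = new ArrayList<>();
        for (String partition : partitions) {
            result.addAll(view.latestBaseFiles(partition));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> partitions = List.of("2022/12/17");
        System.out.println(listStatus("/tmp/hypothetical_table", partitions));
        System.out.println(listStatus("/tmp/hypothetical_table", partitions));
        System.out.println("views built: " + viewBuilds.get());
    }
}
```

In this sketch the second `listStatus` call for the same base path does not rebuild the view, which is the property the fs view cache path relies on.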

Risk level (write none, low, medium or high below)

medium

Documentation Update

Describe any necessary documentation update if there is any new feature, config, or user-facing change

  • The config description must be updated if new configs are added or the default values of configs are changed
  • Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
    ticket number here, and follow the instructions to make
    changes to the website.

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@nsivabalan added the priority:blocker and release-0.12.2 labels on Dec 17, 2022
@codope changed the title from "Avoid file index and use fs view cache in COW input format" to "[HUDI-5409] Avoid file index and use fs view cache in COW input format" on Dec 17, 2022
@nsivabalan (Contributor) commented:

There is one flaky test (col stats). CI is green otherwise.
[Attached image: Screen Shot 2022-12-17 at 9 30 14 AM]

@nsivabalan merged commit cc1c1e7 into apache:master on Dec 17, 2022
nsivabalan pushed a commit that referenced this pull request Dec 17, 2022
codope added a commit to codope/hudi that referenced this pull request Dec 21, 2022
codope added a commit to codope/hudi that referenced this pull request Dec 23, 2022
codope added a commit to codope/hudi that referenced this pull request Dec 23, 2022
codope added a commit that referenced this pull request Dec 24, 2022
nsivabalan pushed a commit to nsivabalan/hudi that referenced this pull request Mar 22, 2023
fengjian428 pushed a commit to fengjian428/hudi that referenced this pull request Apr 5, 2023
fengjian428 pushed a commit to fengjian428/hudi that referenced this pull request Apr 5, 2023