Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[HUDI-8967] Add a config to fail job if duplicate data files detected during reconcileAgainstMarkers #12792

Merged
merged 1 commit into from
Feb 11, 2025

Conversation

lokeshj1703
Copy link
Contributor

Change Logs

The PR adds a config which fails the entire job if an invaild file is detected. The config is disabled by default.

Impact

NA

Risk level (write none, low medium or high below)

low

Documentation Update

NA

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

@github-actions github-actions bot added the size:S PR with lines of changes in (10, 100] label Feb 6, 2025
@lokeshj1703
Copy link
Contributor Author

@nsivabalan Can you please take a look?

@danny0405
Copy link
Contributor

danny0405 commented Feb 6, 2025

@lokeshj1703 Can you add some context in which scenario the user needs to enable it? Because in production, the duplication happens in high frequency.

@hudi-bot
Copy link

hudi-bot commented Feb 7, 2025

CI report:

Bot commands @hudi-bot supports the following commands:
  • @hudi-bot run azure re-run the last Azure build

@nsivabalan nsivabalan merged commit 2e06f50 into apache:master Feb 11, 2025
43 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size:S PR with lines of changes in (10, 100]
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants