-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Common: use run manifest #81
Comments
Let's just go for this one. It will solve the problem and be much more robust than our consistency check... |
Just found another quick and dirty solution to consistency problem (not going to implement it as it is dangerous, just sharing). For example, last ETL had |
Makes sense @chuwy, thanks for sharing. |
This looks like a most bullet-proof solution against inconsistent S3. Instead of relying on listing S3, we can collect data in shredder (like we do for Snowflake) and write to external manifest (DynamoDB).
Unlike consistency check this does not add idle time and should be very reliable.
Should be optional to reduce maintainance routine for pipelines that don't suffer from inconsistency.
The text was updated successfully, but these errors were encountered: