Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RDB Loader: fix eventual consistency problem #3113

Closed
alexanderdean opened this issue Feb 14, 2017 · 1 comment
Closed

RDB Loader: fix eventual consistency problem #3113

alexanderdean opened this issue Feb 14, 2017 · 1 comment
Assignees
Milestone

Comments

@alexanderdean
Copy link
Member

We have a problem with users who a) have large volumes and b) have high event/context cardinalities where:

  • The StorageLoader code for discovering folders in S3 of shredded entities detects already-deleted "ghost" folders (this is due to S3's notorious eventual consistency problems), but
  • The Redshift COPY statement (correctly) does not find this ghost folder (nor any contents of it), and thus
  • The overall COPY statement (correctly) fails

We need to fix this - we can hope that just porting the StorageLoader S3 folder discovery code might make the problem go away, but it would be better to actively guard against this issue in this release.

/cc @leonmaas

@alexanderdean alexanderdean added this to the R9x [HAD] StorageLoader reboot milestone Feb 14, 2017
@chuwy chuwy changed the title Relational Database Loader: fix eventual consistency problem RDB Loader: fix eventual consistency problem Feb 20, 2017
@gregbonnette
Copy link

Good talk on this issue and some potential solutions... https://spark-summit.org/east-2017/events/spark-and-object-stores-what-you-need-to-know/

@chuwy chuwy closed this as completed in a9e0a08 Jul 26, 2017
chuwy added a commit to snowplow/snowplow-rdb-loader that referenced this issue Sep 5, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants