Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[STRMCMP-569] Failures during job startup should trigger immediate rollback #100

Merged
merged 1 commit into from
Sep 6, 2019

Conversation

mwylde
Copy link
Contributor

@mwylde mwylde commented Sep 6, 2019

When the user specifies a wrong entry class or the main method throws an exception, the Flink API returns back a 500 error with a org.apache.flink.client.program.ProgramInvocationException.

In order to minimize downtime, we want to rollback immediately in this case rather than waiting for retries to be exhausted.

This PR also improves the error reporting a bit for this situation. Previously, the error wrapping functions took an array of strings which were reported verbatim. This produced confusing output that implied that the API itself was returning an array. As all actual calls to these methods were using either 0 or 1 arguments, I've split it out into those particular cases.

@mwylde mwylde merged commit b1607d0 into master Sep 6, 2019
@mwylde mwylde deleted the micah_startup_rollback branch September 6, 2019 21:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants