Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Failure Introspection leaves me wanting more #26

Closed
jolexa opened this issue Nov 2, 2017 · 2 comments
Closed

Failure Introspection leaves me wanting more #26

jolexa opened this issue Nov 2, 2017 · 2 comments
Assignees

Comments

@jolexa
Copy link
Contributor

jolexa commented Nov 2, 2017

Hello, I was really hopeful that -log-level debug would assist me, but it still leaves me wanting more. I don't know how to get anymore info out of levant in this case. Any ideas?

20:49:09.479 [DEBUG] levant/deploy: running dynamic job count updater for job sre-example-app
20:49:09.485 [INFO] levant/deploy: using dynamic count 3 for job sre-example-app and group production
20:49:09.485 [INFO] levant/deploy: triggering a deployment of job sre-example-app
20:49:09.495 [DEBUG] levant/deploy: beginning deployment watcher for job sre-example-app
20:49:09.500 [DEBUG] levant/deploy: Nomad returned an empty deployment for evaluation 7d96d04e-92f6-f9be-e4cc-6e0c7d904d1e; retrying
20:49:11.521 [DEBUG] levant/deploy: deployment 1ac0c116-f987-af95-be37-292f8842c707 running for 2.02595389s
20:49:11.521 [ERROR] levant/deploy: deployment 1ac0c116-f987-af95-be37-292f8842c707 has status failed, Levant will now exit
20:49:11.529 [ERROR] levant/command: deployment of job sre-example-app failed
@jrasell
Copy link
Member

jrasell commented Nov 2, 2017

Hey @jolexa; it could be because I don't currently reflect through the allocation errors which is something I have been meaning to update and only use a case statement to catch the messages which has potential to miss ones which are not explicitly checked. I will have a look into this tomorrow and should have an update early UK time to test out. In the meantime if you do have the output of manually checking the alloc-status log that would help me ensure I fix the right thing.

@jrasell jrasell added the bug label Nov 2, 2017
@jrasell jrasell self-assigned this Nov 3, 2017
@jolexa
Copy link
Contributor Author

jolexa commented Nov 3, 2017

I understand what you are saying. Unfortunately, I cannot reproduce this issue today. One thing that changed is that I think there was a bad deployment that nomad couldn't update and today I "stopped" the dead job. I'm pretty new to nomad and not sure what happened.

jrasell pushed a commit that referenced this issue Nov 7, 2017
Previously, Levant was only able to catch a small number of alloc
failure cases. This changes expands the case statement with the
ability to catch many more.

Closes #26
jrasell added a commit that referenced this issue Nov 7, 2017
Previously, Levant was only able to catch a small number of alloc
failure cases. This changes expands the case statement with the
ability to catch many more.

Closes #26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants