-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Getting 'Error finding Wavefront Alert' sometimes #16
Comments
Hi, I suspect this is some sort of low-level transport problem maybe caused by a mismatch between keepalives on the client side and idle timeout on the server. The link you posted is actually for Events, not Alerts, but the underlying client is the same. I think this could be fixed by disabling keepalives on the HTTP transport https://golang.org/pkg/net/http/#Transport so that each HTTP request uses a separate TCP connection instead of trying to re-use them. I'm happy to give this a go. |
Just some further questions:
We are discussing how we can implement debug logging. The facility exists on the wavefront client, we just need to figure the best way to enable it via terraform. |
NOTE: I tried using a different WF CLI client to see if I am actually being rate-limited by WF, but it didn't seem to fail -- although the Gist: https://gist.github.com/stephenchu/07df8b63971e72d3ea54cd9ae44182d0 Sorry - probably won't work out-of-the-box on all these but I'm sure you can figure it all out. TIA |
I seem to be getting this error 9 out of 10 times now somehow... :-( Looks related: golang/go#22158 |
Hi @stephenchu, Apologies for the delay, we are working to allocate some time in our internal sprint to look into this. |
No problem at all. I'm already grateful for your attention on this matter. One thought on how it could be fixed. If this was indeed due to the aforementioned golang bug, then it is a bug exposed when doing http POST. But, in this case, the "POST to search by alert id" can also be done via a "GET to describe by alert id", which should be sufficient to bypass this bug. But, of course, if you think turning off KeepAlive is a better and cheaper fix, I will not have a problem with that. |
Hi @stephenchu, I'm afraid I'm unable to reproduce this issue. I have successfully applied, planned and destroyed 400 test alerts several times without hitting the problem. I'm sure you're aware, but there was a known issue with parallel requests made against the Wavefront API, which they have recently fixed. See this commit. Before that you should have been running updates with All of my repro attempts have been made with no explicit parallelism setting. I'm thinking/hoping that the Wavefront API fix may have made this problem go away. Could you confirm that you are still seeing this problem? If so, there is still a chance your cluster has not been upgraded, and in that case could you try running with |
I am happy to confirm that I somehow magically no longer see the problem... I guess you were right about something about my WF account/cluster changed under the hood recently. I am closing this now. Thanks a bunch for your time! |
Occasionally I am getting
Error finding Wavefront Alert
errors:Is it the case the WF's search API is not that reliable? Or is it that the
alerts.Find()
here is not waiting enough for results to be returned?Is there any way to maybe turn on "debug" on these request/response made by the provider, so I can see more information?
Thanks!
The text was updated successfully, but these errors were encountered: