Packet clearing troubleshooting runbook #2612
Comments
Seems very similar to this issue @greg-szabo try using |
I think we have two problems we can fix on the Hermes logs/return message side:
This is technically an error/corner case, not a success.
Additional context: the underlying problem comes from Tendermint, which overwrites a tx's index entry if the tx succeeded the first time. This is being fixed in tendermint/tendermint#8625.
Checked today with MZ and HS from Cephalopod, and the packets are still stuck. We also found that one of the clients is expired.
We'll try to manually debug the stuck packets once the client is recovered. Channel ends are:
I have exactly the same issue between juno uni-5 and nois-testnet-003.
Proposal
I've been trying to clear some packets between Terra and Sifchain. One is stuck, and the runbook I have in my head doesn't give me enough information to figure out what's happening. I would like more ideas for identifying issues while clearing packets (and maybe even to document them in a runbook).
Basic steps:
clear packet
Step 1: you can see there's one packet not completely cleared yet. (I re-ran the steps multiple times to see that other processes don't clear it.)
Step 2: try to clear it:
The output indicates that the packet has been cleared. Older versions of Hermes used to list the packets involved in this step; I wish that came back, so we would know that our exact sequence number, 2196, was the packet that was found.
Step 3: query again:
Step 4: let's see what we know of that packet
Welp, not very useful.
Maybe pending packets will help?
Um... no. All I can think of is that we need to time this packet out. It's been in the queue for a while.
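Before forcing a timeout, it helps to check whether the packet is actually eligible for one. Below is a minimal Python sketch of the standard IBC timeout rule: a packet has timed out once the destination chain has reached the packet's timeout height or timeout timestamp, and a zero value disables that check. The names `PacketTimeout` and `has_timed_out` are hypothetical and not part of Hermes; the values have to be read manually from the packet data and from the destination chain's latest block.

```python
from dataclasses import dataclass
from typing import Tuple

# Hypothetical helper types for illustration; not a Hermes API.
Height = Tuple[int, int]  # (revision_number, revision_height)


@dataclass(frozen=True)
class PacketTimeout:
    timeout_height: Height      # (0, 0) means "no height timeout"
    timeout_timestamp_ns: int   # 0 means "no timestamp timeout"


def has_timed_out(timeout: PacketTimeout, dest_height: Height, dest_time_ns: int) -> bool:
    """Return True if the destination chain has passed either timeout bound."""
    # Tuples compare lexicographically: revision number first, then height.
    height_expired = (
        timeout.timeout_height != (0, 0) and dest_height >= timeout.timeout_height
    )
    time_expired = (
        timeout.timeout_timestamp_ns != 0
        and dest_time_ns >= timeout.timeout_timestamp_ns
    )
    return height_expired or time_expired
```

If this returns False for the stuck packet, a timeout message cannot succeed yet, no matter how long the packet has been sitting in the queue.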
Running acks, pending-acks, and pending-sends shows empty results. This packet is there to be committed. As a last Hail Mary, we could try to force a timeout packet, let's see:
But still the packet is listed under commitments. (I tried sending a packet-ack because why not, but that obviously gave an empty "SUCCESS" back.)
So, how should we get to know more about the packet and the necessary steps to send it on its way?
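One way to structure the diagnosis is to combine the query results into a single state per sequence number. The sketch below is only an illustration in Python, under the assumption that the outputs of the commitments, pending-sends, and pending-acks queries have been copied by hand into sets of sequence numbers; `PacketState` and `classify` are hypothetical names, not Hermes functions.

```python
from enum import Enum
from typing import Set


class PacketState(Enum):
    CLEARED = "no commitment left on the source chain"
    AWAITING_RECV_OR_TIMEOUT = "not received on destination: relay it or time it out"
    AWAITING_ACK_RELAY = "acknowledged on destination: relay the ack to the source"
    INCONSISTENT = "commitment exists but nothing is pending: inspect manually"


def classify(
    seq: int,
    commitments: Set[int],    # sequences still committed on the source chain
    pending_sends: Set[int],  # committed sequences not yet received on destination
    pending_acks: Set[int],   # acks written on destination, not yet seen by source
) -> PacketState:
    """Map one sequence number to the next troubleshooting step."""
    if seq not in commitments:
        return PacketState.CLEARED
    if seq in pending_sends:
        return PacketState.AWAITING_RECV_OR_TIMEOUT
    if seq in pending_acks:
        return PacketState.AWAITING_ACK_RELAY
    # The case described in this issue: listed under commitments,
    # but absent from both pending queries.
    return PacketState.INCONSISTENT


print(classify(2196, {2196}, set(), set()).name)
```

For the packet in this issue, the sketch lands in the last branch, which is exactly the case where the current output gives no hint about what to do next.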
Acceptance Criteria
Commitment queue empty.