Skip to content

What determines when a replication job is considered 'crashing'? #3550

Answered by nickva
QualityControll asked this question in Q&A
Discussion options

You must be logged in to vote

With remote connections unless there is a periodic ping or timeout involved, the socket might not know that the cable was pulled. If the documents have all replicated, for example, we'd only find out if the connection is broken when the _changes feed times out. The timeout on the changes feed will be derived from the connection_timeout config parameter and since you set it to 10000 (10 seconds) so it seems you should find earlier than a minute. Good idea to lower retries_per_request too.

I think you meant _scheduler/docs/_replicator? Maybe monitor the logs and see when you start seeing errors in the log and if you poll _scheduler/jobs or _scheduler/docs when you start seeing the first sta…

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@QualityControll
Comment options

@nickva
Comment options

@QualityControll
Comment options

@nickva
Comment options

Answer selected by nickva
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants