Spreading Queue Failover Load
When replication is active, a subset of Region Servers in the source cluster is responsible for shipping edits to the sink. This responsibility must be failed over like all other Region Server functions if a process or node crashes. The following configuration settings are recommended for maintaining an even distribution of replication activity over the remaining live servers in the source cluster:
- Set replication.source.maxretriesmultiplier to 300.
- Set replication.source.sleepforretries to 1 (1 second). This value, combined with the value of replication.source.maxretriesmultiplier, causes the retry cycle to last about 5 minutes.
- Set replication.sleep.before.failover to 30000 (30 seconds) in the source cluster site configuration.
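
The settings above could be expressed as the following fragment. This is a minimal sketch: it assumes the properties are placed in hbase-site.xml on the source cluster (the text says only "site configuration") and uses the standard Hadoop-style property layout. The values are copied verbatim from the list; confirm the expected units for your HBase version before applying them.

```xml
<!-- Sketch of the recommended replication failover settings.
     Assumption: these entries live inside the <configuration> element
     of hbase-site.xml on the source cluster. -->
<property>
  <name>replication.source.maxretriesmultiplier</name>
  <value>300</value>
</property>
<property>
  <!-- Value as given in the text above (described there as 1 second). -->
  <name>replication.source.sleepforretries</name>
  <value>1</value>
</property>
<property>
  <!-- 30000 is described in the text above as 30 seconds. -->
  <name>replication.sleep.before.failover</name>
  <value>30000</value>
</property>
```

With these values, the retry cycle works out to roughly 300 retries at 1 second each, which is the 5 minutes cited above. A restart of the affected Region Servers is typically needed for site configuration changes to take effect.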

