That is a bug in doc. The default BatchCount is 100, so a relay serving 1000 endpoints would notify 100 each second (default BatchDelay) for 10 seconds. If the relay serves 5000 endpoints, it would notify them over 50 seconds, by default. Depending on what activity is being triggered by the notification (e.g. site gather vs fast query vs large download), that rate could be much faster than the relay can serve. So it makes sense to me to increase the delay and/or lower the count a bit to minimize requests by endpoints that we don’t expect to be serviceable.
Unfortunately, there is no single answer for best values to support 5k, so you may have to try a few options and monitor to see how endpoints reporting to that relay respond to various requests. Logical options to try are:
- Default values
- Increase BatchDelay to 2000
- Decrease Batch Count to 50 and increase BatchDelay to 2000