Disclaimer: The information provided in this article is for general informational purposes only. The author and the publisher do not guarantee the accuracy, completeness, or reliability of the information. The techniques and suggestions mentioned may cause harm if not implemented correctly. The author and the publisher are not liable for any damages resulting from the use or misuse of the information. Use your own judgment and exercise caution when applying the suggestions in this article. Always consult official documentation and experts, and test changes in a controlled environment before applying them to production systems. By using this information, you agree to hold the author and the publisher harmless from any liability or claim.
Applies To:
- Environments using agentless monitoring tools (e.g., LRE, SiteScope, custom monitoring solutions)
- Systems integrated with domain-based authentication (e.g., Active Directory)
- Any monitoring setup that uses stored domain credentials for resource polling
Issue:
A monitoring server triggered an extremely high volume of authentication errors — reaching over 300,000 failed logins per hour — directed at domain controllers. This caused significant performance issues across the authentication infrastructure, leading to high resource usage on the domain controllers and increased risk of service disruption.
Attempts to access the monitoring system’s UI to investigate were blocked due to missing client software and certificate issues.
Cause
The root cause was found to be invalid or outdated credentials stored in the monitoring tool. These credentials were repeatedly used by multiple scheduled monitors, each attempting to authenticate and failing continuously.
This created an authentication storm, severely overloading the domain controllers.
Steps
- Verify Monitoring Service State
- Check if the monitoring service is running. In this case, it was already stopped to reduce impact.
- Attempt to Access the UI
- Try to access the monitoring UI to inspect credentials.
- If inaccessible due to missing software (e.g., Java client) or certificate errors, proceed to an alternate method.
- Use an Offline Tool to Modify Credentials
- Utilize a configuration-level tool (e.g., a persistency viewer or config file editor) to update the stored domain account password used in monitors.
- Restart Monitoring Service
- Start the monitoring service after updating credentials.
- Confirm from logs that:
- The service starts successfully.
- Monitors are running without errors.
- No authentication failures are recorded.
- Confirm with Identity Team
- Have the authentication/infrastructure team check domain controller logs to confirm that the error volume has dropped and that the service account is authenticating successfully.
- Establish a Baseline (Optional)
- To validate the fix, stop the monitoring service temporarily (e.g., for 48 hours) to observe if the authentication traffic stabilizes.
- Restart the service after the baseline period and monitor for any recurrence of the issue.
Additional Info:
- Always validate stored credentials after password changes or account policy updates.
- Use dedicated service accounts for monitoring with limited privileges.
- Keep alternate access options available in case the monitoring UI becomes inaccessible.
- Set alerts on domain controllers to detect authentication spikes early.
We hope this article provided you with the assistance you needed. If you would like to find out more regarding the Services and Support Offerings we provide, please reach out to our Sales Team.
0 Comments