Monitor worker nodes to detect data source connection issues. Automatically reconfigure problematic nodes to reestablish a functional connection.
Avoid query bottlenecks by monitoring and automatically scaling worker and coordinator nodes. Dynamically adjust query.max-memory and query.max-memory-per-node.
Monitor node health and performance, then automatically increase driver/executor memory allocation limits or scale the underlying nodes. Upload logs to Amazon S3 for further analysis.
Oversee read/write capacity unit usage and get notified when reservations are near capacity.
Monitor and automatically compact collections, repair databases, and resize the oplog. Automatically increase EC2 node disk sizes for self-hosted clusters with minimal downtime.
Get notified and (optionally) restart daemons when process count, max_used_connections, or aborted connections exceed your thresholds. Also, watch and alert on deadlocks and row locks.
Identify and diagnose locking issues due to concurrent queries. Dynamically enable logging and automate Cypher Shell CLI calls to kill or rerun problematic queries.
Monitor service health and automatically generate a new Nagios configuration with updated DNS servers if the service goes down.
Watch Prometheus CPU, memory, and disk usage. Automatically scale the Prometheus server to meet demand, alter configuration files, and restart the process.
Perform real-time monitoring of uptime, configurations, directory listings, server signatures, active worker requests, resource usage, and more. Automatically notify operators and (optionally) restart the server.
Receive automatic notifications when Envoy and Istio diffs breach a user-defined threshold. Watch for Istio proxy container connectivity issues and get immediate alerts.
Avoid stale redirects by monitoring upstream CNAME IP address changes. Automatically alter nginx.conf with non-cached endpoints and reload NGINX to replace any outdated addresses.
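As a rough illustration of the stale-redirect check described for NGINX, the sketch below resolves an upstream hostname and compares the result with the addresses seen on the previous check. It is not Shoreline's implementation: the function names `resolve_ips`, `upstream_changed`, and `reload_nginx` are hypothetical, and rewriting nginx.conf is left out because it depends on your configuration layout. Only `nginx -s reload` is a standard NGINX command.

```python
import socket
import subprocess
from typing import Callable, Iterable, Set


def resolve_ips(hostname: str) -> Set[str]:
    """Return the set of IP addresses the hostname currently resolves to."""
    infos = socket.getaddrinfo(hostname, None, proto=socket.IPPROTO_TCP)
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the IP.
    return {info[4][0] for info in infos}


def upstream_changed(
    hostname: str,
    known_ips: Iterable[str],
    resolver: Callable[[str], Set[str]] = resolve_ips,
) -> bool:
    """Report whether the upstream now resolves to a different set of addresses.

    An empty lookup result is treated as "no change" so that a transient
    DNS failure does not trigger a reload (an assumption of this sketch).
    """
    current = resolver(hostname)
    return bool(current) and current != set(known_ips)


def reload_nginx() -> None:
    """Ask the running NGINX master process to reload its configuration."""
    subprocess.run(["nginx", "-s", "reload"], check=True)
```

A scheduled job could call `upstream_changed` with the last recorded addresses and, when it returns `True`, update the configuration and call `reload_nginx()`. The resolver is passed in as a parameter so the comparison logic can be exercised without real DNS lookups.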
Shoreline helps you eliminate repetitive tickets and increase your availability at the same time. Get started today by scheduling a call with us to see a demo.