This repository has been archived by the owner on Aug 30, 2024. It is now read-only.
monitoring 4.1 Sensors wish list
Steve Jones edited this page Sep 7, 2017
- Timing data for each phase of the run-instance path
- Java Common
  - Threads per thread-pool
  - Lock count
  - DB connection count
  - Timing for DB queries
  - Per-service API call failure count
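
As a rough illustration of two of the Java Common items above (threads per thread-pool and per-service API call failure count), the sketch below reads pool statistics from a standard `java.util.concurrent.ThreadPoolExecutor` and keeps a failure counter per service name. The class `JvmSensorSketch`, its method names, and the service names are hypothetical and are not part of any existing Eucalyptus API; lock and DB connection counts are not covered.

```java
import java.lang.management.ManagementFactory;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical helper, for illustration only.
class JvmSensorSketch {
    // One failure counter per service name (names are chosen by the caller).
    private static final Map<String, AtomicLong> FAILURES = new ConcurrentHashMap<>();

    // Record one failed API call for the given service.
    static void recordFailure(String service) {
        FAILURES.computeIfAbsent(service, k -> new AtomicLong()).incrementAndGet();
    }

    // Failures recorded so far for the service; 0 if none were recorded.
    static long failureCount(String service) {
        AtomicLong counter = FAILURES.get(service);
        return counter == null ? 0L : counter.get();
    }

    // Point-in-time readings for a single thread pool.
    static Map<String, Integer> poolSnapshot(ThreadPoolExecutor pool) {
        Map<String, Integer> snapshot = new LinkedHashMap<>();
        snapshot.put("poolSize", pool.getPoolSize());       // threads currently in the pool
        snapshot.put("activeCount", pool.getActiveCount()); // approximate number running tasks
        snapshot.put("queueDepth", pool.getQueue().size()); // tasks waiting in the work queue
        return snapshot;
    }

    // Live thread count for the whole JVM, via the standard ThreadMXBean.
    static int jvmThreadCount() {
        return ManagementFactory.getThreadMXBean().getThreadCount();
    }
}
```

A sensor built this way would be polled periodically; `getActiveCount()` is documented as approximate, so the readings are best treated as trends rather than exact values.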
- Time for Describe* calls to each cluster from CLC
  - DescribeServices
  - DescribeInstances
  - DescribeResources
  - DescribeSensors
- NC
  - Network usage (not Eucalyptus-specific)
  - VM migrations incoming
  - VM migrations outgoing
  - Space left in blob-store
  - Number of cores used, available
  - RAM used, available
  - Monitoring thread execution time (e.g. is it growing over time or holding constant?)
- SC
  - Snapshot uploads in progress
  - Bandwidth per snapshot
  - Aggregate bandwidth
  - Concurrent volume operations
  - Connectivity status to backend
  - Successful pings & failed pings
- Run-Instance timing
  - Synchronous path
  - Async full path (pending→running)
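
The run-instance timing items above could be captured with a small per-phase timer. The following is a minimal sketch under stated assumptions: the class `PhaseTimerSketch` and the phase names are hypothetical, and timestamps are passed in by the caller (for example from `System.nanoTime()`) so that the same timer can cover both the synchronous path and the asynchronous pending→running path.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, for illustration only.
class PhaseTimerSketch {
    // Accumulated nanoseconds per phase, in the order phases were first seen.
    private final Map<String, Long> elapsedNanos = new LinkedHashMap<>();
    private String currentPhase;
    private long startedAtNanos;

    // Close any open phase at nowNanos, then open the named phase.
    void start(String phase, long nowNanos) {
        finish(nowNanos);
        currentPhase = phase;
        startedAtNanos = nowNanos;
    }

    // Close the open phase, if any, and add its duration to the running total.
    void finish(long nowNanos) {
        if (currentPhase != null) {
            elapsedNanos.merge(currentPhase, nowNanos - startedAtNanos, Long::sum);
            currentPhase = null;
        }
    }

    // Read-only view of per-phase durations.
    Map<String, Long> elapsed() {
        return Collections.unmodifiableMap(elapsedNanos);
    }

    // Sum of all completed phase durations.
    long totalNanos() {
        long total = 0L;
        for (long nanos : elapsedNanos.values()) {
            total += nanos;
        }
        return total;
    }
}
```

For example, calling `start("synchronous", t0)`, then `start("pending-to-running", t1)`, then `finish(t2)` would yield one duration per phase plus the end-to-end total.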
- CloudWatch
  - Queue depth for data processing queues
  - Incoming metrics per time unit
  - Processed metrics per time unit (to detect dropped metrics)
  - Alarms
    - Number evaluated per minute
    - Number transitioned per minute
  - Total number of data points in the system
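
One way to read the "to detect dropped metrics" item above is to compare incoming and processed counts against the current queue depth: anything that arrived but is neither processed nor still queued is unaccounted for. The sketch below assumes that interpretation; `MetricFlowSketch` and its methods are hypothetical names, not an existing CloudWatch or Eucalyptus interface.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical helper, for illustration only.
class MetricFlowSketch {
    private final AtomicLong incoming = new AtomicLong();
    private final AtomicLong processed = new AtomicLong();

    // Called when n metric data points arrive.
    void onIncoming(long n) {
        incoming.addAndGet(n);
    }

    // Called when n metric data points finish processing.
    void onProcessed(long n) {
        processed.addAndGet(n);
    }

    // Data points that arrived but are neither processed nor still queued.
    // A value that stays above zero suggests metrics are being dropped.
    long unaccounted(long queueDepth) {
        return incoming.get() - processed.get() - queueDepth;
    }
}
```

Because the two counters and the queue depth are read at slightly different moments, a single nonzero reading is not conclusive; a sensor would more plausibly alert on a sustained gap.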
- AutoScaling
  - Number of scaling groups
  - Scaling actions taken
- ELB
  - Backend service pings succeeded & failed
  - Event listeners fired (e.g. VM failure detected and instance removed from rotation?)
- VPC/Networking
  - Public IPs in system
  - Public IPs in use
  - VPC count
  - Subnet count
  - MidoNet API calls failed
  - MidoNet API calls succeeded