Last Updated: 3-3-2017
Multi-faceted IT issues can be difficult to pinpoint and diagnose with cloud, virtualization, hybrid IT, storage area networks, converged infrastructure, application servers, and distributed application architectures. When an issue surfaces, for example a badly performing application or server, the investigation can take significant time to locate the core issue. The problem could be in storage, network connectivity, user access, or a mix of resources and configurations.
Create troubleshooting projects with the Performance Analysis (PerfStack™) dashboard that visually correlate historical time series data from multiple SolarWinds products and entity types in a single view.
For VMAN, the possibilities are endless for application analysis and hybrid environments:
The following example shows you how to identify a root cause for a VM experiencing performance issues. In this scenario, a virtual host encountered a resource and performance issue to the point where users encounter slower responses and access. The issue triggered an alert, which notified your application owner, who escalated the issue to system and network administrators.
Create a new troubleshooting project to investigate the issue to compare metrics for the host and all related virtual environment systems to track trends and spikes in usage.
Interested in all associated nodes, applications, servers, and more to this selected node? Click the related entities icon. All related entities display in the Metric Palette providing more options for metrics possibly causing issues.
Select the syd host node to view and select metrics to drag and drop onto the dashboard. You can drag them into the same chart to compare values between metrics.
To start investigating, pull a series of metrics for the host and cluster, comparing metrics to find spikes or high usage. For this scenario, add these host metrics:
For the cluster, add these metrics:
The charts and graphs display with data and alerts for the Last 12 hours of metrics. You can expand the date and time to see additional historical metrics over the course of the alert.
Add usage metrics for VMs on the host to compare network usage and activity.
Analyzing the data, the issue looks to be a noisy neighbor for one of the virtual machines consuming resources and experiencing high traffic causing bottlenecks and issues for VMs sharing the host. Basically, another server, service, or application is consuming higher bandwidth, disk I/O, CPU, and other resources causing issues for this specific application.
This information gives your network and system administrators a direction for further investigation and resolving latency issues. To resolve, they can reallocate resources or move the high-consumption application to another location.
Click Save and give the project a name.
The project saves as a dashboard with the selected metrics in the set date and time range.
When saved, the URL becomes a sharable link. Copy and share the link to the saved dashboard in tickets or emails sent by the system and network administrators and the product owner. They can access the link to review the gathered data and troubleshoot.
After reallocating resources and making network changes, reopen the dashboard to verify changes and new usage trends for polled metrics.