
SAM Linux Agent suddenly crashes

Updated December 28th, 2016

Overview

This article covers an issue where a Linux Agent deployed on a node crashes and stops responding.

Environment

  • SAM 6.3 and later
  • Linux Agent v1.6

 

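Cause

  • RPC issue on the target node: the agent's RPC service does not start within the wait period, so the agent service exits and systemd repeatedly restarts it.

  • Error shown in the Orion Web Console:
    Agent is not responding, troubleshooting not possible

  • Sample errors from the node's syslog: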
    Dec 14 21:09:27 rzslack01 systemd[1]: swiagentd.service: Unit entered failed state.
    Dec 14 21:09:27 rzslack01 systemd[1]: swiagentd.service: Failed with result 'exit-code'.
    Dec 14 21:09:33 rzslack01 systemd[1]: swiagentd.service: Service hold-off time over, scheduling restart.
    Dec 14 21:09:33 rzslack01 systemd[1]: Stopped SolarWinds Agent Service.
    Dec 14 21:09:33 rzslack01 systemd[1]: Starting SolarWinds Agent Service...
    Dec 14 21:09:33 rzslack01 systemd[1]: Started SolarWinds Agent Service.
    Dec 14 21:09:33 rzslack01 SolarWinds Agent[21611]: int main(int, PortingUtilities::utfstrings::utf_char**) - [swiagent] failed. Error [0xffffffffffffffff] : w32_exception caught: Error [0xffffffff], [virtual void AgentRpcServiceBase::Start(bool, DWORD) - failed to wait [90000] ms for service to start, wait code [1]], File: /src/agent/Src/EminentWare.Agent.Service/Agent.Service.cpp, Line: 903
    Dec 14 21:09:33 rzslack01 systemd[1]: swiagentd.service: Main process exited, code=exited, status=1/FAILURE
    Dec 14 21:09:33 rzslack01 systemd[1]: swiagentd.service: Unit entered failed state.
    Dec 14 21:09:33 rzslack01 systemd[1]: swiagentd.service: Failed with result 'exit-code'.
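
To check whether a node shows the same failure, the syslog can be scanned for the RPC start timeout message seen in the sample above. The following is a minimal sketch, not a SolarWinds tool: the function name `find_rpc_start_failures` and the `RpcStartFailure` record are hypothetical, and the pattern assumes the message format matches the excerpt in this article.

```python
import re
from typing import Iterable, List, NamedTuple


class RpcStartFailure(NamedTuple):
    """One occurrence of the agent RPC start timeout (hypothetical record type)."""
    timestamp: str
    host: str
    timeout_ms: int


# Assumes the classic syslog prefix "Mon DD HH:MM:SS host ..." and the
# "failed to wait [N] ms for service to start" text from the sample errors.
_PATTERN = re.compile(
    r"^(?P<ts>\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) (?P<host>\S+) .*"
    r"AgentRpcServiceBase::Start\(.*"
    r"failed to wait \[(?P<ms>\d+)\] ms for service to start"
)


def find_rpc_start_failures(lines: Iterable[str]) -> List[RpcStartFailure]:
    """Return every line that matches the RPC start timeout signature."""
    failures = []
    for line in lines:
        match = _PATTERN.search(line.strip())
        if match:
            failures.append(
                RpcStartFailure(match["ts"], match["host"], int(match["ms"]))
            )
    return failures
```

Passing the lines of a saved syslog file to `find_rpc_start_failures` returns one record per matching entry, with the timestamp, host name, and wait time in milliseconds.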

 

Resolution

  1. Uninstall the Linux Agent from the Linux machine.
    Note: The steps vary by Linux distribution. How-To Geek (http://www.howtogeek.com/, Copyright © 2006-2016 How-To Geek, LLC., obtained December 27, 2016) has a helpful article with steps and screenshots for uninstalling software from the Linux command line.
  2. Delete the agent from the Orion Web Console, then redeploy it using the .RPM file downloaded from the Web Console:
    1. Go to Settings > Agent Settings > Agent Settings.
    2. Download Agent software.
    3. Select Linux.
    4. Follow the on-screen prompts to select the correct distribution to download.
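
After redeploying, it can be useful to confirm on the node that the agent service is running and not cycling through restarts again. The sketch below is an illustration only: it assumes the node uses systemd and that the unit is named `swiagentd.service`, as in the syslog excerpt in this article. The function name `agent_is_active` and its `run` parameter are hypothetical.

```python
import subprocess

# Unit name taken from the syslog excerpt in this article (assumption:
# the redeployed agent registers the same unit name).
UNIT = "swiagentd.service"


def agent_is_active(unit: str = UNIT, run=subprocess.run) -> bool:
    """Return True if `systemctl is-active` reports the unit as active.

    `run` is injectable so the check can be exercised without systemd.
    """
    result = run(
        ["systemctl", "is-active", unit],
        capture_output=True,
        text=True,
        check=False,  # is-active exits non-zero for inactive or failed units
    )
    return result.stdout.strip() == "active"
```

Calling `agent_is_active()` a few minutes after redeployment gives a quick indication of whether the service stayed up. A `False` result suggests checking the syslog for the errors listed under Cause.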

 

Disclaimer: Please note, any content posted herein is provided as a suggestion or recommendation to you for your internal use. This is not part of the SolarWinds software or documentation that you purchased from SolarWinds, and the information set forth herein may come from third parties. Your organization should internally review and assess to what extent, if any, such custom scripts or recommendations will be incorporated into your environment.  You elect to use third party content at your own risk, and you will be solely responsible for the incorporation of the same, if any.

 

 

 
