How Automated Response Takes Pressure Off IT Tech Support


First, Understand the Process

The bulk of any automation project, and much of its success, lies in first understanding the process and workflows. Our Command Center engineers were challenged to define a standard response process for a handful of first-level incidents that generate a high volume of service tickets. They started with relatively simple-to-resolve incidents, such as rebalancing storage capacity or restarting an offline application. The team used scripts as building blocks and applied the relevant responses as needed. Today they are building our library of automated responses while continuing to provide day-to-day IT support.

Then, Apply the Technology

The automated response process is designed to work within our existing ecosystem. When an incident is received by our Zenoss monitoring system, it creates a ticket in our ServiceNow service management platform. If the ticket is flagged with auto response enabled, a script is executed using Ansible. The script directs the affected application to run certain commands, collect the results, and place the information into the ticket for a tech support person to access.
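The flow described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the NetApp IT implementation: the `Ticket` class, its field names, and the `handle_ticket` function are hypothetical stand-ins for a real ServiceNow ticket and integration, and the playbook name in the usage note is invented. The only external tool invoked is the `ansible-playbook` command with its `--limit` option.

```python
from dataclasses import dataclass, field
from typing import Callable, List
import subprocess


@dataclass
class Ticket:
    """In-memory stand-in for a service desk ticket (hypothetical fields)."""
    host: str
    summary: str
    auto_response_enabled: bool = False
    work_notes: List[str] = field(default_factory=list)


def run_playbook(playbook: str, host: str) -> str:
    """Run an Ansible playbook against one host and return its combined output."""
    result = subprocess.run(
        ["ansible-playbook", playbook, "--limit", host],
        capture_output=True,
        text=True,
        check=False,  # a failed run is still recorded in the ticket
    )
    return result.stdout + result.stderr


def handle_ticket(
    ticket: Ticket,
    playbook: str,
    runner: Callable[[str, str], str] = run_playbook,
) -> bool:
    """Run the auto response only if the ticket is flagged for it.

    Returns True if the script ran and its output was added to the ticket.
    """
    if not ticket.auto_response_enabled:
        return False
    output = runner(playbook, ticket.host)
    # Place the collected results where the support engineer will see them.
    ticket.work_notes.append(f"Auto response ({playbook}) output:\n{output}")
    return True
```

A call such as `handle_ticket(Ticket("app01", "service down", auto_response_enabled=True), "restart_service.yml")` would run the playbook and record its output in the ticket's notes. The `runner` parameter is there so the logic can be exercised without Ansible installed.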

With auto response, it is important to confirm that the problem is really resolved. Until the Command Center is fully confident that a script is doing its job, team members verify each resolution. Although this verification slows the process, we have still gained a huge head start in incident resolution. When a tech support person opens the ticket, they can review the basic information gathered by the auto response and begin troubleshooting immediately. This eliminates the delay that comes from running tests to diagnose the issue.

For those incidents with auto response enabled, it takes 3 to 4 minutes (on average) to execute an automation script and approximately one day from when the ticket is opened to when it is resolved (known as ticket duration). Without auto response, the average ticket duration was three days, and it took an engineer approximately one hour to assess the situation. Other benefits of automating incident resolution include:

Elimination of human error, increase in productivity, and reduction of rework;

A more engaged Command Center team, whose members intentionally look for incidents where auto response can be enabled so that they can spend their time troubleshooting more complex issues that uplevel their skills;

Better reporting from an integrated ecosystem that reports volumes, success rates, areas of chronic failures, and more.
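To put the timing figures quoted above in relative terms, the short calculation below works out the percentage reductions. It is only a back-of-the-envelope sketch: the `percent_reduction` helper is hypothetical, 3.5 minutes is assumed as the midpoint of the 3-to-4-minute range, and comparing script execution time with an engineer's one-hour assessment is an approximation rather than a like-for-like measurement.

```python
def percent_reduction(before: float, after: float) -> float:
    """Return how much smaller `after` is than `before`, as a percentage."""
    if before <= 0:
        raise ValueError("before must be positive")
    return (before - after) / before * 100


# Average ticket duration: three days without auto response, one day with it.
duration_saving = percent_reduction(before=3, after=1)

# Initial assessment: about 60 minutes by an engineer versus an assumed
# 3.5 minutes (midpoint of 3 to 4) for the automation script.
assessment_saving = percent_reduction(before=60, after=3.5)

print(f"Ticket duration reduced by {duration_saving:.0f}%")
print(f"Initial assessment time reduced by {assessment_saving:.0f}%")
```

On these numbers, ticket duration drops by about two-thirds.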

Auto Response (AR) Examples

Below are some auto response (AR) examples that the NetApp IT team has implemented. These examples help keep services up, prevent interruptions, and eliminate manual intervention by the IT operations teams to check and restore service.

Auto restart of the Kitchen Police service for Storage Operations: When Zenoss detects that the Kitchen Police service is down on the Storage Admin server, it automatically opens a ServiceNow ticket and invokes AR to restart the service.

Auto restart of the WebLogic service after an Out-of-Memory alert for IAM Operations: When Splunk forwards an Out-of-Memory alert from the IAM servers, Zenoss creates the event, automatically opens a ServiceNow ticket, and invokes AR to restart the WebLogic service on the impacted server.

Auto restart of the Tivoli Workload Scheduler (TWS) service on servers: When Zenoss detects that the TWS service is down on a server, it automatically opens a ServiceNow ticket and invokes AR to restart the TWS service.

Auto restart of OpenShift nodes: When OpenShift detects that nodes are not ready, it generates an event, which Splunk captures and forwards to Zenoss. Zenoss then generates an AR that creates an incident and triggers the script to restart the node automatically. The AR also attaches the log to the incident.

Response automation for the Fujitsu RMA process for Unix Operations: When NetApp IT's ServiceNow receives a new Service Request email from Fujitsu, it automatically opens a ServiceNow ticket and invokes AR to collect the required logs from the impacted server and upload them to the Fujitsu FTP server.
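One way to picture how examples like these could be wired together is a lookup table that maps each detected condition to the response script to run. The sketch below is illustrative only: the condition keys, the playbook file names, and the `select_response` function are all hypothetical and are not taken from the NetApp IT configuration.

```python
from typing import Dict, Optional

# Detected condition -> response script. Every name here is hypothetical.
AR_RULES: Dict[str, str] = {
    "kitchen_police_down": "restart_kitchen_police.yml",
    "weblogic_out_of_memory": "restart_weblogic.yml",
    "tws_down": "restart_tws.yml",
    "openshift_node_not_ready": "restart_openshift_node.yml",
    "fujitsu_service_request": "collect_and_upload_logs.yml",
}


def select_response(condition: str) -> Optional[str]:
    """Return the response script for a condition, or None if no AR exists.

    A None result means the ticket is left for an engineer to handle manually.
    """
    return AR_RULES.get(condition.strip().lower())
```

Keeping the rules in a single table means enabling auto response for a new incident type is a one-line change, and any condition without an entry simply falls back to the manual process.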

The NetApp-on-NetApp blogs feature advice from subject matter experts from NetApp IT who share their real experiences using NetApp’s industry-leading data management solutions to support business goals. Visit www.NetAppIT.com to learn more.

Andy Kranjec
