Build StaaS with microservices: Storage automation (Part 2)

Guest Author: Tony Johnson, SRE Automation Lead, IBM

In part 1 of this blog series, I shared how my team at Red Hat began our storage automation journey to address some pain points. I also outlined the objectives and metrics we wanted this experiment to achieve. In this post, I’ll review what we tackled first: provisioning automation.

The problem of provisioning

One of our biggest issues in implementing storage as a service (StaaS) was request resolution time. Our median time to complete a request was 5 business days, which was far too long. The other issue was process quality. Although we had relatively few errors in our provisioning process, we could not triage them effectively: we had to go back to find the original ticket, talk to the engineer who handled it, recover any data needed, and recreate the provisioning request correctly. This process took time and damaged our team’s reputation within the organization.

Data protection required an additional ad hoc request to a data protection specialist. Because we didn’t have a standard process, we had to ask each requester what they needed in terms of data protection. This lack of standards led to governance complications, a nonstandard disaster recovery process, and a lack of clarity around meeting the organization’s recovery time objective (RTO) needs.

We needed a provisioning automation project to reduce time to market on requests, eliminate provisioning errors, and standardize our data protection process and capabilities.

But first, we had to decide what data was required as input to the provisioning workflow. We needed some key data points about the application (sketched in code after this list), including:

  • Recovery point objective (RPO) and RTO requirements, tiered by application criticality
  • Application criticality, which drives data protection standards
  • Application data classification, which drives data placement and encryption needs
  • Service-level definition (Extreme, Performant, or Value); we’ve since added two more high-performance levels above Extreme
  • Application performance profile (IOPS and throughput) for service-level mapping
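
To make this concrete, here’s a minimal sketch of that input as a Python dataclass. The field names and enum values are my shorthand for illustration, not our actual schema:

    from dataclasses import dataclass
    from enum import Enum
    from typing import Optional


    class ServiceLevel(Enum):
        """Our three original service levels; two faster tiers came later."""
        EXTREME = "extreme"
        PERFORMANT = "performant"
        VALUE = "value"


    @dataclass
    class ProvisionRequest:
        """Data points the provisioning workflow needs for one request."""
        application: str
        criticality: str              # drives data protection standards
        data_classification: str      # drives placement and encryption
        rpo_minutes: int              # recovery point objective
        rto_minutes: int              # recovery time objective
        iops: Optional[int] = None    # performance profile, if known
        throughput_mbps: Optional[int] = None
        service_level: Optional[ServiceLevel] = None  # assigned during provisioning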

Working in ServiceNow with our security and service mapping teams, we defined RPOs, RTOs, application criticality, and data classification for each application. We then characterized service levels for volume types such as databases and logs.

We also created user-acceptance performance profiles for application volumes and set a default service level for requests whose performance profile we couldn’t determine. After initial provisioning, we used our monitoring platform to identify service-level changes and migration needs.
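
Building on the ProvisionRequest sketch above, here’s an illustration of that mapping. The thresholds are made up, not our production cutoffs, and the default tier is an assumption:

    def map_service_level(req: ProvisionRequest) -> ServiceLevel:
        """Map a performance profile to a service level.

        Thresholds are illustrative. When the profile is unknown we
        fall back to the default tier; monitoring data after initial
        provisioning tells us whether the volume should migrate.
        """
        if req.iops is None or req.throughput_mbps is None:
            return ServiceLevel.PERFORMANT  # assumed default tier
        if req.iops > 50_000 or req.throughput_mbps > 1_000:
            return ServiceLevel.EXTREME
        if req.iops > 10_000 or req.throughput_mbps > 250:
            return ServiceLevel.PERFORMANT
        return ServiceLevel.VALUE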

Meet our new pipeline

Our pipeline starts with a ServiceNow form asking the requester which application the storage is for, whether they need block or file storage, which hosts need to connect, and what environment and capacity are required. ServiceNow sends the application’s criticality and data classification to the pipeline, which then configures the storage on the appropriate hardware for that environment.
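
Here’s a rough sketch of that intake step. The payload keys, pool names, and helper function are hypothetical stand-ins; the real form fields and hardware mapping live in our ServiceNow catalog:

    # Hypothetical mapping from environment to a hardware pool.
    BACKEND_POOLS = {
        "prod": "array-prod-01",
        "nonprod": "array-nonprod-01",
        "dev": "array-dev-01",
    }


    def handle_request(payload: dict) -> ProvisionRequest:
        """Build a provisioning request from the ServiceNow form payload
        and place the storage on hardware matching the environment."""
        req = ProvisionRequest(
            application=payload["application"],
            criticality=payload["criticality"],
            data_classification=payload["data_classification"],
            rpo_minutes=payload["rpo_minutes"],
            rto_minutes=payload["rto_minutes"],
            iops=payload.get("iops"),
            throughput_mbps=payload.get("throughput_mbps"),
        )
        backend = BACKEND_POOLS[payload["environment"]]
        create_volume(backend, payload["storage_type"],   # "block" or "file"
                      payload["capacity_gib"], payload["hosts"])
        return req


    def create_volume(backend: str, storage_type: str,
                      capacity_gib: int, hosts: list) -> None:
        """Placeholder for the vendor-specific call that actually carves
        out the volume or share on the chosen array."""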

If application performance is specified, the request is assigned an appropriate service level; if performance can’t be ascertained, the request is placed in our default service level. Data protection frequency and location are configured based on criticality and the RPO and RTO requirements, and all storage is encrypted according to the application’s data classification.
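
A sketch of that policy step follows. The tiers, frequencies, site names, and key-handling split are illustrative assumptions, not our actual standards, which are defined in ServiceNow:

    def protection_policy(req: ProvisionRequest) -> dict:
        """Derive data protection settings from criticality and RPO/RTO.

        The tiers, frequencies, and site names are illustrative.
        """
        if req.criticality == "mission-critical" or req.rpo_minutes <= 15:
            policy = {"snapshot_every_min": 15, "replicate_to": "dr-site-a"}
        elif req.rpo_minutes <= 240:
            policy = {"snapshot_every_min": 60, "replicate_to": "dr-site-a"}
        else:
            policy = {"snapshot_every_min": 1440, "replicate_to": None}
        # Encryption is always applied; classification decides the key
        # handling (one possible reading of the classification rule).
        policy["encryption"] = ("customer-managed-keys"
                                if req.data_classification == "restricted"
                                else "platform-managed-keys")
        return policy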

Next, we add the storage to our monitoring platforms via API calls. We developed triage-focused storage monitoring for storage administrators, and a simple monitoring tool for application owners that provides an overview of application health. We then add the storage to our logging platform, which captures macro information about the storage to help us plan annual purchasing and budget requirements. Pipeline outputs are also added to the logging platform for auditing and success-rate metrics.
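
Each enrollment is a plain REST call per platform. A minimal sketch, with an entirely hypothetical monitoring endpoint standing in for our actual platforms:

    import requests

    MONITORING_API = "https://monitoring.example.com/api/v1"  # hypothetical


    def register_with_monitoring(volume_id: str, application: str,
                                 session: requests.Session) -> None:
        """Enroll a newly provisioned volume in a monitoring platform.

        The endpoint and body are placeholders; the logging platform
        gets an equivalent POST with the macro capacity/usage fields.
        """
        resp = session.post(
            f"{MONITORING_API}/volumes",
            json={"volume_id": volume_id, "application": application},
            timeout=30,
        )
        resp.raise_for_status()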

Finally, the pipeline sends nightly feedback to ServiceNow to tie each application to all of its infrastructure resources.
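
That feedback loop can be as simple as a nightly job writing relationship records through the ServiceNow Table API. Continuing the sketch above, with a hypothetical instance URL and a placeholder relationship-type ID:

    SN_INSTANCE = "https://example.service-now.com"  # hypothetical instance


    def link_app_to_volume(session: requests.Session,
                           app_sys_id: str, volume_sys_id: str) -> None:
        """Record an application-to-volume relationship in the CMDB via
        the ServiceNow Table API (cmdb_rel_ci table). The relationship
        type sys_id is a placeholder for the one configured on the
        instance."""
        resp = session.post(
            f"{SN_INSTANCE}/api/now/table/cmdb_rel_ci",
            json={
                "parent": app_sys_id,
                "child": volume_sys_id,
                "type": "RELATIONSHIP_TYPE_SYS_ID",  # placeholder sys_id
            },
            timeout=30,
        )
        resp.raise_for_status()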

As a result of our provisioning automation effort, we have achieved our objectives:

  • Provisioning requests no longer have to sit in a queue waiting for human hands to complete them.
  • Quality has improved by minimizing human error.
  • Any errors in the workflow are much easier to triage and fix permanently using our new logging capabilities.
  • Our storage engineers now have more time to spend enhancing our service, including working toward our strategic initiative: container persistent storage.

In the next part of this blog series, I’ll dive into automation for deployment of new equipment, the efficiency gained, and some surprising quality benefits.

To learn more, check out this video we did at NetApp INSIGHT® 2021, “Build Storage as a Service (StaaS) with Micro-Services”.

Guest Author: Tony Johnson, SRE Automation Lead, IBM

Tony Johnson is the SRE Automation Lead for IBM. He is responsible for the automation of hybrid cloud services and platforms for the IBM CIO team. Previously, he was Storage Manager at Red Hat from 2014 to 2020, leading a group of six engineers responsible for IT storage.
