Business has never before been so reliant on a stable Internet connection; the cloud has now displaced a huge amount of historically on-premise solutions, from accounting software through to CRM systems. It’s mobile; it’s always on and readily available. But this raises the question, just what happens when the faithful cloud (or to be exact, services hosted in it) does go offline?
While it’s rare that it occurs, there have been instances when the worst has happened. When Amazon went down there was an estimated loss of $1,100 per second. And with systems like Amazon Web Services, applications and services that use it are affected, taking down with it the likes of Airbnb and Instagram.
In the event of service outages, it’s important that the impact is minimised as much as possible. Here are 10 tips to assist:
- Sign A SLA
A service level agreement is standard when it comes to cloud services and agrees to certain level of uptime from a provider. Though if a service guarantees an availability of 99.5%, there is 1.83 days of downtime a year, or 3.60 hours in scheduled or unscheduled downtime a month. While this may sound small, critical applications must take availability into account.
- On-Premise Solutions Still Have A Place
On-premise solutions can still give significant benefits. If a service has specific in-house skills attached to, such as bespoke applications, a business may be unable to place them onto external services or need a more ‘hands-on’ approach to maintaining the service. There is still a place for on-premise solutions and while cloud may be the hot topic, some things are best kept within an organisation’s wall.
- Plan For Failure
If a failure does happen, plan for it. There are several options surrounding redundancy and failover systems to make sure if systems do fail there are all of the necessary precautions in place to maintain an organisations processes.
- Quantify The Cost Of Downtime
Quantifying the cost of downtime can give great insight into just what the cost of downtime leads too and give leverage when it comes to reasoning further investment into both business infrastructure and IT provisioning.
- Be Both Proactive & Reactive To Downtime
When downtime occurs it’s easy to be reactive to the situation, though as soon as the service is restored it’s even more crucial to be proactive and continue to find out how, why and what went wrong and rectify the original issues
- Review IT Processes When Downtime Occurs
When downtime does occur it’s important to re-evaluate where the issue came from and look at how IT processes can be improved upon to make sure that an organisation is in the best position to deal with downtime when it occurs.
- Monitor Internal & External Services
Monitoring can drastically aid organisations when it comes to being alerted when downtime has occurred, understanding the knock-on effects and also giving insight into the reasons for the downtime. Monitoring should be seen as a critical part of the infrastructure of an organisation to enable businesses to keep track of their network and business services.
- Remove Complexity
In a contradictory way, removing complexity when it comes to networks enables a faster, more efficient and effective response to downtime. While planning for failure includes adding redundancy and failover procedures, thus increasing complexity, it’s also important to simplify to enable an easier view of what issues cause downtime.
- Follow Best Practices
Any IT team should follow best practices, such as ITIL, to enable them to manage an entire business service from end-to-end. This complete view of a businesses means there’s an alignment with business needs as well as gives clear accountability when issues do arise.
- Understand The Risks
Downtime is not the only risk that businesses face when dealing with cloud issues, security breaches and network outages both have the ability to incapacitate a businesses processes and can be, especially in the case of a serious data breach, more damaging to a business.
While downtime is somewhat sporadic it should always be planned for. All businesses should try to minimise the impact and the amount of downtime it has to ensure a level of system availability for their customers and staff.