For customers yet to transform to an SD-WAN solution, configuration errors also pose a threat to your network, causing an estimated 20% of network downtime. As WANs continue to expand to encompass greater use of cloud services, more and more individual configurations across network layers are required – exponentially increasing the potential for manual error or simply not testing failover processes.
And that’s not all. Organisations need to factor in the potential for power outages or defective hardware as drivers of network downtime too.
So, what can businesses do about these known issues?
The key is instilling a culture of proactive prevention across the entire organisation, beginning with the network, IT, and engineering teams.
For example, Telstra is spearheading the drive to instil a mindset that ‘prevention is better than the cure’ through an approach called Zero Preventable Downtime.
The framework drives best-practice behaviour in a range of ways, focusing on learning from experiences, significantly increasing the transparency of network paths end-to-end, and understanding issues and challenges affecting the network at both the design stage and the delivery stage, for example.
Then the key is to embed downtime mitigation activities across design, delivery and operations. To do that, Telstra ensures robust controls are in place to provide evidence of diversity at the design & delivery stages, periodically runs preventative audits, and integrates technology solutions to reduce risk and increase availability.
But what does that look like in a real business?
As an example, take one of the world’s most innovative healthcare manufacturers.
Telstra implemented the ‘zero preventable downtime’ framework to drive a real mindset change, the team worked hand-in-hand with the customer to reduce Network Downtime by over 50% over a period of 12-months.
Following a range of workshops and audits, the team began integrating ITIL (IT Infrastructure Library) and Zero Preventable Downtime frameworks to proactively identify potential network issues. Ultimately, the customer was able to create preventative solutions to solve problems before they manifested themselves into costly downtime.
What is clear is that each individual tactic is less important than the over-arching culture the healthcare manufacturer was able to create.
In a world of rapid digital change that is increasingly dependent on networks always being on, our collective ambition for 100% application availability requires a change of mindset from every business and every employee. That means investing the same resources on learning from the past as you do in anticipating potential future causes of downtime, and it means taking preventative measures from day one. Only then can we truly mitigate the risk – and cost – of network downtime.