Ditzes Cause Data Fritzes
Be careful which switch you flick. Operator error is one of the main reasons data centers go bang
December 10, 2004
Human errors are the biggest cause of data center downtime, according to vendors and users at the NDCF-sponsored Data Center Forum 2004 in New York this week.
Research presented by American Power Conversion Corp. revealed that operator errors, along with shortcomings in data center design and construction, are responsible for more than 60 percent of all data center failures. Equipment failures accounted for around one third of outages, and external causes such as floods and earthquakes were responsible for just over 5 percent of downtime.
Data center failure is a sensitive subject; one IT manager attending the conference agreed to talk to NDCF on condition that his name was not published. Nonetheless, he was not exactly shocked by APCs findings. “Data centers are a very dynamic environment. There’s a lot happening, and not everyone takes the time to update their documentation,” he says.
"Documentation" refers to the manuals for the different pieces of the data center kit, which are usually kept in a central directory that staff can refer to. The IT manager says some data centers are better at keeping these directories up to date than others. However, he admits that staff turnover and lack of experienced ITers contribute to the problem.
Neil Rasmussen, APC’s founder and CTO, cites training as an area that could be improved to help reduce the risk of human error. Common standards such as the data center markup language (DCML) will also help by standardizing systems management, he adds, although this is still some way off.DCML is an XML-based standard for utility computing and system management sponsored by the likes of BMC Software Inc. (NYSE: BMC), EMC Corp. (NYSE: EMC), and Inkra Networks. Just more than a year old, DCML is now under the wing of the Oasis consortium, which has a strong track record developing e-business standards (see DCML Hits Milestone).
Another emerging data center standard is the IT Infrastructure Library. Essentially a framework for IT service management, ITIL has recently been attracting a great deal of interest from data center managers (see Afcom Sets Standards).
Rasmussen confirms that APC intends to align its products with ITIL, which covers provision of IT services and the data center infrastructure needed to support them.
— James Rogers, Site Editor, Next-Gen Data Center Forum
Read more about:
2004You May Also Like