Your server is made up of multiple components, any one of which could possibly fail. If you’ve ever found your company suffering because “the server’s down again” then something is probably wrong with the way it was setup. Any server, even a low spec one, should have as few ‘Single Points of Failure’ as possible (i.e., as few places as possible where only one ‘widget’ is doing the job, and failure of that ‘widget’ brings the whole server down).
Here’s how to review your server components and make sure you’re protected against single component failure. You don’t need to be a technical guru to check these things, the quickest way is to examine your original purchase records to check the specification of your server, if in doubt, ask the original supplier or the person who set it up.
- Disks. Disks are one of the most common points of failure on a server and one disk failing should NOT bring your server down. You have choices on how to have the disks setup, but they all revolve around something called RAID (see http://en.wikipedia.org/wiki/RAID). In a nutshell, you duplicate disks so they share the work, and if one goes down, everything keeps running. There are two choices of disk type, SATA (cheaper) and SAS (faster), and there are two main types of RAID configuration; RAID1 (initially cheaper but becomes expensive as you increase your storage capacity), and RAID5 (initially more expensive, but cheaper for more disk space as you don’t need to buy as many disks). All your servers drives should be provided using RAID so if one disk fails, your server keeps running. Even better is ‘hot pluggable’ disks, which are much easier to replace (with care almost anyone can do it in five minutes), they just slot in, and slot out, and you can do this while the server is running.
- Memory. If you have only one memory module, then that module is a ‘Single Point of Failure’. If it dies, so does the server. A better option is to use two, or even four memory modules. Ideally if one module fails you should still have the minimum required to run all server functions, so pad a bit (i.e. if your server runs OK with 2GB, buy 2 x 2GB modules, it will probably run faster, and if one fails it will still keep going).
- Network card. Most low end servers come with a network card built in, it’s wise to supplement this with another network card. You don’t need to set up any technically complex or expensive configuration, it just means that if one fails, you (or your technical support provider) should be able to quickly configure it to replace the failed component, instead of waiting a day for a new network card to be delivered by your supplier.
- Power Supply Unit (PSU). The PSU failing will instantly bring your server down. Add a second PSU which runs simultaneously with the first, so a PSU failure doesn’t even slow the server down.
Costs
You might think this ‘big business IT’ stuff costs a lot of money, but it doesn’t. Below are the costs1 of implementing the redundancy measures outlined above on an HP Proliant ML350 G5 server (which purchased with one disk would cost £858 ex VAT).
- Disks. Adding 3 more disks to allow you to implement RAID1 would cost £221
- Memory. The server comes with one 1GB memory module. Upgrading this to 4GB (2 x 2GB modules) would cost £120
- Network card. The server comes with one network adapter, adding a second adapter would cost £29
- PSU. The server comes with one redundant power supply, adding a redundant PSU to an ML350 G5 would cost £104
The total cost of all these options is £474, making the total server cost £ 1,332.
Think that’s too expensive? Offset it against the cost of downtime. Taking a model company2 of 20 staff, the cost in salary alone (i.e. the percentage of computer-related productivity lost in terms of employee salary) is over £900 per day! This doesn’t include any lost sales, the negative impact on customer relationships, the cost of the time taken to recover from the outage once it is over etc.
If you’ve had one outage in the last year, it’s well worth looking at these practices, because they will save you money, keep you more competitive, and possible make the difference between a good year, and a bad year.
Richard Bartlett offers expert and independent IT Consultancy Services to review your IT, help buy the right systems, install them, train your users to use them, and ensure you have appropriate IT Support. For more information on services including installation and configuration services which can give your systems resilience see http://www.rgbartlett.co.uk/services/
1 All prices taken from ebuyer.com on 28th August 2008.
2 Based on a model using average salaries for the Cambridgeshire region supplier by Reed, and an algorithm developed to calculate downtime costs. If you would like to know the cost of downtime for your business get in contact (see http://www.rgbartlett.co.uk/contact/).
