Xero taken offline by massive US data centre failure

One of the drawbacks of cloud computing was dramatically illustrated today as Rackspace - one of the world's largest web hosts - when offline for 45 minutes. New Zealand’s Xero was one of many SaaS (software-as-a-service) providers knocked out by the failure, with glitches continuing for hours.

The accounting software provider went offline around 8.30am as Rackspace, which hosts all of Xero's data, was hit by a still-unexplained, catastrophic failure.

All Xero servers were back up and running by 9.10am chief operations officer Alistair Grigg told NBR.

Some customers were still reporting problems logging on through the morning and early afternoon, as recorded on Xero’s blog. Mr Grigg says these were cookie and DNS (domain name server) issues, which were resolved by asking customers to restart.

SaaS, interrupted
Rackspace’s catastrophic outage has shocked the computer community worldwide.

The fault was caused by a power failure at the US company’s giant data centre in Dallas.

But with Rackspace maintaining server farms around nine locations in the US, UK and Hong Kong, it’s not clear why a failure at one facility took its systems completely offline.

Mr Grigg says Rackspace has multiple utility providers, backup generators and uninterruptable power supply systems at its Dallas facility, as would be expected from one of the world’s largest data hosts.

So far, no official explanation has been forthcoming.

The power fault also took out Rackspace's own website and help centre, adding to the confusion. It was left to the company's Twitter account to relay the disaster to the outside world.

When Rackspace came back online, it was running on a mix of utility and backup power, Mr Grigg notes.

The COO speculates that “there must have been some pretty significant component failure possibly at the point where maintenance work was being done”.

The outage marks Xero’s first ever hosting failure during its first two and a half years of operation. Mr Grigg says only a handful of UK and Australian customers were affected, due to the timing of the outage.

Has his faith in the cloud been shaken?

No. Mr Grigg says that after 27 months, Xero still has a record of more than 99.99% uptime.

He said the company could look at a second cloud host, but that such a disaster recovery switch typically took several hours, whereas Rackspace got Xero back online within 45 minutes (some customer with more complex systems took longer).

Xero customers could take some comfort in they were among millions hits - Rackspace hosts sites and services for more than 62,000 companies - with US singer Justin Timberlake leading complaints via, what else, Twitter.

Comments

Our Experience

Really disappointed with Rackspace. We pay a premium for their service, not because we expect perfection, but because we want 'the best' when something does go wrong.

Once our sites went down, we couldn't get through to Rackspace on the phone. Their online support system was unavailable. And when we finally did get through received no information at all. Their twitter account was even slow to post updates.

After their incident in November 2007 (truck crash took out our servers) and then reading about reported incidents from other companies earlier this year, I've lost faith. It does not seem acceptable that their backup systems have failed on multiple occasions - it makes one wonder about whether they promise more than they can deliver.

We are not a huge customer, but we are not small either - around $100K/yr. We are going to look for alternatives, and if we do indeed move on, I'm skeptical that our account rep will even notice or care. This just may be a case of growing big and indifferent. It's a shame.

I was down for 5 hours...

I was down for 5 hours...

Data Center Alternatives

We work with several data centers throughout the US that have tested and proven back up power and generating equipment. Testing is required on a regular basis to ensure that when needed.....IT WORKS! If you are looking for alternatives, call me at 817-529-9675.

The Irony

This just goes to show, no matter whether your accounting package is locally hosted or in the cloud, we will always be susceptible crashes. I guess in defence of the cloud option, at least we did not have to pay for an IT consultant to come round and fix the problem.

Post new comment

The information entered here will appear with your comment.
Leaving this field blank will default to anonymous.

More information about formatting options