Yesterday’s Hotmail, Outlook.com, SkyDrive outage? Hotmail got too hot

Some Hotmail.com, Outlook.com, and SkyDrive services were down for as much as 16 hours yesterday, with an outage that started at 1:35 pm PDT and service that wasn’t fully restored until 5:43 am PDT this morning, according to a blog post by Microsoft Corporate Vice President Arthur de Haan on the Outlook blog. The outage, according to a Microsoft spokesperson (as quoted by GigaOm), “affect(ed) a small number of users’ access to Hotmail and Outlook.com,” but was serious enough to warrant the blog post today.

De Haan explained the outage in a “root cause analysis”:

On the afternoon of the 12th, in one physical region of one of our datacenters, we performed our regular process of updating the firmware on a core part of our physical plant. This is an update that had been done successfully previously, but failed in this specific instance in an unexpected way. This failure resulted in a rapid and substantial temperature spike in the datacenter. This spike was significant enough before it was mitigated that it caused our safeguards to come in to place for a large number of servers in this part of the datacenter.

These safeguards prevented access to mailboxes housed on these servers and also prevented any other pieces of our infrastructure to automatically failover and allow continued access. This area of the datacenter houses parts of the Hotmail.com, Outlook.com, and SkyDrive infrastructure, and so some people trying to access those services were impacted.

Now, we’re not datacenter experts (we have a hard enough time keeping our one server running), but a firmware update that causes “a rapid and substantial temperature spike” and that needed “a mix of infrastructure software and human intervention” to bring the datacenter back online sounds a bit ominous. Of course, for those affected (our numerous Hotmail.com accounts didn’t seem to be impacted), just being down for up to 16 hours was probably ominous enough. Still, service was restored, everyone has all their data back online, and nothing (supposedly) caught on fire. We can be grateful for that, at least.


  • http://twitter.com/jjMustang Jerad

    A customer that I admin has 5 servers, and when the a/c went out in their server room, the temperature went from 70 degrees F to 90 degrees F in a matter of a few hours. MS doesn’t say if it was server firmware or perhaps firmware on an air conditioner controller/sensor, but multiply the heat output I was talking about by 100 or 500 and you’ve got serious heat.

  • Guest

    The recent Azure incident report had much more detail on what happened, as well as what was learned and what would change in the future. This one leaves lots of unanswered questions. MS also needs to look at why their outages take so long to recover from relative to competitors.

  • http://www.facebook.com/people/Jer-Ming-Chen/833833653 Jer Ming Chen

    As a long-time Hotmail user, I had never seen as many hiccups as I have with Outlook.com. Combined with the Azure problems, MS has a lot to prove about its server stability.

  • http://twitter.com/New_Tech_World Latest Technology

    This is just going down badly, just like Windows Surface and Windows 8 are down, lol. All customers are facing big problems. I don’t want to see the PS4 in the same position.

    http://sonyps4playstation.com/?s=microsoft