This is the Message Centre for Icy North

A Major Incident Manager's Lot is Not a Happy One

Post 1

Icy North

I can cope with most IT major incidents at work.

I first talk to the people on the help desk, to see who's ringing in with the problems, which locations they're at and what is/isn't working. I also talk to the technical support teams, who triage the incident and work out the best plan of attack for getting users working again. Sometimes they've seen it happen before, so they know how to fix it. Sometimes they can switch users over to some sort of backup system while they investigate. Sometimes they see that it's a problem with a product, so they raise a call with the vendor.

I usually open a conference call so everyone can discuss what they're doing. I send out communications via SMS or e-mail to the users affected, as well as to the management. I get recorded messages put on the help desk phone system, too. Eventually everything gets fixed, then we investigate what caused it to happen in the first place and work out how we can prevent it in future. I'll then write a report about it all.

Typically it takes half a day to a day to get all this done (assuming we fix it in that time). I have to stop everything else I'm doing to manage a major incident, so it creates a backlog of the day-to-day tasks, which I then have to catch up with in due course.

I can cope with a few major incidents, maybe one a week, without it seriously affecting my schedule. For the last few months, that was about the rate they came in - until October. October was meltdown. We had around 17 major incidents, including one that took nearly a week to fix (I was working days and nights on it).

The worst thing is when they happen simultaneously. Not only does the help desk get overwhelmed, but the triage gets confused too. It's not immediately apparent whether or not the different incidents are caused by the same fault. Do you assume they are and get, say, the networks team to look at it? Or do you assume they're different, and get the server team to look at one and the database team to look at the other one? Communications get confused too. Do you send out separate e-mails to the people who reported the incidents, or do you send out a wider e-mail to everyone who might be affected if the cause is wider?

Fortunately, getting simultaneous major incidents is rare.

Or, it used to be. This morning I had 3 simultaneous ones smiley - sadface


Post 2

Elektragheorgheni -Please read 'The Post'

smiley - yikes That must be why they pay you the big bucks! smiley - tongueincheek


Post 3

Icy North

smiley - rofl smiley - rofl smiley - rofl smiley - biggrin smiley - smiley smiley - erm smiley - sadface smiley - wah


Post 4

Pirate Alexander LeGray

I don't know; if you didn't have incidents you would be bored. I think incidents are created to keep you interested. Just imagine the perfect program that never went wrong.


Post 5

Deb

smiley - cheerup


Post 6

Recumbentman

I love programs that never go wrong.


Post 7

Vip

I think I would take boredom!

That's pretty hard work, Icy. smiley - sadface Do you get tune if in lieu, or is it all seen as part of the job?

smiley - fairy


Post 8

Icy North

Yes, Vip, they recently gave us the option of taking either overtime or free music downloads.

smiley - biggrin


Post 9

Titania (gone for lunch)

(smiley - strawberry)


Post 10

Deb

I'm not a fan of LOL, but that really did! smiley - rofl

Deb smiley - cheerup


Post 11

Gnomon - time to move on

I like the sound of tune in lieu. smiley - musicalnote


Post 12

Amy Pawloski, aka 'paper lady'--'Mufflewhump'?!? click here to find out... (ACE)

[Amy P]

