This is the Message Centre for Icy North
Icy Naj 24 - You Are The Tech...
Icy North Started conversation Nov 26, 2014
This is another in my occasional series of entries about the life of IT technicians. I describe an incident. You have to say what you think caused it and what they should do next.
This is a true story. Only the names have been changed, to protect the incompetent.
* * *
An engineer visits a modern data centre to install some new network equipment. Let's say it's on a business park somewhere between London and Swindon. The data centre contains lots of racks of IT equipment: servers, network switches, storage devices, etc. The equipment belongs to two different organisations. In the racks at one end is the equipment for a well-known utility company, let's call them O-Power. In the racks at the other end is the equipment running vital services for a high-street bank. Let's call them HTSB.
The engineer installs new network equipment for HTSB into an empty rack. It's switched on, and everything works fine.
A week later they go back to the data centre to decommission all the redundant HTSB equipment. They unplug all the network cables and power supplies, then remove all the servers and devices off the racks. When they've finished, they have an empty rack, and lots of equipment piled haphazardly on the floor behind them.
Then the phone rings.
"We've lost access to O-Power's systems! This is critical! Please can you put everything back in reverse order so we can find out what caused it!"
The engineer looks up at the empty rack, then down at the haphazard pile of hardware and wishes he'd brought a spare pair of trousers.
What should he do next?
* * *
Icy Naj 24 - You Are The Tech...
2legs - Hey, babe, take a walk on the wild side... Posted Nov 26, 2014
*thinks* Hmmm.... did they switch the power off at the master mains/power switch, before removing just the bits they were there to remove, and, thusly, basically powered all the stuff in all the racks down? . . well... ...
Icy Naj 24 - You Are The Tech...
Icy North Posted Nov 26, 2014
They just flipped the power switches on the devices then yanked the leads out.
Icy Naj 24 - You Are The Tech...
Amy Pawloski, aka 'paper lady'--'Mufflewhump'?!? click here to find out... (ACE) Posted Nov 26, 2014
[Amy P]
Icy Naj 24 - You Are The Tech...
Gnomon - time to move on Posted Nov 26, 2014
It appears that one of the servers in the HTSB end was actually an O-power server.
My first response would be - switch to the other data centre.
Icy Naj 24 - You Are The Tech...
Icy North Posted Nov 26, 2014
The servers were in the right rack in this case.
Invoking Disaster Recovery is probably exactly what I'd have suggested in the circumstances, although it would presumably be others in operations management who would have made that decision. Automatic failover would have been nice, too.
In this case, the engineer kept a cool head and worked out what had happened.
Icy Naj 24 - You Are The Tech...
Florida Sailor All is well with the world Posted Nov 26, 2014
I would try powering up all the removed severs off-line and see if any contain O-Power files.
I would try to ping the routers from the other bank in the meantime.
F S
Icy Naj 24 - You Are The Tech...
2legs - Hey, babe, take a walk on the wild side... Posted Nov 26, 2014
I'd probably try screaming at it, then get out the hammer... but that might not work so well... works for a lot of stuff though
Icy Naj 24 - You Are The Tech...
2legs - Hey, babe, take a walk on the wild side... Posted Nov 26, 2014
Maybe! There are some people that hold solumly to the addage you need only two tools; WD40 and duct tape, and, to these people, I say they are wrong; You need three tools; WD40, duct take, and a hammer. Curiously, its also the same list, for my medical supplys, kept in the house; WD40 Duct tape and a hammer... fixes 99% of all DIY and medical emergencys....
Icy Naj 24 - You Are The Tech...
Bluebottle Posted Nov 27, 2014
Coincidentally my son put his finger in a hole in a railing yesterday, where a bolt had been, and got it stuck. Guess which of the 3 tools suggested above was used to extract him?
<BB<
Icy Naj 24 - You Are The Tech...
Icy North Posted Nov 27, 2014
Any more takers for this?
I'll tell you that the engineer decided to wander over to the O-Power racks and take a look.
Icy Naj 24 - You Are The Tech...
Icy North Posted Nov 28, 2014
...and he saw that most of the devices were happily flashing away, but that 3 or 4 of them appeared to be powered off...
Icy Naj 24 - You Are The Tech...
Beatrice Posted Nov 28, 2014
Was it something else that had happened, not related to the engineer at all?
Or else it was the cleaner.
Icy Naj 24 - You Are The Tech...
Florida Sailor All is well with the world Posted Nov 28, 2014
I did consider this might be a coincidental incident, but as more than one unit was down, I would suspect they had been pulling power from the other rack.
I would start pulling the units that were down and trace their power supplies and other connections.
I suspect I would have found this while examining O-Powers rack
F S
Icy Naj 24 - You Are The Tech...
Icy North Posted Nov 28, 2014
Yes, you're right, FS. It was a power issue.
What happened was that when the engineer switched off one of the HTSB units it happened to trip a circuit breaker. What he didn't realise was that this unit was actually powered through the same circuit breaker as some of the O-Power machines.
When the data centre operators were quizzed on it, they admitted that it had been like that for years, from a time when other equipment was installed in the racks. They had never properly checked the power circuits since.
But, hey, it only brought down a major utility's IT systems.
Key: Complain about this post
Icy Naj 24 - You Are The Tech...
- 1: Icy North (Nov 26, 2014)
- 2: 2legs - Hey, babe, take a walk on the wild side... (Nov 26, 2014)
- 3: Icy North (Nov 26, 2014)
- 4: Amy Pawloski, aka 'paper lady'--'Mufflewhump'?!? click here to find out... (ACE) (Nov 26, 2014)
- 5: Gnomon - time to move on (Nov 26, 2014)
- 6: Icy North (Nov 26, 2014)
- 7: Florida Sailor All is well with the world (Nov 26, 2014)
- 8: 2legs - Hey, babe, take a walk on the wild side... (Nov 26, 2014)
- 9: Florida Sailor All is well with the world (Nov 26, 2014)
- 10: 2legs - Hey, babe, take a walk on the wild side... (Nov 26, 2014)
- 11: Bluebottle (Nov 27, 2014)
- 12: Icy North (Nov 27, 2014)
- 13: Icy North (Nov 28, 2014)
- 14: scorp (Nov 28, 2014)
- 15: Beatrice (Nov 28, 2014)
- 16: Florida Sailor All is well with the world (Nov 28, 2014)
- 17: Icy North (Nov 28, 2014)
More Conversations for Icy North
Write an Entry
"The Hitchhiker's Guide to the Galaxy is a wholly remarkable book. It has been compiled and recompiled many times and under many different editorships. It contains contributions from countless numbers of travellers and researchers."