<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Midwest IT Survival &#187; hero</title>
	<atom:link href="http://midwestitsurvival.com/tag/hero/feed/" rel="self" type="application/rss+xml" />
	<link>http://midwestitsurvival.com</link>
	<description>Discussion on IT roles in non-Silicon Valley yet tech savvy companies</description>
	<lastBuildDate>Thu, 19 Apr 2012 12:41:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>The Dreaded Root Cause Meeting for the Engineer, Part 4</title>
		<link>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-4/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-dreaded-root-cause-meeting-for-the-engineer-part-4</link>
		<comments>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-4/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 04:24:33 +0000</pubDate>
		<dc:creator>jfbauer</dc:creator>
				<category><![CDATA[Engineering]]></category>
		<category><![CDATA[competing priorities]]></category>
		<category><![CDATA[confused]]></category>
		<category><![CDATA[DBA]]></category>
		<category><![CDATA[hero]]></category>
		<category><![CDATA[play it safe]]></category>
		<category><![CDATA[players]]></category>
		<category><![CDATA[project sponsor]]></category>
		<category><![CDATA[projectified]]></category>
		<category><![CDATA[root cause]]></category>
		<category><![CDATA[root cause analysis]]></category>
		<category><![CDATA[ruggedization]]></category>
		<category><![CDATA[safe]]></category>
		<category><![CDATA[surprised]]></category>
		<category><![CDATA[surprised and confused]]></category>

		<guid isPermaLink="false">http://www.midwestitsurvival.com/?p=214</guid>
		<description><![CDATA[Anyone that has had to participate in a meeting to determine why some IT system went down is echoing a collective groan as they read this title.  For both IT managers and engineers alike, it is the least desired activity following a system failure of any kind.  Business and/or product owners outside of IT are [...]
Related posts:<ol>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 3'>The Dreaded Root Cause Meeting for the Engineer, Part 3</a></li>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-2/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 2'>The Dreaded Root Cause Meeting for the Engineer, Part 2</a></li>
<li><a href='http://midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 1'>The Dreaded Root Cause Meeting for the Engineer, Part 1</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Anyone that has had to participate in a meeting to determine why some IT system went down is echoing a collective groan as they read this title.  For both IT managers and engineers alike, it is the least desired activity following a system failure of any kind.  Business and/or product owners outside of IT are waiting, after the dust settles and the system is restored to working condition, to have primarily two questions answered</p>
<ol>
<li>Why did the system go down in the first place?</li>
<li>What is IT going to do to make sure this doesn’t happen again?</li>
</ol>
<p>In the <a title="The Dreaded Root Cause Meeting for the Engineer, Part 1" href="http://www.midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/" target="_self">first article</a>, I outlined the business context of the root cause analysis exercise in general and the complexities in clearly and logically arriving at a true root cause for a system outage due to the interconnected players involved.  In the <a title="The Dreaded Root Cause Meeting for the Engineer, Part 3" href="http://www.midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/" target="_self">previous article</a>, I outline a particular IT engineering resource approach entitled “Openly Be the Hero” to participating in the root cause analysis process.  This article introduces “Play it Safe”.</p>
<p><strong>IT Engineering Participatory Approach C</strong> = Play it Safe</p>
<div id="attachment_215" class="wp-caption alignleft" style="width: 160px"><img class="size-full wp-image-215" src="http://184.173.252.147/~bauerjf/wp-content/uploads/2009/09/blog-The-Dreaded-Root-Cause-Meeting-for-the-Engineer-Part-4.jpg" alt="Play it Safe!" width="150" height="150" /><p class="wp-caption-text">Play it Safe!</p></div>
<p>Having seen the potential pros and cons of approaches A and B, I assume you are wondering is there any way to play the root cause situation safely?  There is, but you are going to have to put your engineering brain and ego on hold a bit.</p>
<p><strong>Step 1</strong> = Resist the urge to be either “surprised and confused” or the Hero.</p>
<p>At the onset, avoid meetings, emails, hallway conversations and basically any situation that might put you in a position to start down the road of approach A or B.  Reply with vague “I’m not sure.  I think we are still looking into that.  I am waiting on &lt;whatever&gt;, let me get back to you” type answers.</p>
<p><strong>Step 2</strong> = Get with your management ASAP and give them a full run down of what is going on, the situation and the players involved.</p>
<p>As succinctly as humanly possible, state the problem “we may very well be part of the root cause for this outage”.  That should get management’s attention very quickly.  Follow-up with “here is what I know, stop me if I am going to fast or you already know all this already” and then quickly and briefly step through the problem clearly indicating where you believe/feel/think each “fact” ranks in authority.  In other words, don’t claim something is a fact unless you hold a log file printout in your hand that date and time stamps what you are saying.  “The commonly held view is the temporary storage volume filled up before anyone could purge files as the system needs, etc., etc., etc.”  “I am 50% confident based on this log entry that disk space was an issue.”  Be prepared to be stopped and asked all kinds of questions pertaining to how you know this, from whom, who else knows, etc.  What is happening is management is starting to build the story of what is taking place factually, the black and white versus gray-ness of those facts and how all the players are positioned to take the blame.</p>
<p><strong>Step 3</strong> = If management doesn’t define your role and thematically what to say and not to say, suggest your role and seek confirmation</p>
<p>Equally as important as step 2, confirming how management wants you to proceed is critical.  If you complete step 2 but then go off and “Being the Hero”, you will be susceptible to all of the cons associated with being the hero. Rather, if you are going forward and executing your role under the clear direction of management, as long as you indeed execute and seek clarity when a unclear situation presents itself, it will be exceedingly difficult to fall victim to the cons associated with “Being the Hero”.</p>
<p><strong>Step 4</strong> = Execute your role and keep management informed of major milestones</p>
<p>Go forth and help the post outage root cause investigation effort always being mindful of your role as indicated by your management.  As you are made aware of “major” milestones, make sure you go back and update management as soon as possible.  The timely updates directly assist in reshaping the story and may be accompanied by some tweak in direction to your role.  “Major” represents any event or new information that changes the shape of events.  “Bob in infrastructure just shared that the daily disk utilization report was indeed showing a reduction in free space for the last two weeks” = share ASAP.  “Bob just shared he forgot his lunch at home” = ignore.  Yes, these are rather obvious examples of what to share and not to share, but the goal here is to develop your own system for listening strategically to all the information that is being shared in order to parse out the noise and direct significant facts back to management.</p>
<p>In summary, the approaches and recommendations here may seem a bit extreme to many.  If you are lucky enough to belong to an organization that is culturally rational and fact based, you may not be forced into these scenarios.  Yet, all it takes is one situation to get out of control and the scenarios above become reality.  The rational, fact based logical analysis of an outage is replaced by the panicking, irrationality and emotion of engineers and managers faced with the notion of job loss due to failure to prevent an outage disaster that had major reputational and/or financial impacts.</p>
<p>Related posts:</p><ol>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 3'>The Dreaded Root Cause Meeting for the Engineer, Part 3</a></li>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-2/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 2'>The Dreaded Root Cause Meeting for the Engineer, Part 2</a></li>
<li><a href='http://midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 1'>The Dreaded Root Cause Meeting for the Engineer, Part 1</a></li>
</ol>]]></content:encoded>
			<wfw:commentRss>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-4/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Dreaded Root Cause Meeting for the Engineer, Part 3</title>
		<link>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-dreaded-root-cause-meeting-for-the-engineer-part-3</link>
		<comments>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/#comments</comments>
		<pubDate>Mon, 05 Oct 2009 04:24:43 +0000</pubDate>
		<dc:creator>jfbauer</dc:creator>
				<category><![CDATA[Engineering]]></category>
		<category><![CDATA[competing priorities]]></category>
		<category><![CDATA[confused]]></category>
		<category><![CDATA[DBA]]></category>
		<category><![CDATA[hero]]></category>
		<category><![CDATA[players]]></category>
		<category><![CDATA[project sponsor]]></category>
		<category><![CDATA[projectified]]></category>
		<category><![CDATA[root cause]]></category>
		<category><![CDATA[root cause analysis]]></category>
		<category><![CDATA[ruggedization]]></category>
		<category><![CDATA[surprised]]></category>
		<category><![CDATA[surprised and confused]]></category>

		<guid isPermaLink="false">http://www.midwestitsurvival.com/?p=209</guid>
		<description><![CDATA[Anyone that has had to participate in a meeting to determine why some IT system went down is echoing a collective groan as they read this title.  For both IT managers and engineers alike, it is the least desired activity following a system failure of any kind.  Business and/or product owners outside of IT are [...]
Related posts:<ol>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-2/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 2'>The Dreaded Root Cause Meeting for the Engineer, Part 2</a></li>
<li><a href='http://midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 1'>The Dreaded Root Cause Meeting for the Engineer, Part 1</a></li>
</ol>]]></description>
			<content:encoded><![CDATA[<p>Anyone that has had to participate in a meeting to determine why some IT system went down is echoing a collective groan as they read this title.  For both IT managers and engineers alike, it is the least desired activity following a system failure of any kind.  Business and/or product owners outside of IT are waiting, after the dust settles and the system is restored to working condition, to have primarily two questions answered:</p>
<ol>
<li>Why did the system go down in the first place?</li>
<li>What is IT going to do to make sure this doesn’t happen again?</li>
</ol>
<p>In the <a title="The Dreaded Root Cause Meeting for the Engineer, Part 2" href="http://www.midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/" target="_self">first article</a>, I outlined the business context of the root cause analysis exercise in general and the complexities in clearly and logically arriving at a true root cause for a system outage due to the interconnected players involved.  In the <a title="The Dreaded Root Cause Meeting for the Engineer, Part 2" href="http://www.midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-2/" target="_self">previous article</a>, I outline a particular IT engineering resource approach entitled “Surprised and Confused” to participating in the root cause analysis process.  This article introduces “Openly Be the Hero”:</p>
<p><strong>IT Engineering Participatory Approach B</strong> = Openly Be the Hero</p>
<p>“I know what happened, the temporary storage volume …..”</p>
<div id="attachment_211" class="wp-caption alignright" style="width: 160px"><img class="size-full wp-image-211" src="http://184.173.252.147/~bauerjf/wp-content/uploads/2009/09/blog-The-Dreaded-Root-Cause-Meeting-for-the-Engineer-Part-3.jpg" alt="I will save the day with facts no one can refute!" width="150" height="100" /><p class="wp-caption-text">I will save the day with facts no one can refute!</p></div>
<p>This approach, which is diametrically opposed to the surprised and confused approach, comes with some different risks.  By standing up and sharing every technical fact you can get your hands on to point out what really is going on can back fire in exactly the opposite way as the surprised and confused option.  People will tend to latch on to the one spouting off all the undeniable facts and suddenly the masses will associate the one with all of the answers as the one being in a position to have avoided the problem all together.  As far as your management goes, if they aren’t on board, you’ve placed them in a difficult spot to be supportive if the tide turns towards the root cause being the hero’s perceived lack of involvement.  Your peers, fearing their job might be in some jeopardy, will most likely slink down in their chairs to remain quiet and allow you to stand tall to take the proverbial daggers of blame.</p>
<p>Now if you are one that has put in the extra energy to understand how the system or systems were constructed, the “why” behind the seemingly architecturally backwards ways certain business processes are completed you may struggle with avoiding the hero trap.  You may be thinking: “The facts that I possess clearly indicate without compromise that what I known to be the root cause is the root cause.  Why can’t everyone just go with the facts and be done with it?”  Not everyone is comfortable accepting the facts even if they are the facts.  What if the facts suggest a particular individual or group of individuals have been linked to the last five system outages?  Maybe these five outages are legit and the individual or group is trying desperately to improve their system management activities.  The last thing they need is another problem piled on top of their previous problems to further put pressure on management to take some action.  In an effort to save their jobs and buy more time to get out from underneath their pile of problems they can redirect the masses to focus on the hero’s involvement and thus take the heat off themselves.</p>
<p>“Let me understand, the Hero knew that this problem was going to happen but didn’t do anything to stop it?  Why is the Hero hiding knowledge that would help the company?  This is yet another example of the Hero not sharing and not partnering.  How can the Hero just sit idly by and allow this to happen.  Something needs to be done about the Hero …”</p>
<p>And this “something that needs to be done” … and get ready, this is going to make any logical thinking IT engineer’s head spinning … could be as severe as disciplinary action cast upon the Hero.  Why is such an illogical outcome such as the individual that amassed such valuable knowledge to be able to assemble together all the puzzle pieces of the problem become the victim of some disciplinary action?  The answer falls more on the organizational hierarchy than on conventional logic.  If the individual that is uttering those statements about the Hero is significantly high on the organizational chart, then the layers below, who have been focusing on all sorts of other fires, are caught without a good story as to why this situation occurred and why the Hero is not the root of all evil.  Not being armed with a story that shields the Hero, the management layers in between are somewhat constrained and thus the blame lands on the Hero.  For more of the management side of the Hero’s plight, see the articles that cover this in the management section.</p>
<p>Sure, your peers might find you after meetings and give you kudos for standing up for the facts, but is being technically “right” worth the cost of being put through this ancillary pain?</p>
<p>The next article introduces the hybrid approach which I’ve entitled “Play it Safe”</p>
<p>Related posts:</p><ol>
<li><a href='http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-2/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 2'>The Dreaded Root Cause Meeting for the Engineer, Part 2</a></li>
<li><a href='http://midwestitsurvival.com/2009/09/the-dreaded-root-cause-meeting-for-the-engineer-part-1/' rel='bookmark' title='The Dreaded Root Cause Meeting for the Engineer, Part 1'>The Dreaded Root Cause Meeting for the Engineer, Part 1</a></li>
</ol>]]></content:encoded>
			<wfw:commentRss>http://midwestitsurvival.com/2009/10/the-dreaded-root-cause-meeting-for-the-engineer-part-3/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

