Incident versus Problem Management

Years back, when I was still earning a living by writing and maintaining code, I had a colleague, a system engineer who looked after the servers that our application ran on. As with any application, ours had bugs. Occasionally those bugs surfaced and brought about a service disruption. This affected quite a lot of users, and naturally our first response was to restore the service (incident management).

Our colleague, however, was more focused on problem resolution. A scheduled weekly restart at off-peak hours was not acceptable to him. Likewise, we could not keep increasing the heap size to address out-of-memory (OOM) issues. He would insist on fixing the underlying problem instead of applying quick fixes. At the time, this frustrated me.

Back then, I was still green and lacked ITIL knowledge. Now I have come to appreciate that our quick fixes and workarounds were not wrong: our intention was to restore the service quickly so that business could continue as usual. My colleague was not wrong either in insisting on problem resolution.

Having learned the hard way, I have come to realize that not all problems have a solution, and sometimes a workaround can actually become the permanent fix. Problem resolution also takes time, and business must continue while the answer is being sought; a workaround, if available, buys that time.

Perhaps what was missing at that time was ITIL training for both the application team and our system engineers. If we had known what incident management and problem management were, perhaps both teams could have worked together with better synergy.


2 responses to “Incident versus Problem Management”

  • Craig Kelly

    Good point. It is very important to take steps to protect service, which will allow the Problem investigation to progress in its own time. Workarounds can be cost effective on the surface; however, an I.T. estate with thousands of workarounds can get complicated.

    • cw_l

      Yes, you’re right that it could get complicated if there were a huge number of workarounds to keep tabs on. But in practice this shouldn’t happen, because if it does, something is very wrong with the quality of work.
