Skip to main content

Command Palette

Search for a command to run...

👨‍💻Standard Operating Procedures (SOP) Boilerplate for Information Technology (IT) Service Management including Data Centres🚂

Most IT environments do not necessarily utilize SOPs (Standard Operating Procedures) but in reality it is a crucial documentation that is a prerequisite

Updated
👨‍💻Standard Operating Procedures (SOP) Boilerplate for Information Technology (IT) Service Management including Data Centres🚂
R

Driving SD-WAN Adoption in South Africa

The Importance of Documentation in IT Environments

In the world of technology, documentation often feels like an afterthought—an unnecessary burden that techies avoid like the plague. Many IT environments are barren landscapes when it comes to Standard Operating Procedures (SOPs), leaving new hires to "sink or swim." This pervasive reluctance to document creates a vicious cycle: managers who endured a lack of SOPs during their early careers often perpetuate the same culture, ridiculing documentation as bureaucratic and unnecessary. The unfortunate result? A hero culture where critical knowledge resides in the heads of a few individuals rather than being accessible to the broader team.

The argument, “Would you rather I solve problems than document them?” is a common retort. However, documenting while solving problems is precisely what should happen. Imagine a new hire being handed a comprehensive SOP on day one—how much more productive could they be? Instead, most techies spend weeks fumbling in the dark, trying to piece together how the systems they’re responsible for even work.


The Impact of Missing Documentation

From years of involvement in problem management, I’ve observed a recurring pattern: the lack of documentation directly contributes to major incidents. Let’s break it down:

  1. Misconfigurations
    Most major incidents stem from misconfigurations. Why? Techies either don’t know how a system should be configured or are "flying blind" due to a lack of documentation. SOPs would provide the precise steps needed to configure systems correctly, significantly reducing these errors.

  2. Component Failures
    When incidents arise from hardware failures, the root cause often ties back to missing information—where components are located, their maintenance schedules, or warranty details. Proper documentation of these details could save hours of troubleshooting and ensure timely replacement or repair.

  3. Vendor Issues
    Mismanagement of carriers and service providers often boils down to undocumented service-level agreements (SLAs) and processes. Clear SOPs would ensure everyone understands vendor roles and responsibilities, streamlining resolution times.


What SOPs Bring to the Table

The absence of documentation isn’t just a minor inconvenience; it’s a critical gap that affects operational efficiency, incident resolution, and compliance. Here’s why SOPs are invaluable:

  • Operational Clarity: They provide techies with the necessary environmental and operational knowledge to do their jobs effectively.

  • Consistency: SOPs ensure IT operations follow a standard process, maintaining quality across people, processes, partners, and products.

  • Business Continuity: With SOPs in place, processes can run uninterrupted, avoiding delays and costly errors.

  • Incident Prevention: Proper procedures documented in SOPs help prevent failures that could escalate into major incidents.

  • Compliance: SOPs support corporate due diligence, ensuring adherence to regulatory requirements.

  • Training Tool: SOPs serve as onboarding guides, enabling new hires to hit the ground running.

  • Evaluation Checklist: They provide benchmarks for performance reviews and audits.

  • Knowledge Retention: When experienced techies leave, SOPs ensure their knowledge doesn’t walk out the door with them.

  • Post-Incident Analysis: SOPs chronicle historical steps in processes, helping dissect what went wrong during major incident reviews.


Documentation as a Cultural Shift

To shift the culture, IT leaders must champion the value of documentation. They must challenge the "hero mentality" and position SOPs as a core element of operational excellence. A well-maintained repository of SOPs transforms an IT environment from a chaotic firefighting zone into a well-oiled machine.

It’s time to stop resisting documentation and embrace it as the lifeline it is—both for individual productivity and organisational resilience. After all, when the inevitable happens and the system breaks, would you rather rely on guesswork or a proven, documented process?


SOP Overview

A SOP is a document that describes procedures. Procedures are ongoing processes which are open ended as opposed to projects which have a definite starting and ending time. Commissioning a system is a project with tasks that end at a point where the system is "standing", nearly like a building is standing when it has been erected. SOPs are those procedures that are executed after the system "stands". Hence, standing operating procedures.

Ramon Padilla writes on TechRepublic, IT leadership in "Master the mundane with SOPs that fit".

" The best run businesses are those which have mastered the mundane.

That is a pretty profound statement because, if taken literally, it means that the business has perfected its day-to-day activities to such a degree that it is free to focus on being agile and responsive to the forces that affect it in a proactive way.

The path to this mastery is partially through efficient and effective standard operating procedures (SOP). SOPs are defined as detailed, written instructions to achieve uniformity of the performance of a specific function.” In other words, procedures that make sure that inputs to a process result in the same outputs in terms of quality, quantity, consistency, etc.

SOPs by design, should be based on the best way of doing things in your business, often reflecting the best practices of the industry. Thus, we have a tie-in to methodologies such as COBIT, ITIL, and ISO.

So in general, SOPs are a good thing. However, it never ceases to amaze me how businesses can take a desire to implement best practices and SOPs and completely miss the boat on what they are doing and why they are doing it.

The purpose of SOPs is to make your business perform better and faster, with higher quality, increased customer service, and accountability. The SOPs ARE NOT the end product, nor are they designed to be so rigid as to make conducting business painful. You know where I am coming from here. I’m talking about the kind of “procedures” that are so cumbersome and time consuming that they make you want to pull your hair out (whats left of it); that make work for people whose only reason to exist in the business is to enforce the “procedures;” that were designed so long ago they haven’t kept up with the changing business, or for IT, changes in technology."

Here are some items to use as a boilerplate for a SOP.

  • Title. Define the SOP clearly with a suitable title that clearly defines the procedures or category of procedures.

  • Dates. Provide an Issue date and Effective date.

  • Document history. Detail a brief overview of changes to the document over time.

  • Approvals. State the authority and which the document has been created.

  • Description. Provide a brief overview of the document (or alternatively an executive summary).

  • Purpose and background. Detail why the procedures are being defined and any relevant justifications. Help people understand why the procedures are being implemented.

  • Scope. Clearly state which procedures are covered and those which are excluded in this document. Reference excluded procedures to the relevant SOP if they are not covered by this one.

  • Definitions. Define any ambiguous words or terms and all acronyms.

  • Operations. The procedures stated here need to be mapped into the greater IT operations (refer the mindmap below):

  • Vital functions affected by this SOP. These directly refer to the business functions that are aligned to and impacted by the document IT procedures. If there are problems with the systems or procedures covered in this SOP how will the business be impacted.

  • Lessons learned. Clearly list and describe all lessons learnt or how a knowledge base that includes this information can be accessed.

  • Equipment, hardware and software lists. List the inventory or attach as addendum. In large businesses this might be linked to a separate system that contains this information. However, be aware that a snapshot is required that aligns itself to the issue date of the SOP.

  • Checklists. List an checklist being applied and used. (Read here about checklists).

  • Media. All Images and diagrams relevant to the procedures.

  • Configuration. The actual configuration of the systems effected by the procedures, such as network equipment.

  • Testing. Testing mechanisms and sign offs including simulations.

  • Financial. Include relevant budget and expenditure.

  • Risk. Specify the risk methodology being used and the latest assessment that covers the area of the SOP.

  • Service Continuity. Describe the resilience of the systems covered by the SOP and include backup and restore tasks. List and describe how service continuity is implemented in the event of system failure.

  • Escalations. Detail the escalation path required for the systems covered by the SOP.

  • Roles and responsibilities. Use or implement a suitable matrix such as RACI.

  • Dashboard and metrics. Often also referred to as KPI's but more simply a reference to the quality and targets that are expected. This details how and where they will be collected and referenced.

  • Training. Specify how personnel is trained on the systems and procedures.

  • Monitoring requirements. How are the activities of the systems monitored.

  • Record Management. How are records maintained. As an example when checklists are completed, how are they stored and filed. Important if there is a query or dispute in a month's time that these can be retrieved.

  • References. List any external sources of manuals or documentation that is relevant.

  • Change. This deals with how changes are made not only to the systems but procedures as well (refer to a change management checklist here).

  • Stakeholders. List the stakeholders of the systems and procedures.

  • Maintenance. Schedules of maintenance and the tasks required. Also specify all related equipment warranties.

  • Handling Incidents and Troubleshooting procedures. Detail aspects of the expanded Incident Lifecycle.

  • Service review. List as a minimum notes about the previous period's review and the current review period. Include current issues, and peripheral related issues.

  • Service catalogue. Reference how these procedures related to the IT service catalogue.

  • Checklist for SOP. Finally, have you addressed all the following in this SOP (a quick check):

  1. Who?

  2. Why?

  3. What?

  4. When?

  5. How?

  6. Where?

Example SOP available for review and download.

This SOP template can be used in collaboration with the Rapid IR Risk assessment described here:



Ronald works connecting Internet inhabiting things at Fusion Broadband