Introduction
Is this a familiar situation? You are the head of a company, the director of a school, the head physician of a medical institution, or the head of a government agency, and once again you hear complaints about a “frozen” network, slow—running systems, or the unavailability of important data. At the most crucial moment, whether it’s preparing a financial report, holding an educational event, or welcoming citizens, digital services begin to work intermittently. And soon comes the disappointing news from the IT specialist.: “Our server room was flooded due to a burst pipe on the top floor” or “Due to improper operation of the cooling system, the server equipment overheated and failed.”
Such situations are a nightmare for the head of any organization, regardless of its form of ownership or field of activity. Unfortunately, such incidents happen much more often than one might assume. According to a study by the Uptime Institute for 2023, more than 60% of serious failures in the IT infrastructure are related to problems that could have been prevented even at the stage of designing and installing server rooms [10].
The reason lies in a common misconception: many managers perceive the server room as a secondary technical room that does not deserve special attention. Commercial organizations tend to reduce costs, government agencies often have limited budgets, and educational and medical institutions prioritize other needs. Instead of attracting specialists and complying with standards, decisions are made based on the principle of “where there is free space” or “how cheaper”. The result of such savings can cost ten times more than the initial “saved” investments.
If you want to avoid sudden downtime of the organization, loss of important data (be it financial information, personal data of students or patients, government documents) and unforeseen costs for restoring the IT infrastructure, this article is for you. We will consider the most common and fatal errors when creating server rooms, based on international standards and regulatory documents of the Republic of Kazakhstan, including the national standard of the Republic of Kazakhstan ANSI/TIA-942-A-2016 “Telecommunication infrastructure of data centers” [1], current Fire safety Rules [2] and Technical Regulations “General Requirements fire safety requirements” [3], as well as other relevant building and electrical standards.
Incorrect choice of server room location
One of the most fundamental and common mistakes is the wrong choice of the location of the server room, which can lead to many problems in the future.
Why is this critical?: An incorrect server room location can lead to an increased risk of flooding, limited expansion options, problems with service availability, and exposure to electromagnetic interference.
What the standards say: According to ST RK ANSI/TIA-942-A-2016 [1], the location of a data center (data center) or server room should be selected taking into account risk minimization. The standard states: “It is not allowed to place the data center next to rooms for storing flammable or aggressive chemical materials. It is also not recommended to place data centers on the upper floors of the building, as they are most susceptible to damage in case of fire and can be flooded if the roof leaks.” Although the ANSI/TIA-942-A-2016 Standard does not directly prohibit windows, general requirements for security and information protection, as well as requirements for maintaining a microclimate, often lead to decisions on placing server rooms in rooms without windows or with windows protected and not opening. If there are window openings, they must be protected from unauthorized access and provide the necessary thermal insulation and fire resistance in accordance with general building regulations and fire safety requirements [7].
The right decision:
- Have a server room on the first or middle floor of the building.
- Avoid basements due to the risk of flooding and upper floors due to potential roof leaks and the difficulty of extinguishing a fire.
- Do not place the server room next to rooms with wet processes (showers, bathrooms) or water supply and sewerage lines above the server room.
- Choose a location far from powerful sources of electromagnetic interference (for example, power transformers, large electric motors), and provide appropriate shielding if necessary.
- Provide convenient access for equipment maintenance and delivery.
2. Insufficient area and height of the room
Underestimating the size requirements of the server room leads to problems with equipment placement, overheating, and maintenance difficulties.
Why this is critical: Insufficient space can lead to overheating of equipment, maintenance difficulties, inability to expand and violation of safety requirements.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] recommends planning the space taking into account current and future needs. Specific minimum dimensions (for example, the frequently mentioned 14 sq. m. m) may vary depending on the amount of equipment being placed and the type of organization. It is recommended to refer to the design standards and requirements of the TIA-942 standard itself to determine an adequate area. The minimum ceiling height should provide sufficient space for equipment placement, organization of cable runs above cabinets and air circulation (it is often recommended to have at least 2.6 m from the clean floor to the suspended ceiling, if any, or to the bottom of the supporting structures). The door openings should be sufficient to allow the equipment to enter. A doorway width of at least 0.9 m (often 1.2 m for larger equipment) and a height of at least 2.1 m (often 2.2 m) is recommended. The doors should open outward from the server room and should not have thresholds that make it difficult to move the equipment [see general building codes and regulations of the Republic of Kazakhstan, for example, the National Assembly of the Republic of Kazakhstan 3.02-02-2022 “Residential and public buildings” or fire safety requirements for evacuation routes specified in [8]].
The right decision:
- Calculate the required area, taking into account all planned equipment, work areas, passageways and expansion reserve (recommended 30-50%).
- Provide sufficient ceiling height for ventilation and laying of communications.
- Install doors of the appropriate width and height that open outwards.
3. Unreliable power supply system without proper redundancy
The absence or improper organization of redundant power supply is a direct path to critical failures.
Why is this critical?: Even a short-term power outage can lead to data corruption, equipment failure, prolonged downtime, and loss of information.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] defines data center reliability levels where higher levels require full redundancy of power supply systems (for example, Tier III, Tier IV). To ensure reliability, it is recommended to use two independent power inputs, uninterruptible power supplies (UPS) and, possibly, a diesel generator set (DGU). The Rules for the Installation of electrical installations of the Republic of Kazakhstan (PUE RK) [12] and the Rules for the Technical Operation of Electrical Installations of Consumers (PTEEP RK) [11] establish general requirements for the installation, commissioning and operation of electrical installations, including requirements for grounding and safety. Fire safety rules [2] indicate: “Electrical installations and electrical appliances in the premises are de-energized at the end of working hours (shifts). Emergency lighting, fire extinguishing and fire water supply installations, fire and security alarm systems remain energized.”
The right decision:
- Provide a dedicated power supply line for the server room.
- Implement a grounding system in accordance with the PUE RK [12] and the recommendations of ANSI/TIA-607-D [9].
- Install a UPS with sufficient power and battery life.
- Consider installing a DSU for long-term redundancy.
- Provide automatic reserve input (AVR).
- Perform regular maintenance of the power supply system.
4. Inadequate cooling and ventilation system
Server equipment generates a significant amount of heat, and overheating is one of the common causes of failures.
Why this is critical: Insufficient cooling leads to lower performance, shorter component life, unstable operation, and failures.
What the standards say: ASHRAE (American Society of Heating, Refrigerating and Air-Conditioning Engineers) TC 9.9, in its manual “Thermal Guidelines for Data Processing Environments – Fifth Edition” (2021) [6], recommends maintaining the temperature at the entrance to the equipment in the range of 18-27°C. ST RK ANSI/TIA-942-A-2016 [1] also emphasizes the importance of environmental control and recommends redundancy of cooling systems (N+1 or 2N for higher Tier levels). The requirements for the organization of ventilation and air conditioning systems are also contained in the relevant building codes and regulations of the Republic of Kazakhstan (for example, the Joint Venture of the Republic of Kazakhstan 4.02-101-2022 “Heating, ventilation and air conditioning”, specified in [8]). For mission-critical rooms, such as server rooms, it is often recommended to use two independent air conditioning systems or systems with redundant main nodes.
The right decision:
- To carry out an accurate calculation of the heat dissipation of all equipment with a reserve for the future.
- Use specialized precision air conditioners.
- Provide redundancy of cooling systems (N+1 or 2 N).
- Organize proper air circulation (for example, “hot” and “cold” corridors).
- Implement temperature and humidity monitoring systems.
5. Non-compliance with fire safety requirements
Ignoring fire safety requirements can have disastrous consequences.
Why this is critical: A fire in the server room can lead to a complete loss of data and equipment, significant damage and a threat to life.
What the standards say: The Technical Regulations “General requirements for fire safety” [3] establish general requirements, including the need to equip premises with automatic fire extinguishing and alarm systems, depending on their purpose and area. In particular, paragraph 26 states: “Automatic fire extinguishing installations must ensure the elimination of fire in a room (building) before the occurrence of critical values of fire hazards; the occurrence of fire resistance limits of building structures; causing maximum permissible damage to protected property.” Fire safety rules [2] oblige the head of the organization to ensure the proper condition of fire protection systems. SN RK 2.02-02-2023 “Fire automation of buildings and structures” [4] and SP RK 2.02-102-2022 “Fire automation of buildings and structures” [5] details the requirements for the design and installation of these systems. Gas fire extinguishing systems that are safe for electronic equipment are preferred for server rooms. The enclosing structures of the room (walls, floors, doors) must have the required fire resistance limit in accordance with the CH RK 2.02-01-2023 “Fire safety of buildings and structures” [7].
The right decision:
- Equip the room with an automatic gas fire extinguishing system and a fire alarm system.
- Use fire-resistant materials for finishing.
- Install fire doors with an appropriate fire resistance limit.
- Provide an emergency power outage system (except for life support and fire extinguishing systems).
- Ensure the availability of primary fire extinguishing equipment (for example, carbon dioxide fire extinguishers).
6. Improper organization of the cable infrastructure
The chaotic laying of cables creates many problems.
Why this is critical: A poorly organized cable system makes it difficult to circulate air, maintain, upgrade, and increase the risk of cable damage.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] contains detailed requirements for the organization of cable infrastructure, including the use of cable trays, organizers, labeling and separation of power and low-current cables. The standard specifies: “For the distribution of cables and the organization of cable flows in a telecommunications room, it is necessary to use conduits and organizers. The means of distributing and organizing cable flows must be securely fixed, withstand the weight of the cable, and ensure the protection and distribution of cables with the minimum allowable bending radius of the cable. “
Правильное решение:
- Использовать структурированную кабельную систему (СКС).
- Применять кабельные лотки, каналы и органайзеры.
- Осуществлять маркировку всех кабелей и портов.
- Разделять силовые и информационные кабели.
- Использовать кабели с негорючей оболочкой или с низким дымовыделением (LSZH).
7. Отсутствие или недостаточный уровень физической безопасности
The server room is the brain of the digital infrastructure, and its physical protection is critically important.
Why this is critical: The lack of proper physical protection can lead to unauthorized access, theft, equipment damage, and data compromise.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] describes the requirements for physical security, which are tightened with an increase in the Tier of data centers. This includes access control, video surveillance. General requirements for the protection of premises may also be contained in departmental norms or information security standards [14]. It is recommended to install reliable metal doors equipped with access control and management systems (ACS) using electronic locks and the possibility of emergency unlocking.
The right decision:
- Install strong doors with secure locks.
- Implement an access control and management system (ACS).
- Organize video surveillance.
- Keep a log of visits.
- Restrict access to authorized personnel only.
8. Incorrect grounding and protection from static electricity
Problems with grounding and static electricity can lead to malfunctions and equipment failure.
Why is this critical?: Static electricity and poor grounding can damage sensitive electronics, cause malfunctions and data loss.
What the standards say: The requirements for telecommunication grounding and potential equalization are described in detail in the ANSI/TIA-607-D [9] standard (which may be referred to by the ANSI/TIA-942-A standard). The grounding must comply with the requirements of the PUE RK [12]. To minimize static electricity, it is recommended to maintain relative humidity in the range of 40-60% (according to ASHRAE [6]) and use antistatic floor coverings. The requirements for the resistance of antistatic floors (often the recommended value of surface resistance is in the range of 106−109Ω) can be found in specialized standards for protection against electrostatic discharges (for example, the IEC 61340 series of standards [13]).
The right decision:
- Perform a high-quality grounding system for all equipment and metal structures.
- Use antistatic floor coverings.
- Maintain optimal humidity levels.
- Use antistatic wristbands when working with the equipment.
9. Lack of a monitoring and notification system
Without monitoring, it is impossible to respond promptly to emerging problems.
Why this is critical: Problems can be discovered too late, which increases downtime and the risk of critical failures.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] recommends the introduction of monitoring systems for environmental parameters (temperature, humidity, leaks), the condition of engineering systems (power supply, air conditioning) and security systems (access control, video surveillance). Requirements for monitoring the ICT infrastructure, especially for government agencies and critical facilities, may also be contained in relevant Government Resolutions or departmental regulations (for example, within the framework of the “Unified Requirements in the field of ICT and information Security” [14]). It is important to monitor not only the parameters of the indoor environment, but also the state inside the server cabinets, as well as events such as the opening of cabinet doors or rooms.
The right decision:
- Implement a comprehensive monitoring system (temperature, humidity, power supply, ACS, leaks, smoke).
- Set up a notification system for responsible persons (SMS, email).
- Integrate the monitoring system with the general management system of the IT infrastructure.
10. An ill-conceived expansion and modernization strategy
The lack of planning for the future leads to difficulties and additional costs with increasing needs.
Why is this critical?: There may be a shortage of space, power supply and cooling capacity, which will require expensive rebuilding.
What the standards say: ST RK ANSI/TIA-942-A-2016 [1] strongly recommends: “When planning a server room, you should always take into account the increase in space in the future so that this increase does not lead to relocation or new repairs.” This applies not only to the area, but also to the reserve capacity of the power supply and cooling systems.
The right decision:
- Design a server room with a margin in terms of area, power supply and cooling (at least 30-50%).
- Choose scalable and modular solutions.
- Document the entire infrastructure.
11. Absence or poor maintenance of operational documentation and diagrams
Even a perfectly mounted server room will eventually turn into a source of problems without up-to-date and detailed documentation.
Why this is critical: The lack or obsolescence of documentation (power supply circuits, cable logs, equipment configurations, location plans) leads to:
- Significantly increase the troubleshooting time.
- The risk of errors during planned work or modernization.
- Difficulties in transferring cases to new employees.
- Inability to quickly restore the system after a failure using the reference configuration.
What do standards and best practices say?: Although the direct requirements for the composition of operational documentation may vary, ST RK ANSI/TIA-942-A-2016 [1] implies the need for infrastructure administration and management, which is impossible without documentation. ITIL (Information Technology Infrastructure Library) best practices and other IT service management systems strongly recommend keeping detailed and up-to-date records of configuration units, connection diagrams, and procedures.
The right decision:
- Develop and keep up-to-date a complete set of operational documentation, including:
- Architectural plans of the server room, indicating the location of the equipment.
- Detailed diagrams of the power supply system (single-line diagrams, switchboard diagrams, cable routes).
- Diagrams of the air conditioning and ventilation system.
- Structured cabling system (SCS): cable logs, wiring diagrams, labeling of all components.
- Configurations of active network equipment and servers.
- Plans of racks with equipment placement (rack elevation).
- Instructions for the operation and maintenance of engineering equipment.
- Regularly audit and update documentation, especially after any configuration changes.
- Use specialized documentation and OT asset management software (DCIM – Data Center Infrastructure Management, if applicable, or configuration management systems).
- Provide easy access to documentation by authorized personnel.
12. Neglect of regular maintenance of engineering systems
Engineering systems (power supply, UPS, DGU, air conditioning, fire extinguishing) require regular professional maintenance.
Why this is critical: The rejection of planned maintenance or its poor-quality implementation leads to:
- Sudden failure of critical equipment (for example, UPS failure due to worn-out batteries, overheating due to clogged air conditioner filters).
- Reducing system efficiency and increasing operating costs.
- False alarms or fire extinguishing/alarm system failures.
- Cancellation of the warranty obligations of the equipment manufacturer.
What the standards and equipment manufacturers say: All engineering equipment manufacturers provide maintenance regulations. The rules of technical operation of electrical installations of consumers (PTEP RK) [11] also prescribe regular inspections and maintenance of electrical equipment. Fire safety regulations [2] require the maintenance of fire automation systems in good condition, which includes their regular maintenance by specialized organizations. Maintenance recommendations can also be found in ST RK ANSI/TIA-942-A-2016 [1] in the sections related to data center operation.
The right decision:
- Develop and strictly adhere to the schedule of scheduled preventive maintenance and maintenance for all engineering systems of the server room.
- To conclude service contracts with specialized authorized organizations.
- Keep maintenance and repair logs.
- Ensure the availability of the necessary spare parts (spare parts, tools and accessories) for the prompt elimination of minor malfunctions.
- Regularly check and replace consumables (UPS batteries, air conditioner filters, end-of-life fire extinguishing system sprinklers, etc.).
13. Insufficient attention to configuration and change management
Any changes to the IT infrastructure or server engineering systems must be controlled and documented.
Why it’s critical: Uncontrolled changes lead to:
- Unexpected failures and conflicts in the operation of systems.
- Inconsistency with the actual configuration of the documentation.
- Difficulties in diagnosing problems (the “domino effect”, when one change causes a cascade of other problems).
- Reducing the overall level of safety and reliability.
What standards and best practices say: Change Management and Configuration Management processes are key in IT management methodologies such as ITIL. ST RK ANSI/TIA-942-A-2016 [1] also emphasizes the importance of managed processes to ensure data center reliability. For information technology facilities, especially in the public sector or at critical facilities, change management requirements can be established by law [14].
The right decision:
- Implement a formalized change management process:
- Registration of requests for changes (RFC – Request For Change).
- Assessment of the impact and risks of changes.
- Planning and testing of changes.
- Approving changes.
- Implementation of changes in the established “maintenance window”.
- Documentation of all changes made.
- Analysis of measurement results.
- Maintain an up–to-date database of configuration units (CMDB – Configuration Management Database) or its equivalent.
- Regularly verify the actual configuration with the reference (documented) one.
- Restrict the rights to make changes to strictly authorized personnel.
14. Ignoring energy efficiency and operating costs (OPEX)
When designing, the focus is often on capital expenditures (CAPEX), forgetting about future operating costs.
Why is this critical?:
- High energy bills (the main consumers are IT equipment and the cooling system).
- Inefficient use of capacities, leading to overpayments.
- Frequent replacement of equipment due to suboptimal operating conditions or the choice of components with low efficiency.
- Increasing the carbon footprint of the organization.
What do standards and best practices say?: ST RK ANSI/TIA-942-A-2016 [1] and the ASHRAE recommendations [6] contain provisions aimed at improving the energy efficiency of data centers, for example, through the organization of “hot/cold” corridors, the use of higher permissible temperatures, and the use of energy-efficient equipment. Indicators such as PUE (Power Usage Efficiency) are generally accepted for evaluating the energy efficiency of a data center.
The right decision:
- At the design stage, take into account not only the cost of purchase, but also the energy efficiency of equipment (servers, UPS, air conditioning systems).
- Apply modern corridor insulation solutions to optimize air flow.
- Consider the possibility of using free cooling (natural cooling) technologies where it is climatically justified.
- Regularly monitor energy consumption and analyze resource efficiency.
- Optimize server utilization (for example, using virtualization) to reduce their number and, consequently, energy consumption and heat dissipation.
- Choose power supplies for high-efficiency OT equipment (for example, 80 PLUS Platinum/Titanium certification).
Conclusion
Creating a reliable and efficient server room requires an integrated approach, professional knowledge and strict compliance with standards. By avoiding the fatal errors considered and following the recommendations of current regulatory documents of the Republic of Kazakhstan and international standards, you will ensure the long-term and uninterrupted operation of your IT infrastructure, protecting valuable data and investments.
Disclaimer
The information presented in this article is for general information purposes and is based on international standards and regulatory documents of the Republic of Kazakhstan that are current at the time of writing (May 2025). When designing and installing server rooms, you should follow only the latest versions of regulatory documents and involve certified specialists. The author and publisher are not responsible for the possible consequences of using the information from this article without a detailed study of the project and consultation with professionals. Specific requirements and recommendations may change.
List of recommended sources for designing server rooms (it is necessary to check the relevance at the time of design):
- ST RK ANSI/TIA-942-A-2016 “Telecommunication infrastructure of data processing centers”.
- Fire safety rules approved by the Order of the Minister of Emergency Situations of the Republic of Kazakhstan dated February 21, 2022 No. 55 (with subsequent amendments and additions).
- Technical Regulations “General requirements for fire safety”, approved by the Order of the Minister of Emergency Situations of the Republic of Kazakhstan dated August 17, 2021 No. 405 (with subsequent amendments and additions).
- SN RK 2.02-02-2023 “Fire automation of buildings and structures”.
- SP RK 2.02-102-2022 “Fire automation of buildings and structures”.
- ASHRAE TC 9.9. (2021). Thermal Guidelines for Data Processing Environments – Fifth Edition. American Society of Heating, Refrigerating and Air-Conditioning Engineers.
- SN RK 2.02-01-2023 “Fire safety of buildings and structures”.
- Building codes and Regulations (SNiP RK) and Codes of Practice (SP RK) of the Republic of Kazakhstan applicable to the design of buildings and premises (for example, concerning loads, minimum dimensions, escape routes, door openings, HVAC systems, etc.). (Specific document numbers depend on the aspect of design).
- ANSI/TIA-607-D-2019 “Commercial Building Grounding and Bonding Requirements for Telecommunications” (or the current version).
- Uptime Institute. (2023). Global Data Center Survey (or an up-to-date annual report).
- The rules of technical operation of electrical installations of consumers, approved by the authorized body in the field of electric power industry of the Republic of Kazakhstan (PTEEP RK).
- Electrical installation regulations of the Republic of Kazakhstan (PUE RK).
- The IEC 61340 series of standards for the protection of electronic devices from electrostatic phenomena (for example, IEC 61340-5-1).
- Relevant Resolutions of the Government of the Republic of Kazakhstan and departmental regulations concerning the requirements for information and communication infrastructure and information security, especially for government agencies and critical informatization facilities.