TELDAT Blog


Improving device reliability and redundancy

Mar 5, 2024

Hardware

In a previous article we looked at how to calculate the reliability of electronic equipment mathematically, that is, the probability of it working correctly for a given period of time. Reliability is best characterised by the Mean Time Between Failures (MTBF) or by its inverse, the failure rate, usually expressed as Failures In Time (FIT, the number of failures per billion hours of operation). For electronic equipment, MTBF values are often in the order of hundreds of thousands of hours.

Source: Ricardo Saiz

The probability of a failure occurring within a time t follows an exponential function, which is close to a straight line for intervals much shorter than the MTBF.
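The following is a minimal sketch of that relationship, assuming a constant failure rate. The 200,000-hour MTBF and the function names are illustrative choices, not values taken from a datasheet. It compares the exponential expression with the straight line t/MTBF:

```python
import math


def failure_probability(t_hours: float, mtbf_hours: float) -> float:
    """Probability of a failure within t_hours, assuming a constant failure rate."""
    # -expm1(-x) equals 1 - exp(-x) but keeps precision when x is tiny.
    return -math.expm1(-t_hours / mtbf_hours)


def linear_approximation(t_hours: float, mtbf_hours: float) -> float:
    """Straight-line approximation, reasonable only while t is much smaller than the MTBF."""
    return t_hours / mtbf_hours


MTBF_HOURS = 200_000.0  # illustrative value

for t in (100, 1_000, 10_000, 100_000):
    exact = failure_probability(t, MTBF_HOURS)
    approx = linear_approximation(t, MTBF_HOURS)
    print(f"t = {t:>7} h   exponential = {exact:.5f}   linear = {approx:.5f}")
```

The two columns stay close for short intervals and drift apart as t approaches the MTBF.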

How do you calculate the MTBF of a device?

Device reliability depends on that of its component parts (solderable electronic components, modules, wiring, etc.). The inverse of the total MTBF is the sum of the inverses of the MTBF of each part, just as with resistances in parallel. In the same way that admittances add up in a parallel circuit, the FIT of a device made up of many components (parallel paths leading to a failure) is the sum of the FIT of all of them. This is why it is easier to work with FIT than with MTBF.
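As a rough illustration of this rule, the sketch below adds up the FIT of a few components and converts the total back into an MTBF. The component names and FIT values are hypothetical, and it assumes that the failure of any single part brings the device down:

```python
HOURS_PER_BILLION = 1e9  # FIT counts failures per 10^9 hours of operation


def fit_to_mtbf(fit: float) -> float:
    """Convert a failure rate in FIT into an MTBF in hours."""
    return HOURS_PER_BILLION / fit


def mtbf_to_fit(mtbf_hours: float) -> float:
    """Convert an MTBF in hours into a failure rate in FIT."""
    return HOURS_PER_BILLION / mtbf_hours


# Hypothetical component failure rates, in FIT.
component_fit = {
    "power supply": 5_000.0,
    "CPU module": 1_200.0,
    "Ethernet interfaces": 800.0,
    "passives and connectors": 500.0,
}

# Failure rates simply add up when any component failure stops the device.
device_fit = sum(component_fit.values())
device_mtbf = fit_to_mtbf(device_fit)

print(f"device FIT:  {device_fit:.0f}")
print(f"device MTBF: {device_mtbf:,.0f} hours")
```

Working in MTBF directly would require summing reciprocals and inverting the result, which is the same arithmetic written less conveniently.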

How to make devices more reliable?

In turn, the FIT of a component is not an immutable value but depends on the environment and (more specifically) on the temperature. Heat is directly related to the failure rate, and indeed to the speed of many physical processes and chemical reactions. Swedish scientist Svante Arrhenius (1859 – 1927) was the first to model this relationship, in 1889, with the equation that bears his name:

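$$ k = A \, e^{-E_a / (k_B T)} $$

where k is the rate of the process (here, the failure rate), A is a constant specific to the process, E_a is the activation energy, k_B is the Boltzmann constant and T is the absolute temperature.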

According to this formula, when the absolute temperature is close to zero, reactions stop. However, they accelerate significantly with increasing temperature.
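To get a feel for the numbers, here is a small sketch that uses the Arrhenius equation to estimate how much faster failures occur at one temperature than at another. The activation energy of 0.7 eV is an assumed, illustrative value (real components vary widely), and the function names are made up for this example:

```python
import math

BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def celsius_to_kelvin(temp_c: float) -> float:
    return temp_c + 273.15


def acceleration_factor(ref_temp_c: float, use_temp_c: float,
                        activation_energy_ev: float = 0.7) -> float:
    """Ratio of the failure rate at use_temp_c to the failure rate at ref_temp_c.

    The 0.7 eV default is an assumption for illustration only.
    """
    t_ref = celsius_to_kelvin(ref_temp_c)
    t_use = celsius_to_kelvin(use_temp_c)
    exponent = (activation_energy_ev / BOLTZMANN_EV_PER_K) * (1.0 / t_ref - 1.0 / t_use)
    return math.exp(exponent)


for use_temp in (25, 40, 60, 80):
    factor = acceleration_factor(ref_temp_c=40, use_temp_c=use_temp)
    print(f"{use_temp:>3} °C: failure rate x {factor:.2f} relative to 40 °C")
```

Under these assumptions, a rise of 20 °C multiplies the failure rate several times over, which is why ventilation and placement matter so much.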

High service availability

Our device will become less reliable as temperatures rise, but how can we make it more reliable? We can’t fight the laws of physics, but we can use them to make the best engineering decisions. In addition to heeding the advice found in manuals (“do not cover the ventilation slots” or “install the device far from heat sources”), we can improve the reliability of the system as a whole, that is, its service availability, which is ultimately what matters.

Redundancy of devices

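In a router or switch, we can duplicate the power supply (one of the elements with the highest failure rate). The probability of a single power supply unit (PSU) failing within an interval t is:

$$ P_1(t) = 1 - e^{-t/\mathrm{MTBF}} $$

This formula equals 0 when t = 0, but its derivative at that point does not:

$$ P_1'(0) = \frac{1}{\mathrm{MTBF}} $$

The device will stop working only if both PSUs fail. Assuming the two units fail independently, the probability of this happening is the previous formula squared:

$$ P_2(t) = \left(1 - e^{-t/\mathrm{MTBF}}\right)^2 $$

As in the previous case, this formula equals 0 at t = 0. However, its derivative is also 0 at t = 0:

$$ P_2'(t) = \frac{2}{\mathrm{MTBF}} \, e^{-t/\mathrm{MTBF}} \left(1 - e^{-t/\mathrm{MTBF}}\right), \qquad P_2'(0) = 0 $$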

Source: Ricardo Saiz

With two units working simultaneously (only one of which is essential), the failure probability follows a very different curve, especially over periods that are short compared to the MTBF. Let’s look at a simple example.

We have a power supply with an MTBF of 200,000 hours. What are the chances of it failing within a year (8,760 hours)?

$$ 1 - e^{-8760/200000} \approx 0.043 $$

200,000 hours may seem like a long time, but there is a 4.3% chance of it breaking down in the first year of use. If we have a pool of 23 devices, we will suffer an average of one breakdown per year (with the ensuing service outage).

If we set up two PSUs working redundantly, the probability of a critical failure over a year is:

$$ \left(1 - e^{-8760/200000}\right)^2 \approx 0.0018 $$

It only amounts to 0.18%.
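The arithmetic behind this example can be checked with a few lines of code. This is only a sketch under the same simplifying assumptions as the text (constant failure rate, independent failures, no repair during the year), and the variable names are illustrative:

```python
import math

HOURS_PER_YEAR = 8_760     # 365 days x 24 hours
PSU_MTBF_HOURS = 200_000   # MTBF used in the example


def failure_probability(t_hours: float, mtbf_hours: float) -> float:
    """Probability that one unit fails within t_hours (constant failure rate)."""
    return -math.expm1(-t_hours / mtbf_hours)  # same as 1 - exp(-t / MTBF)


# Single PSU: the device goes down as soon as that unit fails.
p_single = failure_probability(HOURS_PER_YEAR, PSU_MTBF_HOURS)

# Two PSUs, either of which can carry the load: both must fail.
# Assumes independent failures and no replacement during the year.
p_redundant = p_single ** 2

# Number of single-PSU devices at which one failure per year is expected.
fleet_size = 1.0 / p_single

print(f"single PSU, one year:    {p_single:.1%}")
print(f"redundant PSUs, one year: {p_redundant:.2%}")
print(f"devices per expected yearly failure: {fleet_size:.0f}")
```

Running it reproduces the 4.3% and 0.18% figures, as well as the pool of about 23 devices.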

If we also connect each PSU to a separate electrical circuit (e.g., an uninterruptible power supply, or UPS), we gain another advantage: a power cut is much less likely to leave us temporarily without service.

If our equipment sends an alert to the network administrator when it detects a failure, the faulty unit can be replaced quickly, ideally before a second, critical failure occurs.

Combining redundancy with diligent fault detection and remediation yields extremely high service availability. This is because, after a first failure occurs, a second breakdown during the time it takes to repair the device (presumably hours or a few days) is unlikely. We can understand this graphically: we stay in the flat region of the grey line (i.e., where the derivative is almost zero).
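To put rough numbers on this, the sketch below estimates the probability that the surviving PSU also fails before the faulty one has been replaced. The repair times are hypothetical, and the model assumes a constant failure rate for the remaining unit and the same 200,000-hour MTBF as in the example:

```python
import math

PSU_MTBF_HOURS = 200_000  # same illustrative MTBF as in the example


def second_failure_probability(repair_hours: float,
                               mtbf_hours: float = PSU_MTBF_HOURS) -> float:
    """Probability that the remaining unit fails within the repair window."""
    return -math.expm1(-repair_hours / mtbf_hours)


# Hypothetical repair windows, from a quick swap to a slow one.
for label, hours in (("4 hours", 4), ("1 day", 24), ("3 days", 72)):
    p = second_failure_probability(hours)
    print(f"repair within {label:>7}: {p:.4%} chance of a second failure")
```

Even with a three-day repair window, the chance of losing the second unit stays in the hundredths of a percent, which is the flat region of the curve described above.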

Source: Ricardo Saiz

MTBF findings and more

Teldat devices (such as the new generation of switches, some of which are equipped with redundant power supplies to meet the most demanding requirements) offer MTBF figures ranging between 500,000 and one million hours. We also carry out a rigorous Reliability, Availability, Maintainability and Safety (RAMS) analysis for equipment intended for special scenarios, such as railways. Using Fault Tree Analysis (FTA), we can identify potential failures and design alternative operating modes in the event of simple failures. As a result, our service availability figures are close to 100%.
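As a very simplified illustration of how a fault tree combines failure probabilities, the sketch below evaluates a two-level tree with AND and OR gates. The tree structure, the mainboard probability and the function names are hypothetical and are not taken from any real RAMS analysis; events are assumed to be independent:

```python
import math


def and_gate(*probabilities: float) -> float:
    """All inputs must fail (e.g. redundant units). Assumes independent events."""
    return math.prod(probabilities)


def or_gate(*probabilities: float) -> float:
    """Any single input failing is enough. Assumes independent events."""
    return 1.0 - math.prod(1.0 - p for p in probabilities)


# One-year failure probabilities.
p_psu = -math.expm1(-8_760 / 200_000)  # PSU with an MTBF of 200,000 hours
p_mainboard = 0.01                     # hypothetical value for a non-redundant part

# Top event: the device stops working if both PSUs fail OR the mainboard fails.
p_power_lost = and_gate(p_psu, p_psu)
p_device_down = or_gate(p_power_lost, p_mainboard)

print(f"power lost:  {p_power_lost:.2%}")
print(f"device down: {p_device_down:.2%}")
```

In this toy tree the non-redundant mainboard dominates the result, which is the kind of insight that helps decide where an alternative operating mode is worth designing.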

Ricardo Saiz Villoria

Ricardo Saiz, telecommunications engineer, is part of Teldat’s R&D department. He specializes in hardware, and is responsible for electronic design and equipment certification.

Tags: router technology, telecommunication technology


