Yesterday's 11-hour crash in Gmail (there was a brutally cursed
topic on this topic), due to which problems with the delivery of emails and / or attachments to them were observed in almost a third of email traffic, received its official explanation.
The company itself describes the scale of the problem in these figures: 71% of mail traffic did not experience any problems, while in the remaining 29% the average delay in the delivery of letters was 2.6 seconds. Google also honestly admits that about 1.5% of emails were delivered with a delay of more than two hours. Users attempting to download large attachments also observed multiple error messages. With all this, the already delivered mail worked without noticeable problems.
The delay in message delivery was caused by a double network failure. This is a very rare case in which two separate redundant network segments stop working simultaneously. These failures were not related at all, but their combination led to a decrease in Gmail’s message delivery capacity and, at 5.54 am Pacific time, messages began to accumulate.
Original
The message delivery delays were triggered by a dual network failure. Redundant network paths at the same time. The two network failures were unrelated, but in combination they began at 5:54 am PST messages started piling up.
As a result, the restoration of Gmail functionality required the team of engineers, who took up the problem immediately, adding the missing capacity and, in fact, restoring the mentioned failures, as a result of which, after about 10 hours, users stopped experiencing problems with Google mail.
The company made conclusions from what happened: measures are planned to improve network capacity, even in the case of simultaneous network failures — such as happened yesterday. At the same time, it is alleged that you should not forget that Gmail uptime is 99.9% and the company is trying very hard to keep it in the future.
')
[
Source ]