📜 ⬆️ ⬇️

Storage Performance

Since LSI (now part of Avago Technologies) sold its division of flash controllers from the notorious Seagate, their main focus is again on solutions for RAID and SAS (a kind of back to basics). To mark this “return to the roots,” I decided to translate an interesting article by Thomas Pavel, director of sales for the EMEA storage division. Not sure about whether this article is posted on the Internet, at least not yet found.

The influx of data and does not think to subside, feeding on demanding applications, such as business analytics, social media, video broadcasting and grid computing. This merciless data growth poses many challenges for system architects. How to handle such huge amounts of information effectively at a price? And, perhaps, the main question is how to maintain storage performance at the same level (or even improve)?

In fact, it is difficult to find in the IT industry those who are not affected by the influx of data. Even home users generate photo and video materials more both in quantity and quality, which, in turn, requires an increase in storage sizes in their PCs and smartphones. That is why social networks, such as Facebook, get more and more information. According to materials published by Facebook back in 2012, the amount of data they receive throughout the day reached 0.5 petabyte.

As data volumes increase, the time it takes to access it increases. This is what is behind the continuous improvement of storage technologies, and in this article we will look at how the third generation of SAS will help overcome various bottlenecks in storage systems.
')
When designing high-performance optimized data warehouses, you need to carefully consider potential bottlenecks. This is especially difficult, given the fact that these same bottlenecks change every time, as storage technologies are updated. There are three most important elements affecting performance: a PCIe (Peripheral Component Interconnect Express) server, a SAS solution implemented in the Host Bus Adapter (HBA) adapter, and, directly, disks connected via Serial Attached SCSi interfaces or Serial Advanced Technology Attachment (SAS and SATA, respectively).

SAS is now in the third generation of development, and each generation has doubled the performance characteristics, starting with the original 3 Gb / s to 6 Gb / s and now 12 Gb / s. SAS, like PCIe, uses pipes, and high-performance storage systems typically use many SAS lines to support high data rates. Both conventional HDD and SSD solid state are now able to support SAS or SATA interfaces.

Storage Performance

When developing a storage system designed for high performance, it is necessary to take into account the capacity of each of the elements of the system. Typical characteristics of this value for different generations of critical elements discussed in this article are listed in Table 1.

Table 1 - approximate throughput of various types of disks and generation of SAS and PCIe.



Table 2 contains several examples of configurations that show bottlenecks in the case when the maximum limit of the disks (usually the slowest component of the system) is reached.



Table 2 shows an example of storage configurations showing bottlenecks and using the maximum number of disks with peak performance.

As can be seen from the table, the need to use more disks to increase capacity requires the use of more modern generations of SAS and / or PCIe, or more SAS lanes. On the other hand, in a system with a small number of disks, their relatively low total bandwidth is a bottleneck, so there is no need to “reconfigure” the configuration with modern technologies and / or an increase in channels. The example disks have a 6 Gb / s interface and have a capacity of 230 Mb / s and 55 Mb / s for 15K RPM HDD and SSD, respectively.

Please note that table 2 implies that all drives operate with a maximum load at the same time, which does not always happen in real life. It should also be borne in mind that the number of I / O operations per second (IOPs) in most modern systems in many cases is a more important characteristic than the total bandwidth. Therefore, in real life, most configurations can support more disks than in the table below.

Switch to 12 Gb / s SAS

The main problem in the transition to the third generation of SAS is familiar to many: investment protection. Many organizations have already invested very noticeable amounts in SAS disks and want to save these investments during the transition. The main essence of the problem is that due to backward compatibility, the third generation of the SAS standard provides for slowing down the work to the slowest data transfer rate among all devices in the system.

In small point-to-point configurations, this problem is not always noticeable, since the transition will require updating both the initiator and the target device. But in most companies, such configurations are very rare. Because of this, due to the restriction “to the slowest node of the system”, without the use of point-to-point configurations, the maximum speed will be unattainable until all disks are replaced with those supporting the new standard.

Fortunately, there is a way to overcome this limitation. First, a few explanations on how SAS extenders work. SAS extenders allow one (or several) initiators to communicate with several target devices at once, as shown in Figure 1. Extenders help to scale SAS, since each of them is capable of supporting multiple disks, which in turn allows aggregating their throughput.



Figure 1 - Two 48-port SAS Extenders, each connected to 12 initiators and 36 target devices.

This figure shows a redundant scheme in which 48-port extenders are connected to two RAID controllers and disk storage. Additional storages can be connected in daisy-chain configuration. Some SAS extenders support connecting up to 2000 drives in this mode.

In accordance with the third generation of the SAS standard, if any of the SATA or SAS drives works at a speed of 6 Gb / s, the RAID controller interfaces will also work at that speed. But what if we can combine the bandwidth of two 6 Gb / s disks into one 12 Gb / s channel, just as several channels can be aggregated into a PCIe?

This bandwidth aggregation of target devices (disks) is what is behind the bandwidth aggregation technology, which is called DataBolt in LSI, an Avago Technologies Company. This technology allows you to immediately increase the number of I / O operations per second and the total bandwidth, while maintaining the investment made in the first and second generation SAS devices. Aggregation allows 12 Gb / s SAS to work with existing 6 Gb / s devices using double speed. In fact, with aggregation of bandwidth, the mandatory need to use 12 Gb / s SAS disks in order to achieve maximum system performance is completely eliminated.

Bandwidth aggregation works by using 12 Gb / s buffers on each port, which allows the use of communications at a speed of 12 Gb / s with disks of any speed, both SAS and SATA. Despite the fact that the technology is proprietary, it works completely “inside” the extender, which allows all interfaces to be fully compatible with the standard. Thus, a 12 Gb / s SAS port connected to a 6 Gb / s SAS disk will work with the disk as a backward compatible SAS port at 6 Gb / s, but the system will work at 12 Gb / s.
Conclusion

Storage bottlenecks are always encountered during the migration of different generations of technology. With the advent of the third generation PCIe, for example, the second generation SAS has become such a bottleneck. LSI technology eliminates this bottleneck, making PCIe a bottleneck in a system using 12 Gb / s SAS.

Source: https://habr.com/ru/post/228041/


All Articles