📜 ⬆️ ⬇️

Intel® Parallel Studio XE 2016 Beta - what's new?

A big update to the Intel® Parallel Studio XE package was released this week. Version 2016 includes three brand new products:
  1. Intel® Data Analytics Acceleration Library (Intel® DAAL) - C ++ and Java data analytics solution (statistics, machine learning, etc.).
  2. The new Vectorization Advisor in the Intel® Advisor XE 2016 Beta to optimize code for SIMD instructions, i.e. vectorization.
  3. MPI Performance Snapshot for quick overall performance evaluation of MPI programs.

The beta version is available publicly and for free, the program lasts until June 23, but the licenses will work until September 25, 2015. To get the Beta version you need to register here .
This article is devoted to the review of the new functional, we will try to highlight in more detail the individual products in subsequent blogs - write in the comments what is of interest.

Intel® Advisor XE 2016 Beta


Intel® Advisor XE has traditionally been the tool for prototyping concurrency on threads. In the 2016 Beta version, it is actually divided into two large products:


Main features of Vectorization Advisor:



We will write more about Vectorization Advisor in the following blogs.
')

Intel® Data Analytics Acceleration Library (Intel® DAAL) 2016 Beta


Optimized library for working with “big data” at different stages: obtaining data from the source, preliminary processing, transformation, data mining, modeling, validation and decision making.



Compiler


The official name is Intel® Parallel Studio XE 2016 Beta, Composer Edition. The product combines the compiler and libraries. The Intel compiler extends the set of supported standards:

OpenMP * 4.0:

OpenMP * 4.1 TR 3:

C / C ++ standards:

The Fortran compiler began to support submodules, improved runtime checking of uninitialized variables ( [Q]init option), added support for IMPURE ELEMENTAL (F2008) and improved C Interoperability (F2015).

Intel® VTune ™ Amplifier XE 2016 Beta


The new version of VTune ™ Amplifier XE has expanded the functionality of profiling OpenMP and MPI applications. For OpenMP, the tool provides statistics on parallel regions and highlights those where there is the greatest potential for optimization. VTune Amplifier recognizes the main reasons for OpenMP inefficiency - load imbalance, overhead, synchronization, etc. Details have been written about OpenMP analysis.

VTune Amplifier greatly expanded the capabilities of MPI-profiling applications, and the most interesting - hybrid, MPI + OpenMP. Those. You can simultaneously analyze OpenMP and MPI communication imbalances.

In the Bottom-up table, you can group by MPI process and OpenMP regions within it, as well as by individual barriers (for example, in cycles) within a region:

Performance analysis of OpenCL programs running on Intel® HD Graphics becomes more convenient with the new “architecture diagram”, where you can visually see the GPU hardware blocks and their loading - busy cores and data traffic:

Other changes in VTune Amplifier XE:


Intel® MPI Library 5.1 Beta



The new option -gtool allows -gtool to conveniently launch the analyzer tool for your MPI application. For example, Vectorization Advisor to analyze vectorization:

mpirun -n 4 -gtool "advixe-cl -collect survey:2,3" ./your_app

Or VTune Amplifier, to analyze the performance of individual wounds:

mpirun -n 4 -gtool "amplxe-cl -collect hostpots:2,3" ./your_app

Please note, in the example, the analysis is run only on the ranks 2 and 3 of the four running. Those. You can selectively profile individual processes.

Intel® Trace Analyzer and Collector 9.1 Beta


The main innovation is MPI Performance Snapshot . The tool is used for quick top-level performance evaluation - how the application scales, what is the load balance of MPI, OpenMP, and computing.

Improved analysis of hybrid applications: Intel® Trace Analyzer allows you to select MPI markets and generate the VTune ™ Amplifier command line for them using the new <code-gtool option from the Intel® MPI Library. Those. You first identify problematic MPI processes with the Intel® Trace Analyzer, and then study them in detail with the VTune ™ Amplifier.


Summary


Intel® Parallel Studio 2016 Beta carries a serious set of new features in a number of areas:

Some of this is the development of a traditional set for the HPC developer, but much is being extended to other areas, such as multimedia and data analysis.

It was a brief review, in the following posts we will write in more detail - let me know what is interesting in the first place.
Sign up for Intel® Parallel Studio XE 2016 Beta .

Source: https://habr.com/ru/post/255175/


All Articles