⬆️ ⬇️

Investigation of the method of principal components and linear discriminant analysis on the change in the angle and conditions of the person’s illumination as an object of recognition

Good day everyone. I am a graduate student. The topic of my dissertation is “Development of image identification methods for providing individual access in real time”.

In my first post I wrote, not from the very beginning. Here I start from the beginning.



The recognition of a person by an image of a person is distinguished from biometric systems by the fact that, firstly, special or expensive equipment is not required, and secondly, physical contact with devices is not needed. However, the recognition of a person by the image of the person does not provide 100% reliable identification.



The peculiarity is to recognize a person by the image of a person, regardless of the change in angle and lighting conditions when shooting.

')

Such tasks do not have an exact analytical solution. This requires the selection of key features that characterize the visual image, the determination of the relative importance of signs by selecting their weights and taking into account the relationship between signs. Initially, these tasks were performed by a human expert, which took a lot of time and did not guarantee quality. In new methods, the selection of key features is carried out by automatic analysis of the training set, but nevertheless, most of the information about the features is set manually. For automatic application of such analyzers, the sample should be large enough and cover all possible situations.



Neural network methods offer a different approach to solving the pattern recognition problem. Weights in the neural network are not calculated by solving analytical equations, but are adjusted by various methods during training. Neural networks are trained on a set of training examples. A trained NA can be successfully used for recognizing a person under various conditions. T.O. the use of neural networks for the task of recognizing a person from the face is a promising direction.



Methods of identifying a person by a face



With all the variety of different algorithms and methods for image recognition, a typical recognition method consists of three components (Fig. 1):



1. transformation of the original image into the initial representation (may include both preprocessing and mathematical transformations, for example, the computation of principal components);



2. selection of key characteristics (for example, the first n principal components or coefficients of the discrete cosine transform are taken);



3. classification mechanism (simulation): cluster model, metric, neural network, etc.



image

Fig.1. The scheme of interrelation of structural elements of a typical image recognition method



The established approaches to the identification of persons based on images of human faces are practically well established. It is necessary to improve the existing algorithms in order to optimize the time and accuracy characteristics of the search that they provide by using key features extracted automatically from the person’s image (for example, gender, beard, glasses, face angle, etc.), and thus increase speed and accuracy of search.



Principal component method



The Principal Component Analysis (PCA) method is used to compress information without significant loss of information. It consists in a linear orthogonal transformation of the input vector X of dimension N to the output vector Y of dimension M, where N> M.



Benefits:



- if there are variations in a set of facial images, such as race, gender, emotions, lighting, components will appear, the magnitude of which is mainly determined by these factors. Therefore, the values ​​of the respective principal components can be used to determine, for example, the race or sex of a person;



- storage and search of images in large databases, image reconstruction.



The underlying difficulty lies in the high demands on the conditions for shooting images. Images should be obtained in similar lighting conditions, the same angle (decided by adding images to different training angles to the training sample) and high-quality pre-processing should be carried out, leading the images to standard conditions.



Method of linear discriminant analysis



Using the method of linear discriminant analysis (Linear Discriminant Analysis, LDA), choose the projection of the image space on the feature space in such a way as to minimize the intraclass and maximize the interclass distance in the feature space. In these methods it is assumed that the classes are linearly separable.



Benefits:



- High recognition accuracy (about 94%) is noted for a wide range of lighting conditions, various facial expressions and the presence or absence of glasses.



Method problems:



- however, questions remain unclear whether this method is applicable to search in large databases, whether the method can work, when in the training sample for some individuals there is an image only in some light conditions;



- there was also no change in the angle, and experiments with changing lighting were carried out without changing other factors. Whether this method will work with such combinations is also unknown. As in the method of own persons, there is also a need for high-quality pre-processing, which leads to images in standard conditions.



The main goal of this work is to develop methods for identifying and building information retrieval systems for special applications (IPS SP), which provide automatic identification of a person’s identity in real time based on the image of his face.



Achieving the goal set in the work dictates the need to solve a number of the following main tasks:



- development of “fast” algorithms for recognition and selection of the main characteristics of the image of a human face, ensuring high accuracy of identification of the object of search;



- development of a storage and coding algorithm for auxiliary information characterizing the search object that provides acceptable volumetric-temporal indicators of the operation of the IPS SP;



- development of an algorithm for reliable identification of persons based on information stored in the database of IPS SP;



- development of a prototype IPS SP that implements the above algorithms in order to verify in practice the correctness of the theoretical conclusions made in this work and to issue recommendations for its further improvement based on the results of IPS SP pilot testing.

The task was to compare the principal component method and the linear discriminant analysis (LDA) method. The principal component method and the LDA method should be verified.



To conduct research, programs in the C ++ Builder language have been developed that implement the principal component method and the LDA method. Experimental studies were conducted using the ORL base, the FERET base and its own base of 15 people. All bases contained images of different registration angles, with arbitrary facial expressions, different scale and registration conditions. The purpose of this experiment was to assess the effectiveness of the recognition method for various number of K classes in the database (K = 4, 15, 40, 100, 200 and 395). Evaluation of recognition efficiency for a small number of classes (K = 4, 15) showed that the minimum number of images in each class should not be less than 5, since the covariance matrices used in the PCA and LDA methods in this case become special and in this case cannot guarantee the stability of the reduction of the source feature space.



The first and second components of the reduced features Ĺ (vxvy) are “responsible” for the turn and posture of the head, and the third is for the facial expression itself. In this case, the influence of a component in the recognition process (as a result of choosing a close image) is the higher, the lower the sequence number of the component.



The PCA and LDA methods were tested for the case when the source database is composed of two or more heterogeneous databases. For this, 355 classes from the FERET database were added to 40 classes of the ORL database. It should be noted that the added images had a lower resolution, a dark background, different lighting and size, as well as significant variations in the rotation of the face.



Due to this difference in the source data in the space of reduced features, new features have appeared, grouped in a separate area regarding the features of the ORL database.



The research results are summarized in table 1.



Table 1.



The number of recognizable peopleNumber of images of each personNumber of images used for trainingError of the second kind (FRR) when using the correlation coefficient
PCALda
fourtenfive0,0000,000
1515five0.3330.063
ten0.2300.133
40ten30.3300.122
five0.2500.155
70.1840.033
10020five0.1970.056
70.1760.104
20020five0.1730.102
70.1040.083
39520five0.0830.064
70.0800.046




Conclusion



From the above analysis it follows that to increase the probability of recognition, it is advisable to use a combination of both methods. This requires further research.



Literature

1. Golovko V.A. Neurointelligence: Theory and Applications. Book 1. Organization and training of neural networks with direct and feedback - Brest: BPI, 1999, - 260s.

2. Samal D.I., Starovoytov V.V. - Approaches and methods of recognizing people by portraits. - Minsk, ITK NASB, 1998. - 54s.

3. Samal D.I., Starovoytov V.V. Methods of automated recognition of people on portraits // Digital image processing. - Minsk: ITK, 1999.-P.81-85.

4. Voronovsky G.K., Makhotilo K.V., Petrashev S.N., Sergeev S.A. - Genetic algorithms, artificial neural networks and virtual reality problems. - Kharkov: Basis, 1997.



Wait for the following articles :)

Source: https://habr.com/ru/post/197974/



All Articles