Palmprint Verification Using Multi-scale Gradient Orientation Maps

Kim Min-Ki

doi:10.3807/JOSK.2011.15.1.015

OA학술지
Journal of the optical society of Korea

Palmprint Verification Using Multi-scale Gradient Orientation Maps

DOI : 10.3807/JOSK.2011.15.1.015
Author: Kim Min-Ki
Organization: Kim Min-Ki
Publish: Journal of the optical society of Korea Volume 15, Issue1, p15~21, 25 March 2011

ABSTRACT

Palmprint Verification Using Multi-scale Gradient Orientation Maps

KEYWORD

Palmprint verification , Slope direction , Kirsch operator , Orientation map , (070.5010) Pattern recognition , (100.2000) Digital image processing , (070.4560) Data processing by optical means

본문

Collapse all

I. INTRODUCTION

With the rapid progress of electronics and Internet commerce,personal identification is becoming increasingly important.Biometrics technology is considered one of the safest and most efficient ways to identify individuals or discriminate between an authorized person and an impostor. Biometrics comprises methods of uniquely recognizing persons based on their intrinsic physical (e.g., fingerprint, finger geometry, face, iris, etc.) or behavioral (e.g., voice, signature, gate, etc.) characteristics [1-3]. Among these traits, the palmprint has attracted many researchers’ attention in the last 10 years. The palmprint, the large inner surface of the hand, contains many unique features such as principal lines, wrinkles, ridges, minutiae points, singular points, and texture. Earlier studies focused on high-resolution (at least 400 dpi) palmprint images for some applications such as law enforcement. Nowadays, most studies focus on low-resolution palmprint images for civil and commercial applications.

There are many approaches to palmprint recognition. Line-based approaches concentrate on extracting palm lines such as principal lines and wrinkles [4-8]. The extracted lines are usually encoded based on their orientation and magnitude. Appearance-based approaches use holistic appearance features such as principal component analysis [9, 10], the Fisher linear discriminant [11], and independent component analysis [12]. Statistical approaches transform an input image into another domain and divide the transformed image into small regions. Then statistical features are extracted from each small region [13, 14]. According to the results of these studies, line-based approaches are deemed to produce the most promising results. In particular, some studies [5,7, 8] based on line orientation information are considered the state-of-the-art methods. Kong and Zhang [5] used six 2D Gabor filters with different directions, and extracted the dominant orientation information using the winner-takes-all rule. Wu et al. [7] devised four directional templates to define the orientation of each pixel. Jia et al. [8] devised another six directional templates based on modified finite Radon transformation.

In line-based methods, it is important to classify points as located on a palm line or not. This classification poses a challenge. Kong and Zhang [5] assumed that each point on a palmprint image is located on a palm line. Under this assumption, the orientation of each point is extracted. This approach is simple and has shown promising results. Thus, the following researches [7, 8] used the same approach.This approach has some problems, though. The line structures in a palmprint image are complex, and multiple lines may intersect in some regions. Although the orientation information of the pixels that are on palm lines is robust to variations in lighting conditions, the orientation information of the other pixels could be vulnerable to changes in illumination. That is, the orientation information of a point that is not located on a palm line is more fragile than that of a point on a palm line. Furthermore, the majority of the points are not located on a palm line. To solve this problem,this paper proposes a new viewpoint. When a palmprint image is hypothetically considered a 3D terrain, the principal lines and wrinkles become deep and shallow valleys on a palm landscape. All the line-based methods have tried to detect the orientation of valleys, but this study focuses on the slope caused by deep and shallow valleys. The main goal of this study is to detect the orientation of the steepest slope in a local area.

The rest of this paper is organized as follows. Section 2 presents a preprocessing method of extracting a region of interest (ROI) in a palmprint image and proposes a new feature that represents the dominant slope orientation of each pixel. Section 3 presents the feature-coding method by which a gradient orientation map is created, and describes a feature-matching method based on pixel-unit comparison. In Section 4, the experimental results and analysis are provided, and finally, the conclusions are described in Section 5.

II. EXTRACTION OF PALMPRINT FEATURES

   2.1. Palmprint Preprocessing

When palmprint images are captured, some variations may occur such as in translation and rotation. Hence, palmprint images should be aligned in position and orientation before the feature extraction step [15, 16]. The central part of a palm is a region of interest (ROI), from which a palmprint feature is extracted. The five main steps in cropping the ROI are as follows.

(1) The fixed threshold method is used to convert the original image into a binary image. Some noises, including isolated pixels, can be ignored.

(2) The boundary of the palm is traced and then smoothed with a Gaussian filter.

(3) Two reference points, R1 and R2, are located using a turning angle. Three equidistant points, p1, p2, and p3,on the boundary form two vectors that extend from p1 to p2 and from p2 to p3. The two vectors form a turning angle, as shown in Fig. 1 (e). The three points move together clockwise around the boundary. The turning angle varies according to the relative positions of the three points. If the direction of a vector is counterclockwise, the turning angle has a negative value. Otherwise, it has a positive value. R1 is the position of point p2, where the turning angle has the smallest negative value in the top left area of the boundary. Similarly, R2 can be found in the bottom left area of the boundary.

(4) The central point of a palm (Pc) is located. R1 and R2 are lined up to get the Y axis of the palmprint coordinate system, and the perpendicular line L that passes through the midpoint (Pm) of the two points is used. Pc is found on line L so that the length between Pm and Pc would equal a predefined value.

(5) ROI is defined as a squared region, which is extracted with Pc as its central point. ROI is cropped and the skew is corrected via rotation.

[FIG. 1.] The main steps in the preprocessing: (a) original image (b) binarized image (c) smoothed boundary (d) top left area andbottom left area (e) turning angle (f) two reference points (g) a central point and its ROI and (h) ROI image.

   2.2 Gradient Feature Extraction

For a function f(x, y), the gradient of f at coordinate (x, y) is defined as the vector:

The gradient vector points in the direction of the greatest rate of increase of f(x, y). When a palm image is hypothetically considered a 3D terrain, the intensity value I(x, y) is the height above sea level at a point (x, y). The gradient at a point is a vector that points in the direction of the steepest slope at that point. The steepness of the slope at that point is given by the magnitude of the gradient vector. Thus, the local topographic property in a palm image can be described with the gradient direction and magnitude for each pixel.

The directional feature of the gradient is more robust than the feature of the magnitude with respect to variations in lighting conditions. Thus, this study focuses on the directional feature. To compute a gradient vector, the Kirsch operator[17] is used. The Kirsch operator is a non-linear edge detector that finds the maximum edge strength in a few predetermined directions. The direction of the gradient at point (x, y) is calculated as follows for directions with a π/4 difference:

where g^(k) is the derivative kernel and I(x, y) is the gray value at point (x, y). The argument k of the maximum value becomes the direction of the gradient vector. Therefore,unlike in Huang’s study [18], directional decomposition of a gradient vector is not necessary. This approach is also computationally effective because it requires only integer multiplication and comparison. If an image is crudely convoluted with the derivative kernels, as the edge detector does, the gradient direction is unstable because the 3×3 area is too small to contain enough information. To overcome this problem, a w×w scanning window is introduced. The scanning window consists of nine b×b-sized blocks. Fig. 2 shows an example of a 9×9 scanning window with nine 3×3 blocks. The local image in the scanning window is compressed into a 3×3 image by averaging the gray values of the pixels in each block. Then the compressed image is convoluted with each derivative kernel of the Kirsch operator.

[FIG. 2.] The convolution operation between a compressedimage and derivative kernels.

III. FEATURE CODING AND MATCHING

   3.1. Feature Coding

Feature coding of the palmprint is performed by replacing all the pixels in the ROI with their index in the gradient direction. Feature coding creates an orientation map. If all the pixels in a cropped ROI image are encoded, if the size of the ROI image is 128×128, the size of the orientation map is also 128×128, as shown in Fig. 3 (b). It requires much processing time and creates redundant information. The computation time and the amount of redundant information can be reduced via downsampling. If every fourth pixel is selected, the processing time becomes one-sixteenth shorter, and the size of the orientation map becomes 32×32. As shown in Fig. 3 (c), feature coding saves most of the important information in the higher-resolution orientation map. On the other hand, if the ROI image is downscaled and convoluted with the same scanning window, orientation maps with different scales can be created. The downscaling is performed by averaging the intensity of the pixels in the non-overlapping 2×2 window. Fig. 3 (d) and (e) are created from half-scaled and quarter-scaled ROI images, respectively. The orientation map represents local or global gradient information according to the scale.

[FIG. 3.] An ROI image and its orientation maps: (a) a 128×128 ROI image (b) a 128×128 orientation map (c) a 32×32 orientation map(d) a 32×32 half-scale orientation map and (e) a 32×32 quarter-scale orientation map.

   3.2. Feature Matching

The basic strategy for calculating the similarity between two feature images is pixel-to-pixel matching. Let Q_O and R_O denote the orientation maps that are acquired from a query image Q and a reference image R, respectively. Each pixel in Q_O is compared with the corresponding pixel in R_O. The distance between two corresponding pixels in Q_O and R_O is calculated by modulo distance, because each pixel represents the orientation index between 0 and 7. The distance of the two images Q and R with the size m×n can be described as:

where the function f_k represents the modular k distance(i.e., the distance on the circle: so the modular eight distance between 0 and 7 is 1, not 7), Q_M and R_M denote masking maps of Q and R; and the symbol ^ is the logical AND operator. The masking maps Q_M and R_M are automatically acquired through a thresholding procedure that generates a mask to identify the location of the non-palmprint pixels. If the gray level of a pixel is higher than a predefined threshold, the corresponding pixel at the masking map is 1. Otherwise, the pixel is 0. Masking maps are used to alleviate the mismatch problem caused by the displacement of the palm during the process of data acquisition [15]. Fig. 4 shows examples of two palms captured from the same person, and their ROIs and masking maps.

A translated or rotated palm image is mostly aligned in the preprocessing step, in which the ROI of a palm is extracted. Any remaining variation due to imperfect preprocessing can degrade the performance of the pixel-to-pixel matching method, though. Thus, the pixel-to-cross-shaped area comparison method, which was introduced in the study of Jia et al. [8], was used to improve the matching robustness. The distance function described in Eq. (3) is modified as follows:

where (i’, j’)∈{(i-1, j), (i, j-1), (i, j), (i, j+1), (i+1, j)}.The function d(Q→R) represents the distance from Q to R. The distance from R to Q can also be described as d(R→Q). Finally, the distance function can be calculated as follows.

[FIG. 4.] Palm images captured from the same person: (a)example of the normal placement of a palm and its ROI andmasking map and (b) example of the misplacement of a palmand its ROI and masking map.

   3.3. Fusion of Matching Distances

In this study, three orientation maps were made with different scales to represent local and global gradient information. The local gradient information is extracted from the cropped 128×128 ROI images, and its two downscaled 64×64 and 32×32 images are used to represent the gradient information in the wider area. Before the matching distances that were obtained from the multi-scale orientation maps were fused,distance normalization was required because they have different statistical properties such as their mean, deviation, and minimum and maximum values. The Min-Max method described in Eq. (6) was used for the normalization. The quantities max(D) and min(D) specify the end points of the distance range. This method maps the raw distances to the [0, 1] range.

Two or more normalized distances can be fused together to get the final matching distance. The final matching distance between a query image Q and a reference image R is denoted by the following equation:

where w_m and n_m denote the weight and the normalized distance of the matcher m, respectively, and M is the number of matchers. The weight, w_m, is determined according to each fusion method.

IV. EXPERIMENTAL RESULTS

   4.1. Palmprint Database

The widely used public database, PolyU [19], was used to evaluate the performance of the proposed method. The database contains a total of 7,752 images from 386 different palms. The palmprint images were collected in two sessions. In each of the sessions, around 10 samples for each palm were captured. The average interval between the first and second sessions was about two months. In addition, the light source and the focus of the CCD camera were changed so that the images that were collected in the first and second sessions could be regarded as having been captured by two different devices [8]. An LED was used in the first session, and an incandescent lamp was used in the second session. Also, the lenses in the two sessions slightly differed, and the focus in the first session was slightly longer. Some samples in this database are shown in Fig. 5, in which the four samples in the top row were captured in the first session and those in the bottom row were captured in the second session. The two images in the same column were captured from the same palm at different sessions.

[FIG. 5.] Some samples from the PolyU palmprint database.

The database was divided into two data sets: training and testing. The training and testing data sets consisted of the samples that were taken in the first and second sessions, respectively. Only the training data set was used to adjust the parameters of the size of the scanning window and the resolution of the orientation map. Palmprint verification was performed by matching a sample taken from the testing data set with the registered templates that were made from the training data set. This is a more realistic experiment because the two data sets were not collected in the same session, as is always the case in real-world applications.

   4.2. Selection of the Window Size and the Resolution

To test the effects of some parameters, 1,000 samples were selected from 200 different palms in the training dataset. Only the first sample of each palm was selected to create the reference model, and the other four samples of each palm were used for testing. The size of the scanning window and the resolution of the orientation map are important parameters that affect the recognition accuracy and the processing time. The size of the scanning window controls the sensitivity of the local intensity variation, and the shifting unit of the scanning window controls the resolution of the orientation map. If the shifting unit doubles, the resolution of the orientation map is halved both horizontally and vertically. The nearest neighbour classifier was used for the recognition. Fig. 6 shows the recognition rate for the different sizes of the scanning window with different resolutions of the orientation map. When the local image in the scanning window was directly convoluted with the derivative kernels, that is, when the 3×3 scanning window was used, poor results were achieved. In particular, when the size of the orientation map was 32×32 or less, the performance was drastically degraded.In the case when the compressed image was used, promising results were achieved. In addition, the performance was well-preserved even in the 32×32 orientation map. The larger the scanning window and the higher the resolution of the orientation map were, the more processing time was needed, so the 9×9 scanning window and the 32×32-resolution orientation map were used.

[FIG. 6.] The effects of the size of the scanning window and the resolution of the orientation map.

   4.3. Palmprint Verification

A biometric system classifies an individual as either a genuine user or an impostor. Thus, the system may commit two types of recognition error: it may either falsely accept an impostor or falsely reject a genuine user. If the matching distance does not exceed the appointed threshold, the palmprint is accepted. Otherwise, it is rejected. The false acceptance rate (FAR) is the probability that an unauthorized individual will be accepted, and the false rejection rate (FRR) is the probability that an authorized user will be inappropriately rejected. If a system is designed so that it would be more difficult for an impostor to enter by adjusting the threshold(i.e., reducing the FAR), the system also becomes more difficult for a valid person to enter (i.e., the FRR will increase) [20]. Thus, the two rates contradict each other and cannot be lowered at the same time. The genuine acceptance rate (GAR) is also dependent on the threshold, because it is calculated with 1-FRR at a specific FAR. The equal error rate (EER) is independent of the threshold, however, because it is the rate at which the FAR is equal to the FRR.Therefore, the EER can be used as an application-independent metric. The decidability index d’ [21] does not use the threshold but the statistics of two groups: the genuine group and the impostor group. Thus, it is also an applicationindependent metric. For this reason, the EER and the decidability index d’ were used to evaluate the performance of the proposed method.

Three data sets, N = 100, 200, and 386, were used to evaluate the verification accuracy and extensibility of the proposed method. Each set contained all the samples of N different palms in the testing data set. When N = 386, the total number of matchings was about 1,489,960 (386×10×386) for each scale. The number of genuine matches was 3,860(386×10), and the rest were impostor matches. Table 1 shows the results of the matching with the three data sets. Generally, the downsampled 64×64 images performed best with respect to the EER, and the original 128×128 ROI images performed best with respect to the index d’. As the number of subjects increases, the EER rises and the index d’ falls. The decreasing rate of performance is less, however, than the increasing rate of N. This means that the proposed method is extensible to a larger dataset. The distribution of the genuine and impostor matching distances when N =386 is shown in Fig. 7. There are two distinct peaks in the distributions. One corresponds to the correct matching distance, and the other corresponds to the incorrect matching distance. These two peaks are widely separated, and the overlapping area is very small. The distributions differ with the scale of the ROI images. The distributions of the downscaled images are more broadly spread than those of the original ROI images.

[TABLE 1.] Comparison of the verification results with different scales and data sets

Comparison of the verification results with different scales and data sets

[FIG. 7.] The distributions of the genuine and impostor matching distances (N = 386).

[TABLE 2.] Comparison of the verification results of four different fusion methods

Comparison of the verification results of four different fusion methods

To find the fusion method that can enhance the accuracy of the palmprint verification, four different fusion methods were experimented on: simple-sum, min-score, max-score,and matcher weighting [22]. Table 2 shows the verification results of the fusion methods, where M₁, M₂, and M₃ denote the matchers that used the features taken from the 128×128, 64×64, and 32×32 ROI images, which is acquired by downscaling with 1/1, 1/2, 1/4, respectively. Three different downsampling rates of 1/4, 1/2, 1/1 were used to get the gradient orientation maps of 32×32. In consequence, it produces the orientation map of the same resolution regardless of the different sizes of ROI images. All the fusion methods improved the verification performance. In particular, the matcher-weight fusion method showed the greatest performance improvement. The effect on the performance improvement with the addition of one additional scale to the two-scale orientation map was not significant.

   4.4. Comparison with Other Palmprint Verification Methods

The approach in this study was compared with three line-based methods that are based on orientation coding:competitive coding (CompCode), palmprint orientation coding(POC), and robust line orientation coding (RLOC). To compare the features of these different methods under the same conditions, the preprocessing routine was shared and the same strategy for pixel-to-cross-shaped area matching was used. The verification results are described in Table 3.

Comparison of the four methods in which a single-scale feature was used showed that the proposed approach M₂ had the lowest EER and that CompCode had the highest d’. The fused method, however, in which orientation maps with three different scales were used, performed best not only in terms of the EER but also in terms of the decidability (d’). These results validate the effectiveness of the proposed gradient orientation feature. It also shows that the verification could be greatly improved by fusing orientation maps with different scales.

[TABLE 3.] Comparison of the verification results of the line-based methods

Comparison of the verification results of the line-based methods

V. CONCLUSIONS

In this study, a new approach to palmprint verification based on gradients was proposed. A palm image was hypothetically considered a 3D terrain. From this viewpoint, the principal lines and wrinkles become deep and shallow valleys. Most previous studies based on orientation coding focused on the direction of the valley itself, but this study focused on the steepest slope direction in each local area, where the slope is mainly caused by the valleys.

The experimental results showed that the gradient orientation feature that was obtained using the Kirsch operator is highly effective in palmprint verification. They also showed that the proposed method is superior to other methods based on line-based orientation coding. In particular, the fusion of orientation maps with different scales greatly improved the verification, regardless of the fusion method. Although this work was presented in the context of palmprint verification, the proposed approach is general enough, and the ideas can be applied to other biometrics such as iris and finger knuckle verification.

참고문헌

1. Zhang D, Jing X, Yang J 2006 Biometric Image Discrimination Technologies
2. Kang B, Park K 2009 Multimodal biometric authentication based on the fusion of finger vein and finger geometry [Opt. Eng.] Vol.48 P.090501
3. Jeong M 2009 Analysis of fingerprint recognition characteristics based on new CGH direct comparison method and nonlinear joint transform correlator [J. Opt. Soc. Korea] Vol.13 P.445-450
4. Kumar A, Wong D, Shen H, Jain A “Personal verification using palmprint and hand geometry biometric” P.668-678
5. Kong A, Zhang D “Competitive coding scheme for palmprint verification” P.520-523
6. Yue F, Zuo W, Zhang D, Wang K 2009 Competitive code-based fast palmprint identification using a set of cover trees [Opt. Eng.] Vol.48 P.067204
7. Wang X. Wu K, Zhang D “Palmprint authentication based on orientation code matching” P.555-562
8. Jia W, Huang D, Zhang D 2008 Palmprint verification based on robust line orientation code [Pattern Recognition] Vol.41 P.1504-1513
9. Lu G, Zhang D, Wang K 2003 Palmprint recognition using eigenpalms features [Pattern Recognition Letters] Vol.24 P.1463-1467
10. Ekinci M, Aykut M 2007 Gabor-based kernel PCA for palmprint recognition [Electron. Lett.] Vol.43 P.1077-1079
11. Wu X, Zhang D, Wang K 2003 Fisherpalms based palmprint recognition [Pattern Recognition Letters] Vol.24 P.2829-2838
12. Lu G, Wang K, Zhang D “Wavelet based independent component analysis for palmprint recognition” P.3547-3550
13. Han Y, Tan T, Sun Z “Palmprint recognition basedon directional features and graph matching” P.1164-1173
14. Pan X, Ruan Q 2009 Palmprint recognition using Gaborbased local invariant features [Neurocomputing] Vol.72 P.2040-2045
15. Zhang D, Kong W, You J, Wong M 2003 Online palmprint identification [IEEE Transactions on PAMI] Vol.25 P.1041-1050
16. Struc V, Pavesic N 2009 Phase congruency features for palm-print verification [IET Signal Processing] Vol.3 P.258-268
17. Kirsch R 1971 Computer determination of the constituent structure of biological images [Computers & Biomedical Research] Vol.4 P.315-328
18. Huang L, Shimizu A, Hagihara Y, Kobatake H 2003 Gradient feature extraction for classification-based face detection [Pattern Recognition] Vol.36 P.2501-1511
19.
20. Wu X, Wang K, Zhang D 2005 Wavelet energy feature extraction and matching for palmprint recognition [Journal of Computer Science and Technology] Vol.20 P.411-418
21. Daugman J 2003 The importance of being random: statistical principles of iris recognition [Pattern Recognition] Vol.36 P.279-291
22. Snelick R, Uludag U, Mink A, Indovia M, Jain A 2005 Large-scale evaluation of multimodal biometric authentication using state-of-the-art systems [IEEE Transactions on PAMI] Vol.27 P.450-455

OAK XML 통계

이미지 / 테이블

[ FIG. 1. ] The main steps in the preprocessing: (a) original image (b) binarized image (c) smoothed boundary (d) top left area andbottom left area (e) turning angle (f) two reference points (g) a central point and its ROI and (h) ROI image.
[ FIG. 2. ] The convolution operation between a compressedimage and derivative kernels.
[ FIG. 3. ] An ROI image and its orientation maps: (a) a 128×128 ROI image (b) a 128×128 orientation map (c) a 32×32 orientation map(d) a 32×32 half-scale orientation map and (e) a 32×32 quarter-scale orientation map.
[ FIG. 4. ] Palm images captured from the same person: (a)example of the normal placement of a palm and its ROI andmasking map and (b) example of the misplacement of a palmand its ROI and masking map.
[ FIG. 5. ] Some samples from the PolyU palmprint database.
[ FIG. 6. ] The effects of the size of the scanning window and the resolution of the orientation map.
[ TABLE 1. ] Comparison of the verification results with different scales and data sets
[ FIG. 7. ] The distributions of the genuine and impostor matching distances (N = 386).
[ TABLE 2. ] Comparison of the verification results of four different fusion methods
[ TABLE 3. ] Comparison of the verification results of the line-based methods