Recently, we reported a single input channel joint transform correlator for color pattern recognition that decomposes the input color image into its R, G, and B gray components and adds those components into a single gray image in the input plane. This technique has the merit of requiring a single input channel instead of three. However, we found that it cannot discriminate simple primary color patterns, which yield the same gray level after the addition process. Thus, we propose a modified coding technique which selectively recombines the decomposed R, G, and B gray components instead of simply adding them. Simulation results show that the modified coding technique can accurately discriminate a variety of color images.
Pattern recognition can be applied to various fields, for example facial appearance, fingerprint, handwriting, and character recognition [1-7]. These kinds of pattern recognition and target tracking applications can be achieved by the joint transform correlator (JTC) [8-16]. Color information is usually introduced into JTC pattern recognition by means of a multi-channel correlation technique that decomposes the source and the target color images into red, green, and blue (RGB) channels. The correlation is made separately for each channel, and arithmetic or logical point-wise operations can be used to derive the final output [17-20]. The multi-channel JTC methods perform the correlation process in parallel for all the color channels that compose the image, using coherent optical correlators that are illuminated simultaneously by a few coherent sources, each having a different wavelength. The output plane consists of a set of superimposed correlation distributions that must be analyzed independently and composed together to render the detection decision. A disadvantage of this approach is that it requires three different channels, which increases the system cost. For example, each channel needs a spatial light modulator (SLM) and two lenses, and an extra beam splitter and a mirror are also needed. The concept of a multi-channel single output color JTC configuration was initially proposed by Deutsch to overcome such disadvantages [21]. However, the multi-channel single output color JTC has to take into account the separation between the input target image and the reference image. Deutsch [21] found that this separation must be the same for each color and also larger than the sum of the widths of these images in order to prevent any unwanted overlapping between correlation outputs. To yield sharp correlation peaks, Alam et al. proposed a fringe-adjusted JTC based on the Newton-Raphson algorithm [22]. However, they still used a multi-channel input and single output JTC to improve correlation discrimination.
We previously proposed a technique of color pattern recognition that decomposes the color image into three color components and adds those components into a single gray image in the input plane [23]. This technique requires a single input channel instead of three input channels. However, it cannot discriminate simple color patterns such as red, green, and blue, which result in the same gray level through the addition process. Thus, we propose a modified coding technique that selectively recombines the decomposed R, G, and B gray images instead of simply adding them. We present simulated results to show that our newly proposed technique can accurately recognize and discriminate color differences. In Section II, the conventional multi-channel color pattern recognition JTC and its basic theory are presented. In Section III, the proposed new method of the encoded single input channel and single output color pattern recognition JTC system is described in detail. Section IV describes simulation results for the color pattern recognition, and finally, some comments are contained in the conclusion.
II. SINGLE-CHANNEL SINGLE OUTPUT JTC
We present a new single-channel single output JTC for color pattern recognition. This method achieves color pattern recognition by decomposing the color image into three color components (red, green, and blue) and recombining those components into a single gray image in the input plane instead of using the three color components separately. This technique needs a single input channel instead of three input channels and a single output CCD camera, so a simple JTC can be used.
Figure 1 shows the transformation of the three decomposed color components of the color images into a single gray image. In this process, the three color components (red, green, and blue) are first transformed into the corresponding gray components, and these three gray components are then added together to form a single gray image. Equation (1) expresses these recombined single gray images for the reference and target, respectively.
where each component of Eq. (1) represents the gray image of the corresponding red, green, or blue component. Figure 2 shows the optical structure of the basic JTC system for the single-channel and single output color pattern recognition. Thus,
the input joint images
After the input joint image on the LCD1 passes through the lens L1,
Therefore, the JPS, i.e. the intensity of the interfered light, is captured by a light detector on the Fourier plane. The JPS contains noise and DC components which degrade the correlated signal obtained on the output plane in the second stage. The DC components should be blocked to obtain a well-correlated signal. The blocking of the DC components may be implemented by use of a Fourier-plane image-subtraction technique. This paper also uses the non-linear JTC which introduces a non-linearity parameter
The cross-correlation output is obtained by inverse Fourier transforming the JPS through the lens L2, and the output displayed on the output plane can be expressed as Eq. (5).
Only two correlation terms exist at the output plane, compared with the 36 correlation terms of the conventional three input-channel JTC.
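The two-stage process described in this section can be sketched numerically as follows. This is a minimal illustration under stated assumptions, not the simulation code used for the reported results: the function name `jtc_correlation_peak`, the plane size, and the separation between the two images are hypothetical choices, the lenses are modeled by discrete Fourier transforms, and the non-linearity of the non-linear JTC is omitted. The DC terms are removed by subtracting the reference-only and target-only power spectra from the JPS, one common form of Fourier-plane subtraction.

```python
import numpy as np

def jtc_correlation_peak(reference, target):
    """Cross-correlation peak of a simulated JTC for two equal-sized gray images."""
    h, w = reference.shape
    # Assumed input plane: 2h x 6w, wide enough that the two cross-correlation
    # terms do not wrap around and overlap in the discrete output plane.
    plane_shape = (2 * h, 6 * w)

    def place(image, column):
        plane = np.zeros(plane_shape)
        plane[h // 2:h // 2 + h, column:column + w] = image
        return plane

    ref_plane = place(reference, 0)       # reference on the left
    tgt_plane = place(target, 2 * w)      # target separated by 2w (assumption)

    # First stage: joint power spectrum (JPS) of the joint input image.
    jps = np.abs(np.fft.fft2(ref_plane + tgt_plane)) ** 2
    # Fourier-plane subtraction: remove the two auto-correlation (DC) terms.
    jps -= np.abs(np.fft.fft2(ref_plane)) ** 2
    jps -= np.abs(np.fft.fft2(tgt_plane)) ** 2

    # Second stage: inverse transform of the JPS gives the correlation plane,
    # which now holds only the two cross-correlation terms.
    output = np.abs(np.fft.ifft2(jps))
    return output.max()

rng = np.random.default_rng(0)
ref = rng.random((8, 8))
tgt = ref.copy()
tgt[:, 4:] = 0.0                          # target differs in its right half
matched_peak = jtc_correlation_peak(ref, ref)
mismatched_peak = jtc_correlation_peak(ref, tgt)
print(matched_peak, mismatched_peak)
```

In this sketch the matched peak equals the energy of the reference image, and the mismatched target gives a lower peak.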
III. MODIFIED CODING FOR SINGLE CHANNEL INPUT
We previously presented useful results of color pattern recognition by using our single input channel JTC. However, this technique has a limitation. Consider primary single color images such as red, green, and blue instead of mixed color images. A red image has the component values (R=255, G=0, B=0), and green and blue images have the component values (R=0, G=255, B=0) and (R=0, G=0, B=255), respectively. These color images have different color components, but they produce the same recombined gray scale value through the addition process expressed as Eq. (1). Thus, these primary single color images cannot be discriminated by the JTC based on our previously proposed method. We therefore propose a modified technique of coding the recombined single gray image to overcome this problem, as shown in Fig. 3. Figure 3 represents two kinds of processes. Part (a) shows our previously proposed method of obtaining the recombined single gray image through the addition process, and part (b) shows the modified method of obtaining a single gray image through the modified coding process. Figure 4 represents the method of obtaining a single gray input by using the modified coding technique. This modified coding technique requires two steps. First, three R, G, and B arrays of the gray images are obtained by decomposing the original color input images. The three R, G, and B arrays of the reference can be expressed as Eq. (6).
where
Thus, newly coded reference matrix rRGB has first column elements, for example as [
First, in Fig. 5 we prepared 256×256 mixed color images of fruit to evaluate the performance of the proposed encoded single input channel and single output JTC. The fruits in Fig. 5 are composed of three non-zero values of the R, G, and B components. Figure 5(a) is the reference image, and sample 1 (Fig. 5(b)) and sample 2 (Fig. 5(c)) are the target images, which differ from the reference image in the hue of the right-side pepper. We prepared these fruit images to investigate the ability of discrimination when the color of some region of the target image is different from that of the reference. Figure 6 shows two kinds of single gray input images obtained from the original reference image. Figure 6(b) is the recombined image obtained with the addition process, and Fig. 6(c) is obtained through the coding process. Table 1 shows the correlation peaks according to these two kinds of image processing. Table 1 indicates that the correlation peaks decrease considerably when the red pepper of the fruit image shown in Fig. 5(a) changes into green and blue. For the addition process the correlation peak of the matched case is about 1.1487×10^7, but the correlation peaks of the mismatched cases of the green (sample 1) and blue (sample 2) peppers are about 1.0908×10^7 and 0.8948×10^7, respectively. Again, for the coding process the correlation peak of the matched case is about 1.2477×10^7, but the correlation peaks of the mismatched cases of the green (sample 1) and blue (sample 2) peppers are about 1.1215×10^7 and 1.0596×10^7, respectively. Therefore, we can conclude that these two methods are good for this kind of mixed color recognition.
[TABLE 1] Correlation peak according to the method of the single input channel
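To compare the peak values quoted above on a common scale, each mismatched peak can be divided by the matched peak of the same method. The short sketch below does only this arithmetic on the approximate values given in the text; the dictionary layout and the function name `normalized_peaks` are illustrative assumptions.

```python
# Approximate correlation peaks as quoted in the text.
peaks = {
    "addition": {"matched": 1.1487e7, "sample 1": 1.0908e7, "sample 2": 0.8948e7},
    "coding": {"matched": 1.2477e7, "sample 1": 1.1215e7, "sample 2": 1.0596e7},
}

def normalized_peaks(values):
    """Divide every peak by the matched-case peak of the same method."""
    matched = values["matched"]
    return {name: value / matched for name, value in values.items()}

normalized = {method: normalized_peaks(values) for method, values in peaks.items()}

for method, values in normalized.items():
    for name, ratio in values.items():
        print(f"{method:8s} {name:9s} {ratio:.4f}")
```

A ratio below 1 means the mismatched target produces a lower peak than the matched reference, which is the basis of the discrimination.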
Second, in Fig. 7 we prepared 256×256 simple color images to evaluate the performance of the proposed encoded single input channel and single output JTC. Figure 7(a) is the reference image (red), and the others, from (b) green to (f) magenta, are the target images. In this case, these images have at least one zero value among the R, G, and B components. For example, the reference image (red) has the component values (R=255, G=0, B=0), and the green and blue images have the component values (R=0, G=255, B=0) and (R=0, G=0, B=255), respectively. These color images have different color components, but they produce the same recombined gray scale image through the addition process. Likewise, the yellow, cyan, and magenta color images have different color components, but they produce the same recombined gray scale image through the addition process. Thus, we propose a modified technique of coding the recombined single gray image to overcome this problem. As mentioned above, this coding technique rearranges the three gray images of the red, green, and blue components into a single array in R, G, B order instead of using the addition process. This technique places the R, G, and B gray components at different positions in order to form a new matrix, unlike the addition process. Figure 8 shows two kinds of single gray input images obtained from the reference image (red). Figure 8(b) is the recombined image obtained with the addition process, and Fig. 8(c) is obtained through the coding process. Table 2 shows the correlation peaks according to these two kinds of image processing for the simple color images. Table 2 indicates that the correlation peaks show the same value when the addition process is used; thus these simple color images cannot be discriminated by the JTC based on the previously proposed addition method. On the other hand, Table 2 shows that with the coding process the correlation peaks of the target color images are lower than that of the reference; thus we can conclude that simple color images are well discriminated by the JTC based on our newly modified coding method. Figure 9 shows the 2-D correlation peaks from the fruit images shown in Fig. 5 by using the coding process.
[TABLE 2] Correlation peak according to the method of single input channel
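The difference between the two processes for simple color images can be illustrated with a short sketch. The addition function follows the description of the addition process, while the coding function is only an assumed example: it interleaves the R, G, and B values of each pixel on three consecutive rows, which is one way of placing the components at different positions in R, G, B order and is not necessarily the exact arrangement of the proposed coding. The names `encode_by_addition`, `encode_by_interleaving`, and `solid` are hypothetical.

```python
import numpy as np

def encode_by_addition(rgb):
    """Addition process: sum the R, G, B gray components into one gray image."""
    return rgb.astype(np.float64).sum(axis=2)

def encode_by_interleaving(rgb):
    """Assumed coding: R, G, B rows interleaved into a single (3h x w) gray array."""
    h, w, _ = rgb.shape
    coded = np.empty((3 * h, w), dtype=np.float64)
    coded[0::3] = rgb[:, :, 0]   # R component rows
    coded[1::3] = rgb[:, :, 1]   # G component rows
    coded[2::3] = rgb[:, :, 2]   # B component rows
    return coded

def solid(color, size=4):
    """Uniform color image of shape (size, size, 3)."""
    return np.tile(np.array(color, dtype=np.uint8), (size, size, 1))

red = solid((255, 0, 0))
green = solid((0, 255, 0))
yellow = solid((255, 255, 0))
cyan = solid((0, 255, 255))

# The addition process maps red and green (and yellow and cyan) to the same image.
same_after_addition = np.array_equal(encode_by_addition(red), encode_by_addition(green))
# The position-based coding keeps them distinct.
same_after_coding = np.array_equal(encode_by_interleaving(red), encode_by_interleaving(green))
print(same_after_addition, same_after_coding)
```

Because the coded arrays differ wherever the color components differ, a correlator fed with them has a difference to respond to, whereas the summed images are identical.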
In conclusion, the proposed modified coding technique provides a single input channel JTC that avoids the complexity of the three input channel JTC. Moreover, the main advantage of our newly proposed JTC adopting the coding method is that it can discriminate well not only mixed color images but also simple color images. In detail, the final results show that the modified coding technique solves the discrimination problem that arises for simple primary color patterns, which result in the same gray level through the addition process. Thus, we expect that our coding technique for single input JTC color pattern recognition can be used not only for finding a small section of a target, such as in crowd images, but also for discriminating slight color differences in color measurement.