Hyperspectral images are often contaminated with stripe noise, which severely degrades the imaging quality and the precision of the subsequent processing. In this paper, a variational model is proposed by employing spectral-spatial adaptive unidirectional variation and a sparse representation. Unlike traditional methods, we exploit the spectral correction and remove stripes in different bands and different regions adaptively, instead of selecting parameters band by band. The regularization strength adapts to the spectrally varying stripe intensities and the spatially varying texture information. Spectral correlation is exploited via dictionary learning in the sparse representation framework to prevent spectral distortion. Moreover, the minimization problem, which contains two unsmooth and inseparable l1-norm terms, is optimized by the split Bregman approach. Experimental results, on datasets from several imaging systems, demonstrate that the proposed method can remove stripe noise effectively and adaptively, as well as preserve original detail information.
Hyperspectral images (HSIs) have been successfully applied to a myriad of applications such as data classification and target detection [1]. However, they are inevitably contaminated with striping artifacts due to response variations of each detector. The existence of stripe noise not only degrades the image quality, but also limits the precision of the subsequent data analysis. Consequently, it is an essential preprocessing step to remove the stripes.
In the past decades, several methods have been proposed in the extensive literature [2], and can be broadly classified into three major categories. The first category eliminates the striping pattern by filtering techniques. Filters in the spatial domain have the disadvantage of image information loss in spite of simple implementation [3]. Wavelet-based filters only adjust the vertical or horizontal detail components [4-7]. Nonetheless, the structural information with the same frequency is unavoidably removed, which results in blurring and ringing artifacts. The second category, including histogram matching [8] and moment matching [9], makes use of similar statistics of individual detectors and restricts statistical parameters to a predefined reference. Though image statistical properties can be preserved, these equalization methods are highly sensitive to the assumption of statistical homogeneity. The third category is a variational approach [10-12]. The destriping issue is formulated as an inverse problem, and can be tackled in the regularization framework. Despite the effectiveness of these methods, the selection of regulation parameters has a great influence on the performance of destriping results [13].
In reality, however, there exist hundreds of bands with spectrally varying stripes in HSIs. Even in the same band, stripe intensities may be spatially dependent. Thus, selecting regulation parameters band by band is particularly time-consuming and poses a challenging problem to conventional solutions. In Ref. [14], a spectral-spatial adaptive hyperspectral total variation (SSAHTV) denoising model was proposed, in which the regulation parameters accommodated to the differences of the spectral noise and the spatial information. Nevertheless, the results for heavy stripes are not satisfactory, because this model fails to consider the unidirectional property of stripes.
Different from a single image, HSIs usually have hundreds of bands. Furthermore, high spectral correlation exists between adjacent bands. If the stripes are suppressed in the spatial domain only, spectral distortion will be introduced in the destriped image. However, little effort has been devoted to dealing with the spectral correlation. Recently, the sparse representation technique has been successfully applied in denoising problems [15-18]. It is based on the assumption that the noise-free image can be estimated by a linear combination of a few atoms in a redundant dictionary, which can be learned by utilizing the correlation and redundancy in an image [19].
Motivated by the success of the adaptive mechanism in SSAHTV model and the sparse representation in exploiting HSI correlation, we propose a unified spectral-spatial adaptive unidirectional variation and sparse representation method to counter the above problems. The proposed model consists of two
This paper is organized as follows. In Section 2, we describe the spectral-spatial adaptive unidirectional variational model, the sparsity constraint, and the optimization process. Section 3 presents experimental results on synthetic and real images, and discusses the selection of optimal parameters. Finally, summarizations and conclusions are presented in Section 4.
In this paper, the stripes are assumed to be additive, and the noisy HSI
where
The stripes are highly anisotropic and have a directional property, which indicates that only the gradient perpendicular to the stripes is influenced. Motivated by this observation, the authors of Ref. [10] formulated an intuitive unidirectional variation (UV) model as
where ∇
The simplest way to destripe HSIs with the UV model is to handle each single band with possibly different
Then, by taking partial derivatives of the cost function (3) with respect to
where |⋅| represents the absolute value operator. Eq. (4) indicates that different bands are still considered as independent. That is, HSIs are destriped channel by channel, which has the following drawbacks. For different bands, large values of
2.2. Spectral-spatial Adaptive Unidirectional Variation Model
To make the HUV model spectrally adaptive, we borrow the color total variation (CTV) model from Ref. [24] and redefine the HUV model as follows:
Here,
introduces a coupling among all the spectral bands, where
Similarly, the Euler-Lagrange equation derived from (5) takes the following form,
It is shown that a coupling term |∇
Besides the spectrally adaptive property discussed above, the spatially dependent property in different regions remains a critical problem. We use a simple spatial information extractor
Here,
Here,
Embedding the weight in Eq. (9) into Eq. (5), a novel spectral-spatial adaptive unidirectional variational (SSAUV) model is thus
2.3. Sparse Representation Regulation
The above SSAUV model fails to take the spectral correlation into consideration. To tackle this problem, we merge the SSAUV model with a sparse representation, in which the spectral correlation is utilized to learn a dictionary and the noise-free image can be approximated by a linear combination of its atoms [19].
The denoising model based on sparse representation for HSIs can be defined as the following minimization problem:
Here,
Given the computational efficiency and implementation simplicity, we reshape the 3D HSI cube by concatenating all its vertical spectral slices into a
To this end, we arrive at the final cost function with the constraint of the spectral sparsity imposed on the SSAUV model, that is
Here in,
Following Ref. [29], (13) is transformed to the following constrained problem by introducing two auxiliary variables
Sequentially, (14) is further converted into an unconstrained minimization problem by using a quadratic penalty function:
where
The
which is a least-square problem. Nulling the first derivative of (16) yields the following formula
which is strictly diagonal and can be solved using the Gauss-Seidel algorithm.
The
whose solution is
where in shrink(
Then, we update the Bregman variables as follows:
Finally, the normalized mean square error (NMSE) is applied as the stopping criterion.
To validate the destriping performance, we performed simulations and actual experiments, and compared our method with the combined wavelet-Fourier filter (wavelet-FFT) [5], the variational stationary noise remover method (VSNR) [30], the SSAHTV method [14], and the UV method [10]. For fair comparison, parameters in these algorithms were kept equal for each band. They were tuned until the best performance was obtained.
In all cases, the gray values of HSIs were normalized between [0, 1]. We refer to peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) [31] to quantitatively assess the improvement. The mean indices of all the bands are denoted as MPSNR and MSSIM. Generally, larger index values indicate better restoration results.
In this subsection, the Washington DC Mall data, acquired by Hyperspectral Digital Imagery Collection Experiment (HYDICE), was utilized to test the effectiveness of the proposed method. This HSI contained 307 × 307 pixels and 191 spectral bands.
Under the assumption that the raw data was noise-free, the striped images were obtained by adding the same zero-mean Gaussian noise
Bands 58 and 156 are selected as typical bands. The results for Scenario 1 are shown in Figs. 2 and 3, and Scenario 2 in Figs. 4 and 5. It is obvious that the proposed method achieves the best destriping results, where the stripes are altogether removed and the structural details are well preserved. In comparison, the band-wise wavelet-FFT method can remove most of the stripes and improve the visual quality of the images, but a few residual stripes still remain as indicated by the ellipse marks. Even worse, the image contrast is unexpectedly decreased. The results via the SSAHTV model are over-smoothed and exhibit severe loss of details. Although this model has taken into account the differences of the spectral and spatial information, it fails to exploit the directional property of the stripes. Essentially, it is an isotropic model, which accounts for the residual stripes.
The effect of sparse representation (SR) was also investigated. The SR term was removed and the stripes were suppressed in the SSAUV model. The results in Figs. 2(e)-5(e) show that some vertical structures, i.e., the road in the ellipse marks, are degraded together with the stripes, which implies that spectral distortion may be induced. This is primarily because the SSAUV model only emphasizes the smoothness perpendicular to the stripes, and the spectral correlation is ignored. For Scenario 2, Figs. 6 and 7 plot the spectrum differences at pixels (100, 200) and (300, 300) of each band, respectively. It is apparent that the proposed method inhibits spectral distortion, benefiting from the sparsity constraint.
Band-wise PSNR and SSIM values of the two scenarios are presented in Figs. 8 and 9. The proposed method achieves a consistent improvement of over 2 dB in terms of PSNR, illustrating that our method is rather noise insensitive. In addition, the simulations were repeated 10 times using different random noise patterns. The arithmetic mean of MPSNR and MSSIM with their uncertainties, denoted by M-MPSNR and M-MSSIM, is condensed in Table 1. The numbers in parentheses correspond to Type A standard uncertainty. Consistent with the visual performance, the proposed method outperforms other methods in point of PSNR and SSIM values. The simulations have demonstrated the compelling performance of our method in preserving details and suppressing stripes with less spectral distortion, regardless of stripe intensities.
[TABLE 1.] Quantitative evaluation of the simulated experiments before and after destriping
Quantitative evaluation of the simulated experiments before and after destriping
3.2.1. EO-1 Results
The Earth Observing-1 (EO-1) Hyperion data was acquired by the Hyperion sensor over Hualien County, Taiwan on May 4th, 2005. The original data, with random noise and stripes, was composed of 242 bands in spectrum range of 0.4 to 2.5 mm. Each band contained 256×6974 pixels. After removing some bands with little information, the final test data was cropped to the size of 256×256×149. The band indexes of the test data were 8~57, 77~115, 117~120, 131~133, 135, 138, 140~162, 164, 171~176, 179~180, 182, 184, 195~209, and 222~223, respectively.
The results of band 96 with tiny stripes are shown in Fig. 10. As the band-wise wavelet-FFT and VSNR methods neglect the spectrally varying intensities of the stripes, the global parameters appear so large for this band that the destriped images are excessively smoothed. Hence, heavy artifacts are introduced. For the SSAHTV method, these slight stripes are removed, without causing noticeable artifacts. The destriped image in Fig. 10(e) seems plausible. However, the UV method focuses too much on the destriping, which leads to loss of useful information.
The results of band 173 with severe stripes are shown in Fig. 11. For the wavelet-FFT method, although most of the stripes are alleviated and visually pleasant results are achieved, some residual stripes persist in the ellipse area marked in Fig. 11(b). For the results via the SSAHTV model, quite a part of stripes can be observed. Even worse, the images appear over-smoothed and suffer from significant loss of details. Similar results are achieved by the VSNR method. According to the destriping result of UV, this method is less robust than the proposed method, because we can still see stripe artifacts in the ellipse area in Fig. 11(e).
Evidently, our approach provides the best destriping results. The stripes in smooth regions are fully erased, while the detail information is well preserved, which is attributed to the adaptive mechanism in the spatial domain. Furthermore, the spectrally varying intensities of the stripes are sufficiently considered: On one hand, small denoising strengths are enforced to band 96, thus the maximum data genuineness is preserved in Fig. 10(f). On the other hand, large denoising strengths are enforced to band 173, thus the stripes are effectively removed in Fig. 11(f). Besides, benefiting from the sparse representation term, the spectral correlation is exploited and the contrast of the destriped image is not decreased.
Figs. 12 and 13 display the mean cross-track profiles of bands 96 and 173 before and after destriping, respectively. Owing to the existence of the stripes, the curves in Figs. 12(a) and 13(a) exhibit rapid fluctuations. It can be seen that the curves of our results are quite smooth and follow the similar trend as the raw data. For the wavelet-FFT method, the curves appear over smoothed, indicating that the textures are unexpectedly degraded. Even though the fluctuations in Figs. 12(c), 12(d), 13(c), and 13(d) are somewhat suppressed, a few burrs, indicating residual stripes, still remain. For the UV method, the trend of the cross-track profile in Fig. 12(e) differs a lot from Fig. 12(a), which indicates that brightness distortion is introduced in the destriping process.
3.2.2. Urban Results
The Urban scene in Texas was collected by the HYDICE sensor, which could be downloaded online at http://www.tec.army.mil/hypercube. This HSI was ravaged by random noise and stripes simultaneously, and consisted of 307×307 pixels and 210 bands ranging from 0.4 to 2.4 mm. Bands 104-108, 139-151, and 207-210 were abandoned for high degree of noise and little useful information. The remaining test data was 307×307×188 in size.
The destriping results of bands 109 and 204 are presented in Figs. 14 and 15, respectively. Subjectively, the proposed model is superior to the other four contrasting methods. In the band-wise wavelet-FFT method, the damping factor depends on the whole data cube and stays constant for each band. Consequently, for the bands with slight stripes, the constant damping factor is too large to preserve details, as shown in Fig. 15(b). The SSAHTV method is capable of eliminating the slight stripes, as shown in Fig. 15(c), whereas a few severe stripes in Fig. 14(c) still exist since the directional characteristic of stripes is not taken into consideration. Besides, random noise is suppressed to some extent. For the VSNR method and the UV method, the residual stripes are still significant in Figs. 15(d) and 15(e), respectively.
3.3. Convergence Speed and Parameter Selection
To measure the convergence speed of the proposed method, Fig. 16(a) plots NMSE as a function of iteration numbers on the simulated experiment Scenario 2. The NMSE curve decreases significantly in the first five iterations due to the efficiency of the split Bregman method. Since the evolution curve converges after about 17 iterations, the maximum number of iterations
The selection of regulation parameters
As seen in Fig. 16(c), the MAE curve decreases gradually with the increase of
To test the parameter
Selection principles of the parameters in split Bregman and sparse representation methods are not particularly discussed, as they are beyond the scope of this paper.
In this paper, we present a variational method to destripe HSIs. We take the stripe noise properties of unidirection, spectral correction, and variation of stripe intensities in different bands and regions into full consideration. Instead of manual selection band by band, regulation parameters adapt to the spectrally varying intensities of stripes and the spatially varying texture information. Moreover, spectral correlation is exploited via the sparse representation constraint to prevent spectral distortion. Then, the split Bregman method is employed to optimize the minimization problem. Experiment results demonstrate that the proposed method can remove the stripes in different bands and different regions efficiently, without distortion in the spectral domain and loss of structural details. In addition, the proposed method is robust to parameter selection and stripe intensities.
The proposed method may be computationally complex, as any iterative algorithm. Future extension of this work may include speeding up the method with parallel techniques, such as graphic processing units (GPUs). Destriping, as well as deblurring, dehazing, and super resolution is a typical inverse problem. It may be interesting to tackle these issues simultaneously in one framework.