Electrocardiogram Signal Compression with Reconstruction via Radial Basis Function Interpolation Based on the Vertex

Ryu Chunha; Kim Tae-Hun; Kim Jungjoon; Choi Byung-Jae; Park Kil-Houm

doi:10.5391/IJFIS.2013.13.1.31

Open Access Journal
International Journal of Fuzzy Logic and Intelligent Systems

Published in: International Journal of Fuzzy Logic and Intelligent Systems, Volume 13, Issue 1, pp. 31-38, 25 March 2013

ABSTRACT

Patients with heart disease need long-term monitoring of the electrocardiogram (ECG) signal with a portable electrocardiograph. Such monitoring calls for compact data storage and fast transmission to physicians for diagnosis. The ECG signal must therefore be represented efficiently for storage, processing, and transmission, while its data retain the components that matter for diagnosis, such as the P wave, QRS complex, and T wave. In this study, we compress the signal by selecting as vertices the samples whose curvature exceeds a threshold value. We then reconstruct the signal from the compressed data by radial basis function interpolation. This technique yields a low percentage of root mean square difference for the extracted sample points and preserves all the important features of the ECG signal. Its effectiveness is demonstrated in experiments on the Massachusetts Institute of Technology and Boston’s Beth Israel Hospital (MIT-BIH) arrhythmia database.

KEYWORDS

Electrocardiogram, Data compression, Radial basis function, Interpolation, Vertex, Signal reconstruction


1. Introduction

The electrocardiogram (ECG), which records the electrical activity of the heart, is an invaluable tool for diagnosing heart disease and following its prognosis. Because an ECG signal is generally sampled at more than 200 Hz, a large amount of data must be compressed for storage and transmission. Digitizing the ECG data precisely requires a high-performance processor with mass storage, which makes portable ECG devices, with their limited resources, difficult to develop. In addition, the ECG data must be managed efficiently for long-term monitoring of the patient’s heart condition [1-3]. Data compression is therefore needed for efficient storage, processing, and transmission. Data compression detects and eliminates redundancies in a given data set; it must achieve maximum data reduction while preserving the significant morphological features upon reconstruction, and the reconstruction error should stay within a predefined tolerance. Figure 1 shows the overall block diagram of the ECG signal model for compression and classification of a disease syndrome.

The ECG compression schemes studied thus far fall into direct time-domain techniques and transform-domain techniques. Various research works have also been presented for the exact delineation of ECG signals based on numerical differentiation, pattern recognition, and mathematical models. The direct techniques include the amplitude zone time epoch coding (AZTEC), fan, differential pulse code modulation (DPCM), turning point (TP), and coordinate reduction time encoding system (CORTES) algorithms [4-6]. Because these techniques detect and eliminate redundancies directly in the samples, they compress efficiently. However, they produce a large reconstruction error and considerable signal distortion. The transform methods include the wavelet transform (WT), discrete cosine transform (DCT), Fourier transform (FT), Walsh transform, and Karhunen-Loève transform (KLT), among others [7-9]. Most of these transforms compact the signal’s energy into a few transform coefficients, so the many coefficients that carry little energy can be discarded.

[Figure 1.] Overall block diagram of the electrocardiogram signal model.

Compression techniques for ECG signals should not lose any relevant medical information, and a high compression ratio (CR) is desired. The CR is defined as the ratio between the sizes of the original and compressed signals. The error is evaluated as the percentage of root mean square difference (PRD) between the original and reconstructed signals. To improve ECG compression quality, an algorithm should reduce the PRD without distorting any relevant diagnostic information. In this paper, we show that the compression must retain all specific points in the ECG signal and that the parameters of the ECG mathematical model can be used to reconstruct the ECG signal. A typical ECG signal has a P wave, QRS complex, and T wave, as shown in Figure 2, all of which are important for diagnosis. These feature points have a larger signal variation rate than the other regions. Therefore, we first select the vertices whose curvature values exceed a threshold value and add the zero crossing points as supplementary points. Subsequently, we reconstruct the signal from the compressed data using radial basis function (RBF) interpolation. RBF interpolation is mainly used in geological applications [10], which involve three-dimensional modeling. The ECG signal, however, is two-dimensional, so modeling it is simpler than geological modeling. Moreover, the RBF efficiently represents a system with nonlinear properties, such as the ECG signal. The implemented method has been verified to have a very low reconstruction error and to retain the components important for diagnosis.

[Figure 2.] Typical electrocardiogram signal. X, extraction of dominant points.

This paper is organized as follows. Section 2 presents the general theory of RBF interpolation and the algorithm that obtains the vertices based on the curvature. Section 3 provides the experimental results of the proposed method. Section 4 concludes the paper.

2. Compression with Reconstruction via RBF Interpolation

   2.1 RBF Interpolation Overview

The RBF is an approximation model for multivariate interpolation that has been widely used in many branches of engineering. In this section, we introduce the basic theory of RBF interpolation [11,12]. We assume that a continuous function $f : \mathbb{R} \to \mathbb{R}$ is known only at a set of discrete points $X := \{x_1, \dots, x_N\}$ in $\Omega \subset \mathbb{R}$.

The RBF interpolation of $f$ on $X$ starts by choosing a basis function $\phi$. The interpolant is then defined as

$$R_{f,X}(x) = \sum_{i=1}^{m} b_i\, p_i(x) + \sum_{j=1}^{N} a_j\, \phi(x - x_j),$$

where $p_1, \dots, p_m$ is a basis for $\Pi_m$, the subspace consisting of all algebraic polynomials whose degree is less than $m$. The coefficients $a_j$ are chosen so that the following condition is satisfied:

$$\sum_{j=1}^{N} a_j\, p_i(x_j) = 0, \qquad i = 1, \dots, m.$$

Together with the interpolation conditions $R_{f,X}(x_j) = f(x_j)$, $j = 1, \dots, N$, the coefficients of $R_{f,X}$ must satisfy an $(N + m) \times (N + m)$ system of linear equations, which can be written in matrix form as

$$\begin{pmatrix} A & P \\ P^{T} & O \end{pmatrix} \begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} f \\ 0 \end{pmatrix}, \tag{1}$$

where $A$ and $P$ are the $N \times N$ and $N \times m$ matrices with elements $A_{ij} = \phi(x_i - x_j)$ and $P_{ij} = p_j(x_i)$, respectively, and $O$ is the $m \times m$ null matrix. Further, $a \in \mathbb{R}^{N}$ and $b \in \mathbb{R}^{m}$ are the coefficients of $R_{f,X}$, and the components of $f$ are the data $f(x_j)$ with $j = 1, \dots, N$.
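As a minimal sketch of the linear system described above, the following Python code assembles the block matrix from A and P and solves it by Gaussian elimination. The function names (`solve_linear`, `fit_rbf`, `evaluate_rbf`), the monomial basis p_j(x) = x^(j-1), and the idea of passing the basis function as a callable `phi` are illustrative assumptions, not the authors' implementation.

```python
import math


def solve_linear(M, rhs):
    """Solve M z = rhs by Gaussian elimination with partial pivoting."""
    n = len(M)
    aug = [row[:] + [r] for row, r in zip(M, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular system")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (aug[r][n] - s) / aug[r][r]
    return z


def fit_rbf(xs, values, phi, m):
    """Return (a, b) solving [[A, P], [P^T, O]] [a; b] = [f; 0]."""
    N = len(xs)
    A = [[phi(xi - xj) for xj in xs] for xi in xs]
    # Assumed polynomial basis: monomials 1, x, ..., x^(m-1).
    P = [[xi ** k for k in range(m)] for xi in xs]
    M = [A[i] + P[i] for i in range(N)]
    M += [[P[i][k] for i in range(N)] + [0.0] * m for k in range(m)]
    z = solve_linear(M, list(values) + [0.0] * m)
    return z[:N], z[N:]


def evaluate_rbf(x, xs, a, b, phi):
    """Evaluate the interpolant: polynomial part plus radial part."""
    poly = sum(bk * x ** k for k, bk in enumerate(b))
    radial = sum(aj * phi(x - xj) for aj, xj in zip(a, xs))
    return poly + radial


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0, 4.0]
    values = [0.0, 1.0, 0.0, -1.0, 0.0]
    phi = lambda r: math.sqrt(r * r + 1.0)  # illustrative choice only
    a, b = fit_rbf(xs, values, phi, m=1)
    print([round(evaluate_rbf(x, xs, a, b, phi), 6) for x in xs])
```

The last m rows of the assembled matrix enforce the side condition on the coefficients a, and the first N rows enforce interpolation at the nodes. A dense solve like this is only reasonable for the small number of points kept per cardiac cycle.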

   2.2 Vertex Selection Based on the Curvature

Curvature refers to the rate at which a curve or curved surface deviates from the straight line or plane tangent to it [13]. The curvature of the input ECG signal is calculated, and the points whose curvature exceeds a threshold value are selected as vertices. First, the original ECG signal is preprocessed with a 0.5-25 Hz band-pass filter to reduce noise [14], because a typical ECG signal contains 60 Hz power-line noise, baseline wander, and muscle noise. Then, a Gaussian low-pass filter (LPF) removes the remaining high-frequency noise from the preprocessed signal. The curvature used for vertex selection is calculated on this smoothed ECG signal.

The curve function with a time variable is represented by

O(t) = (s(t), v(t)),

where s(t) is the sample index at t, and v(t) is the signal voltage. The smoothed signal C(t, σ) is

C(t, σ) = (S(t, σ), V(t, σ)),

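where $S(t, \sigma)$ and $V(t, \sigma)$ are expressed as

$$S(t, \sigma) = s(t) \otimes g(t, \sigma), \qquad V(t, \sigma) = v(t) \otimes g(t, \sigma).$$

Here, $\otimes$ denotes convolution, and $g(t, \sigma)$ is a Gaussian function for smoothing with a standard deviation $\sigma$. Curvature $k(t, \sigma)$ based on the smoothed signal $C(t, \sigma)$ is calculated as

$$k(t, \sigma) = \frac{S'(t, \sigma)\, V''(t, \sigma) - S''(t, \sigma)\, V'(t, \sigma)}{\left(S'(t, \sigma)^{2} + V'(t, \sigma)^{2}\right)^{3/2}}, \tag{2}$$

where $S'(t, \sigma)$, $S''(t, \sigma)$, $V'(t, \sigma)$, and $V''(t, \sigma)$ are expressed as follows:

$$S'(t, \sigma) = s(t) \otimes g'(t, \sigma), \quad S''(t, \sigma) = s(t) \otimes g''(t, \sigma), \quad V'(t, \sigma) = v(t) \otimes g'(t, \sigma), \quad V''(t, \sigma) = v(t) \otimes g''(t, \sigma).$$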

Finally, the points whose curvature, calculated by Eq. (2), exceeds the threshold value are selected as vertices. Figure 3 shows the ECG signal restored from the selected vertices, which are marked with small square boxes.
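The vertex selection described in this section can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' code: the derivatives of the smoothed curve are approximated by central differences instead of convolution with Gaussian derivatives, the threshold is applied to the absolute curvature, the signal ends are extended by odd reflection, and all function names are hypothetical.

```python
import math


def gaussian_kernel(sigma):
    """Sampled, normalized Gaussian truncated at about 4 sigma."""
    radius = int(math.ceil(4 * sigma))
    g = [math.exp(-t * t / (2.0 * sigma * sigma))
         for t in range(-radius, radius + 1)]
    total = sum(g)
    return [w / total for w in g]


def smooth(x, sigma):
    """Convolve x with a Gaussian; ends are extended by odd reflection."""
    g = gaussian_kernel(sigma)
    radius = len(g) // 2
    n = len(x)

    def sample(i):
        if i < 0:
            return 2 * x[0] - x[min(-i, n - 1)]
        if i >= n:
            return 2 * x[-1] - x[max(2 * (n - 1) - i, 0)]
        return x[i]

    return [sum(g[k] * sample(i + radius - k) for k in range(len(g)))
            for i in range(n)]


def derivative(x):
    """Central differences, one-sided at the two ends."""
    n = len(x)
    d = [0.0] * n
    for i in range(1, n - 1):
        d[i] = (x[i + 1] - x[i - 1]) / 2.0
    d[0] = x[1] - x[0]
    d[-1] = x[-1] - x[-2]
    return d


def curvature(v, sigma):
    """Curvature of the smoothed curve (S, V), with s(t) the sample index."""
    s = list(range(len(v)))
    S, V = smooth(s, sigma), smooth(v, sigma)
    S1, V1 = derivative(S), derivative(V)
    S2, V2 = derivative(S1), derivative(V1)
    return [(s1 * v2 - s2 * v1) / (s1 * s1 + v1 * v1) ** 1.5
            for s1, s2, v1, v2 in zip(S1, S2, V1, V2)]


def select_vertices(v, sigma, threshold):
    """Indices whose absolute curvature exceeds the threshold."""
    return [i for i, k in enumerate(curvature(v, sigma)) if abs(k) > threshold]


if __name__ == "__main__":
    # Synthetic triangular bump standing in for a sharp wave.
    v = [max(0.0, 10.0 - abs(i - 25)) for i in range(51)]
    print(select_vertices(v, sigma=1.0, threshold=0.3))
```

Because the curvature depends on the units of both axes, the threshold and sigma would need tuning to the amplitude scaling of the actual recording.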

   2.3 Reconstruction via RBF Interpolation with the Selected Vertex

The compression process must retain all specific points in the ECG signal that are important for diagnosis. These points include the local maximum and minimum points of the ECG signal, such as the P, Q, R, S, and T waves [15]. The significant points selected in Section 2.2 are used as the parameters for signal reconstruction by RBF interpolation. Each selected vertex has a curvature value larger than the threshold value. However, the ECG signal restored from the curvature-based vertices alone is not sufficiently complete because it is excessively distorted relative to the original ECG signal. Therefore, to make the RBF interpolation effective, zero crossing points of the sampled ECG data are selected as supplemental points and added to the initially selected vertices. The block diagram of the overall compression and reconstruction processes is shown in Figure 4.

[Figure 3.] Example of electrocardiogram (ECG) signal with vertices based on curvature.

[Figure 4.] Block diagram of the compression and reconstruction processes.
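The supplementation step can be illustrated with a short Python sketch. It assumes that the baseline of the filtered signal sits at zero, detects a crossing as a sign change between successive nonzero samples, and also keeps the first and last samples so that the interpolant never has to extrapolate; these choices and the function names are assumptions made for illustration.

```python
def zero_crossings(signal, baseline=0.0):
    """Indices of the first sample after each sign change around the baseline."""
    idx = []
    prev_sign = 0
    for i, value in enumerate(signal):
        d = value - baseline
        sign = (d > 0) - (d < 0)
        if sign != 0:
            if prev_sign != 0 and sign != prev_sign:
                idx.append(i)
            prev_sign = sign
    return idx


def merge_points(vertices, crossings, n_samples):
    """Sorted union of vertices, zero crossings, and the two end samples."""
    return sorted(set(vertices) | set(crossings) | {0, n_samples - 1})


if __name__ == "__main__":
    signal = [1.0, 0.5, -0.5, -1.0, 0.2]
    crossings = zero_crossings(signal)
    print(crossings)
    print(merge_points([1], crossings, len(signal)))
```

The merged index list, together with the signal values at those indices, is what would be stored or transmitted as the compressed data.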

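For a given set of sample data at the points $X = \{x_1, x_2, \dots, x_N\}$, the coefficients of $R_{f,X}$ used for signal reconstruction are obtained from Eq. (1) as follows:

$$\begin{pmatrix} a \\ b \end{pmatrix} = \begin{pmatrix} A & P \\ P^{T} & O \end{pmatrix}^{-1} \begin{pmatrix} f \\ 0 \end{pmatrix},$$

where $A$ and $P$ are expressed as

$$A_{ij} = \phi(x_i - x_j), \quad i, j = 1, \dots, N, \qquad P_{ij} = p_j(x_i), \quad i = 1, \dots, N, \ j = 1, \dots, m.$$

In this study, we use the multiquadric as the basis function $\phi$ among the many available RBFs. One reason for choosing this particular function is that its parameters can serve as tension parameters: the values of the constants $\lambda$ and $\beta$ affect the smoothness of the reconstructed curve. The $p_i(x)$ are algebraic polynomials whose degree is less than $m$.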
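To make the role of the tension parameter concrete, the short derivation below assumes the commonly used generalized multiquadric form. The exact form used by the authors may differ, so this is an illustration of why such a parameter controls smoothness rather than a statement of their formula.

```latex
\phi(r) = \left(r^{2} + \lambda^{2}\right)^{\beta/2}
\quad\Longrightarrow\quad
\beta = 1:\ \ \phi(r) = \sqrt{r^{2} + \lambda^{2}},
\qquad
\lim_{\lambda \to 0} \sqrt{r^{2} + \lambda^{2}} = |r| .
```

In one dimension, an interpolant built from $\phi(r) = |r|$ plus a constant is piecewise linear between the nodes, so under this assumed form a small $\lambda$ approaches linear interpolation, whereas a larger $\lambda$ rounds the corners at the nodes and produces a smoother curve.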

3. Experimental Results and Discussions

In this study, the proposed method is evaluated on the MIT-BIH arrhythmia database [16]. The records in this database are sampled at 360 Hz, and each is 1800 s long. The personal computer used in the experiment has a 2.40 GHz Core2 Quad CPU and 2 GB of memory, and the method is implemented in MATLAB R2010a. The ECG signal thus has a sampling period of approximately 0.0028 s. We process the signal in segments that match the cardiac cycles. To identify the cardiac cycles in the ECG signal, R wave detection is performed as a preliminary step.
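The text does not specify which R wave detector is used, so the following Python sketch is only a stand-in for the preliminary segmentation step. It assumes positive R peaks, an amplitude threshold relative to the signal maximum, and a refractory interval. The names `detect_r_peaks` and `split_cycles` and both default parameters are hypothetical.

```python
def detect_r_peaks(signal, fs, threshold_ratio=0.6, refractory_s=0.2):
    """Local maxima above threshold_ratio * max(signal), kept at least
    refractory_s seconds apart (the larger of two close peaks wins)."""
    threshold = threshold_ratio * max(signal)
    min_gap = int(refractory_s * fs)
    peaks = []
    for i in range(1, len(signal) - 1):
        if signal[i] < threshold:
            continue
        if signal[i] >= signal[i - 1] and signal[i] > signal[i + 1]:
            if peaks and i - peaks[-1] < min_gap:
                if signal[i] > signal[peaks[-1]]:
                    peaks[-1] = i
            else:
                peaks.append(i)
    return peaks


def split_cycles(signal, peaks):
    """Cut the signal into R-to-R segments, one per cardiac cycle."""
    return [signal[a:b] for a, b in zip(peaks, peaks[1:])]


if __name__ == "__main__":
    fs = 360  # sampling frequency in Hz, as stated for the database
    print(round(1.0 / fs, 4))  # sampling period in seconds
    # Synthetic spike train standing in for R waves.
    signal = [0.0] * 1000
    for p in (100, 400, 700):
        signal[p] = 1.0
    peaks = detect_r_peaks(signal, fs)
    print(peaks, [len(c) for c in split_cycles(signal, peaks)])
```

A practical detector would add band-pass filtering and adaptive thresholds. The sketch only shows how detected peaks turn into per-cycle segments.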

To evaluate the performance of the proposed method, we use the PRD together with the compression points ratio (CPR), which takes the place of the CR. The CPR is defined as the ratio between the number of samples in an average cycle period of the original signal and the number of sample points in the compressed signal. The CPR and PRD are defined as follows:

$$\mathrm{CPR} = \frac{N_{O(t)}}{N_{R(t)}}, \qquad \mathrm{PRD} = \sqrt{\frac{\sum_{t} \left(O(t) - R(t)\right)^{2}}{\sum_{t} O(t)^{2}}} \times 100,$$

where $N_{O(t)}$ and $N_{R(t)}$ are the numbers of points in the original and compressed ECG signals, respectively, and $O(t)$ and $R(t)$ are the original and reconstructed signals, respectively. It has been established that a reconstructed signal with a PRD between 0% and 2% belongs to the “very good” quality group, and one with a PRD between 2% and 9% belongs to the “good” quality group [17,18].
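The two metrics can be computed with a few lines of Python. The sketch below assumes the common PRD form without mean removal (root of the squared error divided by the signal energy, times 100). The helper `quality_group` and the handling of the 2% boundary are illustrative assumptions.

```python
import math


def cpr(n_original, n_compressed):
    """Compression points ratio: original samples per kept sample point."""
    return n_original / n_compressed


def prd(original, reconstructed):
    """Percentage of root mean square difference (no mean removal)."""
    num = sum((o - r) ** 2 for o, r in zip(original, reconstructed))
    den = sum(o ** 2 for o in original)
    return 100.0 * math.sqrt(num / den)


def quality_group(prd_value):
    """Quality groups from the PRD ranges quoted in the text."""
    if prd_value < 2.0:
        return "very good"
    if prd_value <= 9.0:
        return "good"
    return "outside the quoted ranges"


if __name__ == "__main__":
    print(cpr(300, 50))                  # illustrative counts only
    print(prd([3.0, 4.0], [3.0, 3.0]))   # toy signals, not ECG data
    print(quality_group(1.8036), quality_group(16.3946))
```

Applied to the PRD values reported for Figure 6, the helper places the RBF reconstruction in the “very good” group and the linear reconstruction outside both quoted ranges.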

Figure 5 shows the signal reconstructed by RBF interpolation using (a) the dataset of vertices based on the curvature and (b) the dataset supplemented with the zero crossing points. The signal reconstructed from the curvature-based vertices alone is not sufficiently complete because it is excessively distorted relative to the original signal. It therefore needs supplementary points that benefit the RBF interpolation, and we select the zero crossing points as these additional points. We used the multiquadric as the basis function of the RBF together with third-order algebraic polynomials. A further issue in reconstructing the curve with multiquadrics is the choice of the tension parameters: λ and β in the basis function of Eq. (3) can be tuned for smoothness. In our experiment, the reconstruction error fell within the “very good” quality group for approximately 7 ≤ λ ≤ 20 with β fixed at 1.

[Figure 5.] Signals reconstructed by radial basis function interpolation from (a) vertices based on the curvature and (b) data supplemented with zero crossing points, for Massachusetts Institute of Technology and Boston’s Beth Israel Hospital record 100.

Figures 6(a) and 6(b) show the signals reconstructed by linear and RBF interpolation, respectively, for the MLII signal of record 103 in the MIT-BIH arrhythmia database. The blue line is the original ECG signal, and the red line is the reconstructed signal.

[Figure 6.] Reconstructed ECG signal via (a) linear interpolation (PRD = 16.3946) and (b) RBF interpolation (PRD = 1.8036) on MIT-BIH record 103. CPR = 6.0588, λ = 13, and β = 1. CPR, compression points ratio; ECG, electrocardiogram; MIT-BIH, Massachusetts Institute of Technology and Boston’s Beth Israel Hospital; PRD, percentage of root mean square difference; RBF, radial basis function.

We compared the proposed reconstruction with linear interpolation, and the implemented method clearly provided better performance and a lower restoration error, as shown in Table 1. When applied to MIT-BIH arrhythmia record 100, RBF interpolation achieved its best performance, PRD = 1.5400, at λ = 13, which is within the “very good” quality range and lower than the PRD of linear interpolation. Moreover, the restored ECG signal retained the characteristic components of the ECG signal, such as the P wave, QRS complex, and T wave, so the morphological information of the original signal was well preserved.
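For reference, the linear-interpolation baseline used in this comparison can be sketched as follows. The function name and the choice to hold the end values outside the kept range are assumptions; the kept indices are expected to be sorted and distinct.

```python
def linear_reconstruct(indices, values, n_samples):
    """Piecewise-linear reconstruction on sample indices 0..n_samples-1
    from kept points given as sorted indices with their signal values."""
    out = [float(values[0])] * n_samples  # hold first value before first point
    for k in range(len(indices) - 1):
        i0, i1 = indices[k], indices[k + 1]
        v0, v1 = values[k], values[k + 1]
        for i in range(i0, i1 + 1):
            out[i] = v0 + (v1 - v0) * (i - i0) / (i1 - i0)
    for i in range(indices[-1], n_samples):
        out[i] = float(values[-1])        # hold last value after last point
    return out


if __name__ == "__main__":
    print(linear_reconstruct([0, 2, 4], [0.0, 2.0, 0.0], 5))
```

Straight segments cut off the rounded peaks of the waves between kept points, which is one plausible reason the linear baseline shows a much larger PRD at the same CPR.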

4. Conclusion

The ECG signal must be represented efficiently for storage, processing, and transmission, and its data must retain the components important for diagnosis, such as the P wave, QRS complex, and T wave. This study has demonstrated an efficient method that reconstructs the transmitted ECG signal from a minimal amount of compressed data. The ECG data were compressed to selected sample points: the vertices with large curvature, supplemented with the zero crossing points of the original signal. The signal was then reconstructed by RBF interpolation. The multiquadric function was used as the basis function because it has tension parameters that influence the smoothness. The reliability of the proposed method was verified by a very low reconstruction error together with preservation of the diagnostic features. Improving the compression rate of vertex-based RBF interpolation while retaining the diagnostic components is left as a topic for future research.

[Table 1.] Performance comparison of the RBF and linear interpolations using the proposed algorithm

Conflict of Interest

No potential conflict of interest relevant to this article was reported.

References

1. Enseleit F., Duru F. 2006 “Long-term continuous external electrocardiographic recording: a review” [Europace] Vol.8 P.255-266
2. Schijvenaars B. J. A., Van Herpen G., Kors J. A. 2008 “Intraindividual variability in electrocardiograms” [Journal of Electrocardiology] Vol.41 P.190-196
3. Yang W. S., Hwang K. S., Lee K. M., Lee K. M., Kim W. J., Yun S. J. 2007 “Requirement analysis and architecture design for ubiquitous healthcare service systems” [International Journal of Fuzzy Logic and Intelligent Systems] Vol.7 P.209-215
4. Cox J. R., Nolle F. M., Fozzard H. A., Oliver G. C. 1968 “AZTEC, a preprocessing program for real-time ECG rhythm analysis” [IEEE Transactions on Biomedical Engineering] Vol.BME-15 P.128-129
5. Jalaleddine S. M. S., Hutchens C. G., Strattan R. D., Coberly W. A. 1990 “ECG data compression techniques - a unified approach” [IEEE Transactions on Biomedical Engineering] Vol.37 P.329-343
6. Abenstein J. P., Tompkins W. J. 1982 “A new data-reduction algorithm for real-time ECG analysis” [IEEE Transactions on Biomedical Engineering] Vol.BME-29 P.43-48
7. Bradie B. D. 1996 “Wavelet packet-based compression of single lead ECG” [IEEE Transactions on Biomedical Engineering] Vol.43 P.493-501
8. Reddy B. R. S., Murthy I. S. N. 1986 “ECG data compression using Fourier descriptors” [IEEE Transactions on Biomedical Engineering] Vol.BME-33 P.428-434
9. Olmos S., Millan M., Garcia J. I., Laguna P. 1996 “ECG data compression with the Karhunen-Loeve transform” [in Proceedings of Conference on Computers in Cardiology] P.253-256
10. Hardy R. L. 1990 “Theory and applications of the multiquadric-biharmonic method: 20 years of discovery 1968-1988” [Computers & Mathematics with Applications] Vol.19 P.163-208
11. Dyn N. 1989 “Interpolation and approximation by radial and related functions” in Approximation Theory VI, C. K. Chui, L. L. Schumaker, and J. D. Ward, Eds. P.211-234
12. Lee B. G., Lee Y. J., Yoon J. 2006 “Stationary binary subdivision schemes using radial basis function interpolation” [Advances in Computational Mathematics] Vol.25 P.57-72
13. Kim T. H., Kim S. W., Ryu C. H., Yun B. J., Kim J. H., Choi B. J., Park K. H. 2010 “ECG signal compression using feature points based on curvature” [Journal of Korea Intelligent Information Society] Vol.20 P.624-630
14. Alesanco A., Garcia J. 2008 “Automatic real time ECG coding methodology guaranteeing signal interpretation quality” [IEEE Transactions on Biomedical Engineering] Vol.55 P.2519-2527
15. Kohler B. U., Hennig C., Orglmeister R. 2002 “The principles of software QRS detection” [IEEE Engineering in Medicine and Biology Magazine] Vol.21 P.42-57
16. Moody G. 1992 MIT-BIH Arrhythmia Database CD-ROM: overview
17. Zigel Y., Cohen A., Katz A. 2000 “The weighted diagnostic distortion (WDD) measure for the ECG signal compression” [IEEE Transactions on Biomedical Engineering] Vol.47 P.1422-1430
18. Lee S. I., Lee S. Y. 2008 “Integration of user profiles and real-time context information reflecting time-based changes for the recommendation system” [International Journal of Fuzzy Logic and Intelligent Systems] Vol.8 P.276-283
