ECG Data Compression Using Modified Cortes

* P.G Scholar, Department of Electrical Engineering, M.M.M. University of Technology, Gorakhpur, India.

** Professor, Department of Electrical Engineering, M.M.M. University of Technology, Gorakhpur, India.

Abstract

Electrocardiogram (ECG) is a graphical illustration of the cardiac cycle as produced by an electrocardiograph. ECG recordings are indispensable when it comes to monitoring critical cardiac patients, astronauts etc. However, this around-the-clock surveillance results in voluminous ECG data, which becomes difficult to handle. Thus, the basic requisities of minimal usage of data storage space and speedy transmission over channels in tele-medicine fostered research in the field of ECG Data Compression. So far numerous techniques under Direct Data methods of compression like Turning Point (TP), Amplitude Zone Time Epoch Coding (AZTEC), Coordinate Reduction Time Encoding System (CORTES), Scan Along Polygonal Approximation (SAPA), Fan etc. have served the purpose fairly. This work tends to amalgamate the TP and modified AZTEC techniques, providing an efficient hybrid algorithm for compression.

Keywords :

ECG,
Data,
Compression,
Hybrid,
TP,
Modified AZTEC.

Introduction

Electrocardiography is a canonical procedure in these days, primarily because of an incessant growth in the number of heart patients. Since, modern day health facilities solely rely on software to handle cardiologic data of patients, a need is felt to satisfy the primary requirements of minimal storage space for data and its rapid transmission. With a variety of compression algorithms developed so far, we have achieved it to quite an extent but still there is scope for improvement.

For a compression algorithm to be competent in every aspect it should satisfy the three R's (i) Represent (ii) Retain and (iii) Reconstruct. By 'Representation' we mean that the algorithm must be equipped to compress long continuous ECG signals using few samples [5]. 'Retaining' pertains to the fact that the compressed data must retain the basic diagnostic information about the patient [10] and 'Reconstruction' here, implies that the algorithm must reproduce the original data from the compressed file with negligible loss. Most compression algorithms live up to the first two requirements but lag when it comes to data reconstruction. We use evaluation measures like Compression Ratio (CR) [2, 9] and Percent Root Mean Square Difference (PRD) [3]to estimate the genuineness of a particular algorithm. However, these are not the only testing parameters.

A very wide range of compression algorithms are available today. Wavelets, Parameter extraction, DDC all have played a vital role in the field of ECG compression. DDC or Direct Data Compression Technique alone has about eight to ten methods enlisted under it [3]. To name a popular few we have Turning Point (TP), Amplitude Zone Time Epoch Coding (AZTEC), modified AZTEC, Coordinate Reduction Time Encoding System (CORTES), Scan Along Polygonal Approximation (SAPA) [7], Fan [8] etc. These time domain techniques have not only provided good processing speed but also high compression ratios. Compression ratios as high as 10:1 have been obtained.

So far these techniques, when individually implemented have undoubtedly performed well but lately, intra-domain hybrids have gained grounds. The reason being that hybrids not only amalgamate but also incorporate the best features of the fundamental algorithms. The CORTES is one such concoction which successfully mixes the TP and AZTEC algorithms. It uses the high compression ratio of the AZTEC technique and at the same time claims high accuracy of the TP method. The current paper deals with a hybrid of TP and modified AZTEC technique. The Modified AZTEC technique is different from AZTEC in the respect that it is self- adaptive in nature and has a variable threshold unlike the AZTEC [4]. These techniques have been intertwined on the lines of CORTES algorithm and certain enhancements have been incorporated which have resulted in a better compression ratio and a reduced PRD.

1. Basic Algorithms- Tp and Aztec

1.1 Turning Point

This DDC technique works with only odd samples of ECG signal. It halves the sampling frequency. The waveform is reconstructed using interpolation with 2, which increases the sampling rate of compressed sequence by a factor of 2.

1.2 Modified AZTEC Technique

This is an extension of the AZTEC technique. It employs a variable threshold which makes it a self adaptive algorithm [1]. Self-adaptive here implies that the algorithm is intelligent enough to use a higher threshold value for low information region and vice-versa. Higher threshold in low information region i.e. (region with less significant data and more redundant data) ensures a higher compression ratio, thereby eliminating more redundancies. For similar reasons we have low threshold for high information region.

2. Modified Cortes Algorithm

2.1 Methodology

The CORTES method determines on the basis of the length of the line produced post AZTEC as to which of the TP or AZTEC processed results will be saved. This length is compared to a threshold value. If the length is greater than the threshold the AZTEC data is saved else the outcome of TP is recorded. However, this technique gave a CR of approximately 4.8:1 for a PRD of 7 as evaluated by J.P. Abenstein [11]. Hence, a need was felt to further improve the CR of CORTES without tampering much with the PRD.

2.2 Algorithm

The proposed hybrid method works similar to the CORTES algorithm. The only difference is that the original algorithm used AZTEC and TP methods. However, in this work we have attempted to amalgamate TP with the dynamic modified AZTEC algorithm.

The steps are as follows:

(1)

(2)

If sign=1, the slope is assumed to be going up else for sign=0, the slope goes down. For the case when both the directions are the same, it is assumed that the next sample has the final elevation of the slope. Also, one more slope is added to the slope length. This is followed by the resetting of the statistical parameters. Case 3: For ln=1, the entire process for case 2 is repeated except that here it is assumed that the sample after the next has the final elevation of the slope. Also, two more samples are added to the slope length. The slope mode is then terminated. If ln > 3, amplitude (mean) and length of plateaus are computed

ECG samples are available.
Two sets of variables each are initialized for modified AZTEC and TP.
The process begins by running an iterative loop n-2 times where 'n' is the length of ECG signal used.
The ECG data is processed using modified AZTEC algorithm (both horizontal plateaus and slopes are produced). The statistical parameters like mean, moment, threshold and also length 'ln' are initialized. Then the first sample sets the limits as x_min and x_max . For iterative variable i > 1, the statistical parameters are updated using their respective dynamic formulae. Meanwhile, if the subsequent samples exceed the limits as x_min and x_max they replace the previous limits [1]. The difference between the current maximum and minimum values is calculated. If the difference exceeds the threshold or ln=50, the slope mode is started. The slope mode is executed using three cases. Case 1: ln=3, the final elevation of the slope is saved. Case2: For ln=2, the next and previous directions of slope are determined using expressions.
PRD of compressed set of samples( both plateaus and slopes) is calculated. If PRD does not exceed a certain empirical level, in this case 10 %, the current amplitude and length of the compressed signal are stored in the compecg(p) and l(p) variables respectively and the statistical parameters (mean, third moment and threshold) are reset.
Else, the same data is then processed with TP algorithm using three samples at a time. The amplitude and length of the compressed signal which is 2 in case of TP are saved in the same variables compecg(p) and l(p) respectively.
The compression algorithm concludes and the final values of amplitude and length of the compressed signal are saved.

The flowchart for the complete process is illustrated in Figure 1.

Figure 1. Flowchart for Modified CORTES

The TP part is reconstructed using interpolation by 2. For the modified AZTEC part, reconstruction of slopes and plateaus is carried out separately. For slopes, the intermediate points are calculated and interpolated. The plateaus are reconstructed by repeating the same amplitude for fixed number of times.

3. Test Results and Discussions

Previously, the CORTES method saved either the results of AZTEC or TP data merely on the basis of the length of the plateau produced after AZTEC compression. Slopes had no role to play.

However, in the modified CORTES scheme, both plateaus and slopes have been considered determining factors. The results have verified that this step has only proved beneficial in all respects. Not only did it boost the Compression Ratio but it also immensely reduced the PRD giving us both small compressed file size and accurately reconstructed ECG signal.

The program was tested in the Matlab environment and the results have been summarized in the Table 1. For the section with modified AZTEC the authors used an initial threshold of 0.05 and initialized C₁ as 1 and C₂ as 0.3 in the expressions for criterion function and threshold respectively, for optimum results. Also, an initial empirical level for PRD is set as 10% so that even in the worst of cases the deviation of the reconstructed signal from the original one doesn't exceed the specified level. The authors have obtained compression ratio as high as 19 without compromising the quality of the reconstructed signal as the corresponding PRD is quite low. Thus, the modified CORTES scheme successfully satisfies the basic requisities of small compressed file size and accurate signal reconstruction.

Table 1. Summary of the results obtained for D- 00001.DCD

The corresponding Matlab waveforms are obtained and illustrated: Figure 2 to Figure 11 show two sets of MATLAB waveforms, original and reconstructed. The original waveforms are signals obtained from cse database. The signals are first compressed and then reconstructed using modified CORTES.

Figure 2. Original and Reconstructed Waveforms using Modified CORTES for Data 2L2.DIG

Figure 3. Original and Reconstructed Waveforms using Modified CORTES for Data 2V4.DIG

Figure 4. Original and Reconstructed Waveforms using Modified CORTES for Data 1AL.DIG

Figure 5. Original and Reconstructed Waveforms using Modified CORTES for Data 1AR.DIG

Figure 6. Original and Reconstructed Waveforms using Modified CORTES for Data 2V1.DIG

Figure 7. Original and Reconstructed Waveforms using Modified CORTES for Data 1L3.DIG

Figure 8. Original and Reconstructed Waveforms using Modified CORTES for Data 2L3.DIG

Figure 9. Original and Reconstructed Waveforms using Modified CORTES for Data 1V6.DIG

Figure 10. Original and Reconstructed Waveforms using Modified CORTES for Data 2AF.DIG

Figure 11. Original and Reconstructed Waveforms using Modified CORTES for Data 1V2.DIG

In each case, i.e from Figure 2 to Figure 11, the first graph depicts the original ECG signal with 5000 samples and an amplitude ranging between -3 to 3 mV. The second graph illustrates the reconstructed ECG using modified CORTES.

Conclusions

The original CORTES scheme though a hybrid provided a limited compression ratio up to 5. However, with minute changes made in the algorithm, the authors obtained better results. Incorporating the dynamic modified AZTEC scheme in place of the simple AZTEC, made the algorithm self adaptive and reliable. The modified CORTES scheme used both slopes and plateaus to determine and save the significant data unlike the original scheme. Also, an initially set value of PRD helped us keep a check on the signal reconstruction. This helped to achieve very high compression ratios for remarkably low PRDs.

Acknowledgments

The authors are very much thankful to the UGC (MRP) project which is in progress in the Department of Electrical Engineering for providing us some of the ECG data which were used for validating the modified CORTES scheme.

References

[1]. B. Furht, Alex Perez, (1988). “An Adaptive Real-Time ECG Compression Algorithm with Variable Threshold”, IEEE, Vol. 35, pp 489-494.

[2]. C. A. Andrews, J. M. Davies, and G. R. Schwarz, (1967). “Adaptive data compression,” Proc. IEEE, Vol. 5, pp. 267- 277.

[3]. Sateh M.S. Jalaleddine, Chriswell G. Hutchens and Robert D. Strattan, (1990). “ECG Data Compression Techniques-A Unified Approach”, Vol 37, No. 4.

[4]. J.R. Cox, F.M. Noelle (1968) “AZTEC, a Preprocessing Program for Real-Time ECG Rhythm Analysis”, IEEE, pp 128- 129, April.

[5]. C. M. Kortman, (1967). “Redundancy reduction-a practical method of data compression,” Proc. IEEE, Vol. 55, pp. 253-263.

[6]. W. C. Mueller, (1978). “Arrhythmia detection program for an ambulatory ECG monitor,” Biomed. Sci. Instrument., Vol. 14, pp. 81-85.

[7]. M. Ishijima, S. B. Shin, G. H. Hostetter, and J. Sklansky, (1983). “Scan-along polygon approximation for data compression of electrocar- diograms,” IEEE Trans. Biomed. Eng., Vol. BME-30, pp. 723-729.

[8]. L. D. Davisson, (1967). “The Fan method of data compression,” 1966 Goddard summer workshop, NASA TM X-55742, X-700-67-94, Final Rep., pp. 23-30.

[9]. Indu Saini and Priyanka, (2013). “Analysis ECG Data Compression Techniques- A Survey Approach,” Vol 3, pp. 544-548.

[10]. Anand Kumar Patwari and Durgesh Pansari, (2014). “Analysis of ECG Signal Compression Technique Using Discrete Wavelet Transform for Different Wavelets,” Vol. 8, pp. 168-173.

[11]. J. P. Abenstein and W. J. Tompkins, (1982). “New datareduction algorithm for real-time ECG analysis,” IEEE Trans. Biomed. Eng., Vol. BME-29, pp. 43-48.