References
[1]. Boll, S. (1979). Suppression of acoustic noise in speech
using spectral subtraction. IEEE Transactions on Acoustics,
Speech, and Signal Processing, 27(2), 113-120. https://doi.
org/10.1109/TASSP.1979.1163209
[2]. Chugh, A., Rana, P., & Rana, S. (2014). Speech
recognition system using wavelet transform. International
Journal of Computer Science and Mobile Computing,
3(8), 63-71.
[3]. Gupta, V. K., Bhowmick, A., Chandra, M., & Sharan, S.
N. (2011, February). Speech enhancement using MMSE
estimation and spectral subtraction methods. In 2011,
International Conference on Devices and Communications
(ICDeCom) (pp. 1-5). IEEE. https://doi.org/10.1109/ICDE
COM.2011.5738532
[4]. Hidayat, R., Bejo, A., Sumaryono, S., & Winursito, A.
(2018, July). Denoising speech for MFCC feature extraction
using wavelet transformation in speech recognition
system. In 2018, 10th International Conference on
Information Technology and Electrical Engineering (ICITEE)
(pp. 280-284). IEEE. https://doi.org/10.1109/ICITEED.2018.
8534807
[5]. Hirsch, H. G., & Ehrlicher, C. (1995, May). Noise
estimation techniques for robust speech recognition. In
1995, International Conference on Acoustics, Speech,
and Signal Processing (Vol. 1, pp. 153-156). IEEE.
https://doi.org/ 10.1109/ICASSP.1995.479387
[6]. Karray, L., & Martin, A. (2003). Towards improving
speech detection robustness for speech recognition in
adverse conditions. Speech Communication, 40(3), 261-
276. https://doi.org/10.1016/S0167-6393(02)00066-3
[7]. Kinnunen, T., Saeidi, R., Sedlák, F., Lee, K. A., Sandberg,
J., Hansson-Sandsten, M., & Li, H. (2012). Low-variance
multitaper MFCC features: A case study in robust speaker
verification. IEEE Transactions on Audio, Speech, and
Language Processing, 20(7), 1990-2001. https://doi.org/
10.1109/TASL.2012.2191960
[8]. Lockwood, P., Boudy, J., & Blanchet, M. (1992, March).
Non-linear spectral subtraction (NSS) and hidden Markov models for robust speech recognition in car noise
environments. In Acoustics, Speech, and Signal
Processing, IEEE International Conference (Vol. 1, pp. 265-
268). IEEE Computer Society. https://doi.ieeecomputer
society.org/10.1109/ICASSP.1992.225921
[9]. Martin, R. (1994). Spectral subtraction based on
minimum statistics. In Proceedings of European Signal
Processing (pp.1182-1185).
[10]. Martin, R. (2001). Noise power spectral density
estimation based on optimal smoothing and minimum
statistics. IEEE Transactions on Speech and Audio
Processing, 9(5), 504-512. https://doi.org/10.1109/89.928915
[11]. Ris, C., & Dupont, S. (2001). Assessing local noise level
estimation methods: Applications to noise robust ASR.
Speech Communication, 34(1-2), 141-158. https://doi.
org/10.1016/S0167-6393(00)00051-0
[12]. Shao, Y., & Chang, C. H. (2010). Bayesian separation
with sparsity promotion in perceptual wavelet domain for
speech enhancement and hybrid speech recognition.
IEEE Transactions on Systems, Man, and Cybernetics-Part
A: Systems and Humans, 41(2), 284-293. https://doi.org/
10.1109/TSMCA.2010.2069094