Hybrid Approach for Denoising and Segmentation: N2S with Swin Transformer-Enhanced U-Net

Ashwini G.*, Ramashri T.**
*, ** Department of Electronics and Communication Engineering, Sri Venkateswara University College of Engineering, Tirupati, Andhra Pradesh, India.
Periodicity: January-March 2025
DOI : https://doi.org/10.26634/jip.12.1.21658

Abstract

Accurate segmentation in medical imaging, particularly for modalities such as chest X-rays, CT scans, and microscopic images, is critical for diagnosis and treatment, yet noisy, low-quality data can significantly degrade performance. This paper presents a novel framework that integrates Noise2Split denoising with a hybrid Swin Transformer U-Net to improve segmentation accuracy on these challenging medical imaging tasks. By combining Noise2Split's effective noise reduction with the Swin Transformer's feature extraction and U-Net's robust segmentation architecture, the model addresses both the noise and the segmentation challenges. The Swin Transformer captures both local and global context, while U-Net's skip connections help recover detailed high-resolution features. Extensive experiments on chest X-rays, CT scans, and microscopic images demonstrate that the integrated model outperforms traditional methods in segmentation accuracy, making it a valuable tool for clinical applications where imaging quality is compromised.
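To make the two-stage design concrete, the following is a minimal, hedged sketch of a denoise-then-segment pipeline in plain NumPy. The `noise2split_style_denoise` function only mimics the blind-spot idea behind self-supervised denoisers (mask a random half of the pixels and infill each from its unmasked 3x3 neighbourhood); it is not the authors' Noise2Split network. Likewise, a fixed-threshold `segment` function stands in for the learned Swin Transformer U-Net head. All function names here are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def noise2split_style_denoise(img, rng):
    # Illustrative stand-in for Noise2Split: mask ~half the pixels at
    # random and replace each masked pixel with the mean of its 3x3
    # neighbourhood, echoing the split/blind-spot principle of
    # self-supervised denoising. NOT the authors' actual network.
    mask = rng.random(img.shape) < 0.5
    padded = np.pad(img, 1, mode="reflect")
    out = img.astype(float).copy()
    h, w = img.shape
    for i in range(h):
        for j in range(w):
            if mask[i, j]:
                out[i, j] = padded[i:i + 3, j:j + 3].mean()
    return out

def segment(img, thresh=0.5):
    # Placeholder for the Swin Transformer U-Net segmentation head:
    # a global threshold produces a binary mask.
    return (img > thresh).astype(np.uint8)

def pipeline(noisy, rng):
    # Stage 1: denoise; Stage 2: segment the denoised image.
    return segment(noise2split_style_denoise(noisy, rng))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = np.zeros((32, 32))
    clean[8:24, 8:24] = 1.0                      # bright "lesion"
    noisy = np.clip(clean + rng.normal(0, 0.1, clean.shape), 0, 1)
    seg = pipeline(noisy, rng)
    print(seg[10:20, 10:20].mean())              # interior mostly 1
```

In the actual framework both stages are learned jointly from noisy data, so the denoiser adapts to the modality-specific noise and the segmenter benefits from the cleaned features; this sketch only fixes the data flow between the two stages.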

Keywords

Image Segmentation, N2S, Denoising, Swin Transformer, U-Net, Deep Learning.

How to Cite this Article?

Ashwini, G., and Ramashri, T. (2025). Hybrid approach for denoising and segmentation: N2S with Swin Transformer-enhanced U-Net. i-manager's Journal on Image Processing, 12(1), 50-62. https://doi.org/10.26634/jip.12.1.21658
