TY - JOUR
T1 - CMAC-Net
T2 - Cascade Multi-Scale Attention Convolution Network for diabetic retinopathy lesion segmentation
AU - Chen, Haowei
AU - Yin, Ming
AU - Guo, Yi
AU - Soomro, Toufique Ahmed
PY - 2026/2
Y1 - 2026/2
N2 - Automatic diabetic retinopathy (DR) lesions is of great importance to assist ophthalmologists in making accurate diagnoses. Currently, the accurate segmentation of fundus lesions is challenged by the large differences in the shape and size of the lesions, and their concentrated locations of different lesions. Many deep learning methods based on convolutional neural networks (CNN) have been proposed to handle the task of segmentation of fundus lesion. However, these methods still have two limitations: (1) lack of backbone networks targeting the characteristics of fundus images, and limited ability to extract lesions of different scales; (2) with semantic conflicts when fusing multi-level information from nonadjacent layers. To this end, we propose a Cascade Multi-Scale Attention Convolution Network (CMAC-Net) for diabetic retinopathy lesion segmentation. In particular, Mobile Dynamic Attention Convolution (MDAC) is adopted as backbone network integrating with Cascade Progressive Context Fusion (CPCF) module in a U-Net framework. CMAC-Net has the following advantages: (1) MDAC preserves high-resolution features, leading to high segmentation accuracy of minor lesions; (2) MDAC uses large kernel convolution to separate the large and small lesions at the same time; (3) CPCF aims to fuse the semantic information from different levels, avoiding the inconsistent parts. The experimental results on three datasets show that our model outperforms the state-of-the-art, in terms of several metrics.
AB - Automatic diabetic retinopathy (DR) lesions is of great importance to assist ophthalmologists in making accurate diagnoses. Currently, the accurate segmentation of fundus lesions is challenged by the large differences in the shape and size of the lesions, and their concentrated locations of different lesions. Many deep learning methods based on convolutional neural networks (CNN) have been proposed to handle the task of segmentation of fundus lesion. However, these methods still have two limitations: (1) lack of backbone networks targeting the characteristics of fundus images, and limited ability to extract lesions of different scales; (2) with semantic conflicts when fusing multi-level information from nonadjacent layers. To this end, we propose a Cascade Multi-Scale Attention Convolution Network (CMAC-Net) for diabetic retinopathy lesion segmentation. In particular, Mobile Dynamic Attention Convolution (MDAC) is adopted as backbone network integrating with Cascade Progressive Context Fusion (CPCF) module in a U-Net framework. CMAC-Net has the following advantages: (1) MDAC preserves high-resolution features, leading to high segmentation accuracy of minor lesions; (2) MDAC uses large kernel convolution to separate the large and small lesions at the same time; (3) CPCF aims to fuse the semantic information from different levels, avoiding the inconsistent parts. The experimental results on three datasets show that our model outperforms the state-of-the-art, in terms of several metrics.
KW - Convolution
KW - Diabetic retinopathy
KW - Feature fusion
KW - Fundus lesion segmentation
KW - Multi-scale
UR - http://www.scopus.com/inward/record.url?scp=105014520112&partnerID=8YFLogxK
UR - https://go.openathens.net/redirector/westernsydney.edu.au?url=https://www.sciencedirect.com/science/article/pii/S1746809425009954
U2 - 10.1016/j.bspc.2025.108484
DO - 10.1016/j.bspc.2025.108484
M3 - Article
AN - SCOPUS:105014520112
SN - 1746-8094
VL - 112
JO - Biomedical Signal Processing and Control
JF - Biomedical Signal Processing and Control
M1 - 108484
ER -