DSpace Repository

Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study

Show simple item record

dc.contributor.author Nirthika, R.
dc.contributor.author Manivannan, S.
dc.contributor.author Ramanan, A.
dc.contributor.author Wang, R.
dc.date.accessioned 2023-01-31T06:38:57Z
dc.date.available 2023-01-31T06:38:57Z
dc.date.issued 2022
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/8939
dc.description.abstract Convolutional neural networks (CNN) are widely used in computer vision and medical image analysis as the state-of-the art technique. In CNN, pooling layers are included mainly for downsampling the feature maps by aggregating features from local regions. Pooling can help CNN to learn invariant features and reduce computational complexity. Although the max and the average pooling are the widely used ones, various other pooling techniques are also proposed for different purposes, which include techniques to reduce overfitting, to capture higher-order information such as correlation between features, to capture spatial or structural information, etc. As not all of these pooling techniques are well-explored for medical image analysis, this paper provides a comprehensive review of various pooling techniques proposed in the literature of computer vision and medical image analysis. In addition, an extensive set of experiments are conducted to compare a selected set of pooling techniques on two different medical image classification problems, namely HEp-2 cells and diabetic retinopathy image classification. Experiments suggest that the most appropriate pooling mechanism for a particular classification task is related to the scale of the class-specific features with respect to the image size. As this is the first work focusing on pooling techniques for the application of medical image analysis, we believe that this review and the comparative study will provide a guideline to the choice of pooling mechanisms for various medical image analysis tasks. In addition, by carefully choosing the pooling operations with the standard ResNet architecture, we show new state-of-the art results on both HEp-2 cells and diabetic retinopathy image datasets. en_US
dc.language.iso en en_US
dc.publisher Neural Computing and Applications en_US
dc.subject Medical image analysis en_US
dc.subject Pooling en_US
dc.subject Convolutional neural networks en_US
dc.subject HEp-2 cell image classification en_US
dc.subject Retinopathy image classification en_US
dc.title Pooling in convolutional neural networks for medical image analysis: a survey and an empirical study en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record