Medicine Technology 🌱 Environment Space Energy Physics Engineering Social Science Earth Science Science
Science 2026-02-16 3 min read

CNN model detects and classifies lung nodules with 98.4% accuracy in CT scan study

A deep learning system trained and tested on the LIDC-IDRI database demonstrates high sensitivity and specificity for pulmonary nodule detection, though single-database validation limits generalizability

Lung cancer and the detection problem

Lung cancer is the leading cause of cancer-related death worldwide. Its mortality rate is high partly because most cases are diagnosed at an advanced stage, when the cancer has already spread and curative treatment is less likely to succeed. Pulmonary nodules - small masses in lung tissue detected on CT scans - can be early indicators of malignancy, but distinguishing nodules that require further investigation from the far more common benign nodules is a significant clinical challenge.

Conventional computer-aided detection systems designed to assist radiologists in this task have shown two persistent limitations: high false-positive rates that generate unnecessary follow-up procedures, and insufficient sensitivity that allows some genuine malignancies to be missed. Both problems add cost and clinical burden to an already strained radiological workflow.

Deep learning approaches, particularly convolutional neural networks (CNNs), have shown performance advantages over earlier computer-aided detection methods in multiple imaging tasks. A retrospective study published in Exploratory Research and Hypothesis in Medicine applied a CNN architecture to the task of pulmonary nodule detection and classification using the publicly available LIDC-IDRI database.

The study design and results

The analysis included 82 patients and 10,496 CT scan slices from the LIDC-IDRI database. The processing pipeline followed five sequential steps: image preprocessing, lung parenchyma segmentation using Otsu's thresholding and morphological operations, detection of nodule candidates, feature extraction, and final classification using the CNN model.

The CNN architecture consisted of two convolutional layers with 20 and 30 filters respectively, using 3x3 kernels, ReLU activation functions, max-pooling layers, and a softmax output layer. The network was trained with a mini-batch size of 32 for 50 training epochs, using a Stochastic Gradient Descent with Momentum optimizer set to a learning rate of 0.001 and momentum of 0.9.

On the LIDC-IDRI test data, the model achieved a sensitivity of 98.7%, specificity of 97.5%, precision of 97.9%, and overall accuracy of 98.4%. The model also distinguished between nodule subtypes - solid, partially frosted glass, and totally frosted glass - with satisfactory discrimination across these categories.

Comparison with recent published approaches including hybrid CNN-LSTM (long short-term memory) models and ResNet-based architectures showed the proposed method achieving competitive performance while maintaining lower computational complexity.

Important limitations

The researchers are transparent about the study's significant limitations. The entire analysis was conducted on a single database - LIDC-IDRI - which means the performance figures describe how well the model performs on data similar to its training data. Whether those performance levels would hold on CT scans from different scanner manufacturers, different imaging protocols, or different patient populations is unknown.

The sample size of 82 patients is also modest by the standards of clinical validation. Robust deployment of a nodule detection system in clinical practice would typically require validation on thousands of patients across multiple institutions before performance claims could be considered reliable.

The study's authors acknowledge both limitations explicitly: "Despite strong results, the study acknowledges limitations such as single-database validation and a relatively small training size." They outline plans to validate the model on additional databases including ELCAP and NELSON, and to optimize multi-class classification performance for improved generalizability.

Context in the broader AI diagnostic imaging landscape

AI-assisted nodule detection is an active area with multiple competing approaches at various stages of clinical validation. The performance metrics reported in this study (98.7% sensitivity, 97.5% specificity) are at the high end of what has been reported in the literature for similar tasks, which warrants careful interpretation. Published performance on curated databases consistently overestimates real-world performance, partly because databases like LIDC-IDRI were annotated by multiple expert radiologists to establish consensus labels that are cleaner than the ambiguous cases radiologists encounter in routine practice.

The computational simplicity of the proposed architecture - noted by the authors as an advantage over more complex models - could be relevant for deployment in resource-limited settings where high-performance computing infrastructure is not available. Simpler models that achieve near-equivalent accuracy to more complex ones while requiring less computational overhead may be more practical for integration into routine clinical workflows.

Source: The study was published in Exploratory Research and Hypothesis in Medicine. Full text available at: https://www.xiahepublishing.com/2472-0712/ERHM-2025-00032. The analysis was conducted on 82 patients (10,496 CT slices) from the LIDC-IDRI database; single-database validation limits generalizability.