An unsupervised semi-automated pulmonary nodule segmentation method based on enhanced region growing

He Ren; Lingxiao Zhou; Gang Liu; Xueqing Peng; Weiya Shi; Huilin Xu; Fei Shan; Lei Liu

doi:10.21037/qims.2019.12.02

Original Article

An unsupervised semi-automated pulmonary nodule segmentation method based on enhanced region growing

He Ren^1,2#, Lingxiao Zhou^1#, Gang Liu^1#, Xueqing Peng¹, Weiya Shi¹, Huilin Xu¹, Fei Shan¹, Lei Liu¹

¹Shanghai Public Health Clinical Center & Institutes of Biomedical Sciences, School of Basic Medical Sciences, School of Data Science, Fudan University, Shanghai 200032, China;²Shanghai University of Medicine & Health Sciences, Shanghai 201318China

^#These authors contributed equally to this work.

Correspondence to: Lei Liu; Shan Fei. Shanghai Public Health Clinical Center & Institutes of Biomedical Sciences, School of Basic Medical Sciences, School of Data Science, Fudan University, Shanghai 200032, China. Email: liulei_sibs@163.com; shanfei@shphc.org.cn.

Background: Nowadays, computer technology is getting popular for clinical aided diagnosis, especially in the direction of medical images. It makes physician diagnosis of lung nodules more efficient by providing them with reliable and accurate segmentation.

Methods: A region growing based semi-automated pulmonary nodule segmentation algorithm (ReGANS) was developed with three improvements: an automatic threshold calculation method, a lesion area pre-projection method, and an optimized region growing method. The algorithm can quickly and accurately segment a whole lung nodule in a set of computed tomography (CT) images based on an initial manual point.

Results: The average time taken for ReGANS to segment 1 pulmonary nodule was 0.83s, and the probability rand index (PRI), global consistency error (GCE), and variation of information (VoI) from a comparison between the algorithm and the radiologist’s 2 manual results were 0.93, 0.06, and 0.3 for the boundary range (BR), and 0.86, 0.06, 0.3 for the precise range (PR). The number of images covered by one pulmonary nodule in a CT image set was also evaluated to compare the segmentation algorithm with the radiologist's results, with an error rate of 15%. At the same time, the results were verified in multiple data sets to validate the robustness.

Conclusions: Compared with other algorithms, ReGANS can segment the lung nodule image region more quickly and more precisely. The experimental results show that ReGANS can assist medical imaging diagnosis and has good clinical application value. It also provides a faster and more convenient method for pre-data preparation of intelligent algorithms.

Keywords: Lung cancer; pulmonary nodule; segmentation; computed tomography (CT)

Submitted Jul 04, 2019. Accepted for publication Nov 27, 2019.

doi: 10.21037/qims.2019.12.02

Introduction

According to reports from the World Health Organization (WHO) and major cancer research centers, lung cancer has the world’s highest cancer mortality rate (1). A status report produced by the International Agency for Research on Cancer states that lung cancer remains the leading cause of cancer incidence and mortality, with 2.1 million new lung cancer cases and 1.8 million deaths predicted worldwide in 2018, representing close to 1 in 5 (18.4%) cancer deaths (2). However, early detection with computed tomography (CT) has been shown to help reduce lung cancer-specific mortality (3). In medicine, a lung nodule is defined as a focal opacity whose largest diameter is between 3 mm and 3 cm in length (4). Nodules with diameters less than 3 mm are known as micronodules (5).

In general, the radiologist forms a diagnosis by reading the CT image in digital imaging and communications in medicine (DICOM) format (6). The radiologist quickly searches through all the images to determine the accuracy of suspected pulmonary nodules with obvious features and then combines those images of the previous and subsequent consecutive images to analyze the size, characteristic changes, and signs of the lung nodules in the corresponding images. This kind of reading process takes quite some time and puts intense pressure on radiologists, especially when they have been working all day and must repeatedly switch between various conditions. Hence, computer technology is required to quickly and accurately segment/mark lung nodule images to assist radiologists in diagnosing diseases (7-9). This supplementary approach has been used for applications such as lung parenchyma density analysis (10,11), airway analysis (12,13), and pulmonary nodule detection (14-16). The application of computer-aided detection in pulmonary nodules segmentation with CT images and nodule type and characteristics determination would be helpful for the early detection of lung cancer and tumor diagnosis.

A general process of automatic methods for pulmonary nodule segmentation usually includes image acquisition, preprocessing, pulmonary parenchyma segmentation, focal region extraction, optimization, and feature extraction (17,18).

Many researchers have studied image segmentation (19-22). Some approaches identify lesions based on preset shape-information of the target area and use machine learning-based algorithms to detect lesion features (23). However, many segmentation algorithms require image denoising, which would severely affect the processing speed as considerable manual input is required. Also, some automatic segmentation algorithms require manual operations in preprocessing, such as range determination and threshold testing (24-26). These result in extremely long segmentation time and reduce the accuracy of results. Therefore, these segmentation algorithms have extremely limited practical value. Other studies use images of a single lesion or CT-enhanced scan image for segmentation, and thus the results are not reproducible. Furthermore, those automatic segmentation algorithms also require a large amount of segmented lesion data for prior learning, which takes a lot of pre-work, and the quality of the data used for model training directly affects the quality of the model.

In this paper, a new semi-automated algorithm of lung nodule segmentation is introduced. It links the clinical manifestation of the lesion to image features via an image processing method. The results of ReGANS were compared with other computer-based segmentation techniques and manual segmentation performed by radiologists to validate the efficiency and accuracy, evaluated by probability rand index (PRI) (27), global consistency error (GCE) (28), and variation of Information (VoI) (28). Multiple types of nodules were also randomly selected for an experiment to prove its robustness.

Methods

The study design was approved by the Shanghai University of Medicine and the Health Sciences Ethics Review Board. The need for informed consent was waived.

Dataset

Two datasets were used in this study. The first one was generated at the Shanghai Public Clinical Health Center (SPCHC) and was collected between 2016 and 2017 (Table 1). An unenhanced chest CT exam was performed for the entire lung scan of each patient with a United-Imaging 760 CT device (42–126 mA 120 kV, slice thickness of 1 mm) and a Siemens Emotion 16 CT device (34–123 mA, 130 kV, 1 mm) with a 512×512 resolution. A total of 55 sets of image data were collected with 407 CT images. Among them, 25 groups (166 CT images) were used in the analysis of the speed and accuracy of the method, and the remaining 30 groups (241 CT images) were used in further method robustness verification experiments. Two radiologists performed the diagnosis of the pulmonary nodule.

Table 1 Case information
Full table

Another dataset was taken from the The Lung Image Database Consortium Image Collection public data set (LIDC-IDRI) (https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI). A total of 30 sets of image data with 234 CT images were randomly selected.

The snake (29), special processing named selective binary Gaussian filtering regularized level set (SBGFRLS) (30), and level-set (31) methods were compared to the ReGANS. These methods were partially modified with reference to the experimental environment and data. As the number of iterations was increased further, the time taken by the algorithm increased substantially without a significant gain in optimization performance. The Snake and level-set used 250 iterations. For the SBGFRLS method, the image was first cut into a small square centered on and fully containing the lung nodule, which in turn increased the speed of the algorithm. The region-growing algorithm in the ReGANS method involved the evaluation of the 8 neighboring voxels surrounding an initial seed point. These surrounding voxels were evaluated according to their similarity to the seed voxel. They were accepted or rejected based on a combination of an artificially set change rate from the starting point, the voxel value change rate of the region, and the initial voxel values of the region (see step “Pulmonary nodule segmentation” for details).

Running environment

All the algorithms in the experiment were implemented in MATLAB 2017a on a personal computer machine with a 2.6 GHz Intel (R) Core (TM) i7-5600 processor and 8 GB of RAM.

Proposed method

The main steps of ReGANS are: (I) extraction and optimization of lung lobes; (II) determination of the extent of lesions; (III) segmentation of pulmonary nodules; (IV) and determination and continuation (Figure 1).

Figure 1 Overview of the steps in the pretreatment approach and pulmonary nodule.

Lung lobe extraction

The goal of this step was to separate the lung lobes from the CT image and link the values of the image pixels to the medical clinical depiction. The parameters used to segment the lung lobes from the image were based on both the optimal threshold algorithm and the medical clinical description.

The gray value was converted to a CT value using information in the DICOM format file, and the abnormal point in the image was removed (6). The threshold was by an iterative algorithm (32), and the lung cavity region was extracted and recorded as IM2. Then, the threshold was calculated in IM2, combined with the location of the lobe in IM2 and other features to separate the lung lobe region C1. The boundaries of C1 were optimized using image processing techniques (33), and a new image was acquired (M3).

Determination of the extent of the lesion

Locating the pulmonary nodules is the most important step for lung nodule segmentation. A fuzzy positioning method was employed to identify the nodule range.

The probability density map IM3 was created using 0.45 as the threshold to obtain the upper limit of the lung parenchymal CT values of the current group from the graph. IM3 and the upper limit were used as the thresholds to obtain the binary image of the lung lobe (BL1). Eight rays crossing the positioned coordinate points with angles of 0°, 45°, 90°, 135°, 180°, 225°, 270°, and 315° from the horizontal were extended on BL1 until they intersected the horizontal axis. The maximum and minimum lengths of the 8 rays were removed, and the average length of the other 6 rays was calculated AvgL. The “slice_Location” in the DICOM file was used to determine the spacing of each layer to calculate the number of sheets (Ser1) that the nodule may cover. A maximum intensity projection (34) was performed on Ser1 to obtain the image IM4. A square using positioned coordinate points as the center and 2 × AvgL as the side lengths was drawn, and the square region in IM4 was defined as S1. The maximum value and the minimum values of S1 were extracted and used to perform a “bilateral filter” (35) to obtain the image (IM5). An iterative threshold algorithm was used (IM5) to obtain the threshold (T2). IM5 was binarized using the T2 threshold. The “imclose” (33) operation was subsequently performed on the image, and it was filled to obtain the lung nodule range (LNR).

Pulmonary nodule segmentation

The purpose of this step was to segment the pulmonary nodule, and it was completed after image normalization, lung lobe extraction, and lesion extension. In this step, a single image was processed, and the radiologist specified a coordinate point. The subsequent images were defined by the result of the previous image (see step D for details) with the starting image being in range of the LNR.

To calculate the range of pixel variable values within the LNR, 8 rays originating from the specified coordinate points were drawn, and the pixel values were recorded. The absolute value of the gradient value between the pixels before and after was calculated and uniquely represented by Gr. The valid data were extracted and the average (AvgS) was calculated. To calculate the center point of the pixel variation range in the LNR region, all pixel values in the were sorted from large to small, and the optimal value (EP) was extracted.

EP, AvgS, and the original image were used as inputs, and a modified region-growing method was used to obtain the putative pulmonary nodule region, PN. Based on the original Region-Growing method, our method optimized the pixel growth judgment conditions so that it could be automatically performed under the influence of EP, AvgS, and initial coordinate points. This can solve the problem of the pixel growth range in the original method needing to be manually set each time which leads to inaccurate segmentation.

Further optimization is required because of the influence of factors such as blood vessel penetration in the PN region. An “imclose” operation PN was performed to obtain a series of regions. The largest region was selected and reported as LR. Then, LR was projected onto the LNR. The region was denoted as LNI, and the regions of the LNR outside of the projection were denoted as LNO. The areas of LNI and LNO were calculated, and the average pixel values of LNI and LNO were denoted as AI and AO, and PI and PO. The following were calculated: P = PI − PO; AS = AO − AI. Using AS and P as the decision conditions, the various possible situations were analyzed, and the target area (TIM) of the image was calculated.

Loop and terminate

Because the CT was based on a series of images, the nodule segmentation continued to the next image. The average CT value of the TIM was then calculated. If it was not less than −775, and the difference in variance between this image and the previous image was less than 0.002 (the variance of the 0^thTIM was defined to be same as the 1^stTIM), the current image segmentation was considered to be valid, and the next image was processed. Otherwise, the loop was terminated. The starting coordinate point in the IM_ai step was defined as the gravity center of the TIM in IM_ai−1.

Results

The performance of the newly developed ReGANS method was evaluated and compared with Snake, SBGFRLS, and level-set method, taking two types of radiologist’s segmentation results as a benchmark. The lesion boundary delineated by radiologists was defined as the boundary range (BR), while the precise range (PR) was BR without blood vessels and bronchi. The performance of the ReGANS was evaluated in aspects of accuracy, speed, and robustness. In addition, detailed information on the pulmonary nodules used in the first dataset was also recorded (Table S1, supplementary material).

Table S1 Details of each lung nodule in the first dataset
Full table

The accuracy and robustness of the ReGANS were evaluated by PRI, GCE, and VoI (19,36). Using the MATLAB runtime environment, the “compare_segmentations.m” toolkit was downloaded to perform calculations (https://www.dssz.com/778476.html).

Accuracy

The planar segmentation results of the four algorithms were compared based on the second dataset. The accuracies of these algorithms are shown in Table 2.

Table 2 Accuracy of each segmentation result
Full table

The average PRI of ReGANS, when compared to BR, was 0.93 (95% CI, 0.90–0.96), while that of the other three algorithms was lower (0.59–0.82), which indicated that ReGANS could get more accurate results close to BR. Meanwhile, the average GCE between ReGANS and BR was 0.058 (95% CI, 0.05–0.07), which suggested that the global error of ReGANS was negligible. Consistently, the average VoI between ReGANS and BR was 0.30 (95% CI, 0.26–0.34). The average GCE and VoI ofReGANS were significantly lower than those of the other three algorithms (for all comparison, P<0.0001). The average PRI, GCE, and VoI between the ReGANS and PR were 0.86 (95% CI, 0.84–0.88), 0.06 (95% CI, 0.052–0.068), and 0.30 (95% CI, 0.27–0.33), respectively. These results indicate that the segmentation performance of the ReGANS was comparable to that of PR.

The supero-inferior coverage difference between the PR and ReGANS in each sample was compared; the radiologist diagnosed 166 images, 28 of the ReGANS results were not in the radiologist's diagnosis, and two correct results were not calculated. The result showed that the error and loss rate of the range of tuberculosis detected by the ReGANS was less than 18% (Figure 2A). Most of the missing images were terminal in each sample. The CT values of the nodule range were close to the lung parenchyma, causing it to be ignored by the ReGANS (Figure 2B).

Figure 2 Algorithm detection results compared with actual nodule coverage. In (A), the blue area represents the intersection between ReGANS and the manual annotation superoinferiorly; the yellow area represents the images neglected by ReGANS; the black area represents extra images calculated by ReGANS; I indicates the nodules in this set of data that have no other tissue interference; II represents one blood vessel/bronchus passing through the lung nodules; III represents multiple blood vessels/bronchi passing through the lung nodules. (B) represents a missing image due to the CT value of the lung nodule area in the image being low. (C) represents an extremely irregular pulmonary nodule and depicts the results of the ReGANS algorithm. (D) shows a pulmonary nodule with blood vessels passing through and depicts the results of segmentation by the ReGANS method.

The performance of the ReGANS on special types of pulmonary nodules was also adequate. The ReGANS could segment lung nodules with special signs, like the vacuole sign (Figure 2C). For special lung nodules with blood vessels or bronchial passages, the ReGANS could also be effectively segmented, although the complete effective area of the pulmonary nodules is not fully depicted in Figure 2D. Collectively, the ReGANS exhibited a better accuracy than the other algorithms.

Speed

The segmentation time of each algorithm was recorded as the number of images of lung nodules in each group of data ranging from 4–22. Eight images (average) were used to calculate the overall segmentation time. The average segmentation time of a single CT image was 0.83 s (95% CI, 0.829–0.840) and segmentation of all the lung nodule images took 6.64 s (Table 3). The average segmentation times of the Snake, SBGFRLS, and level-set methods for a single image were 13.04, 2.42, and 19.02 s, respectively. The radiologists’ manual segmentation time was 60 s, and thus, a minimum of 480 s (8 min) was required to process these images.

Table 3 Time of segmentation of each method
Full table

The time required to segment the image with the ReGANS was generally consistent. Images with special types of nodules did not significantly increase the time required. As shown in Figure 2D, the time required by the ReGANS was only 0.92 s. Thus, the speed of the ReGANS was significantly higher than those of the other algorithms.

Robustness

ReGANS was also used to split 30 sets of images in the first dataset and images in the third dataset. The accuracy and speed of the ReGANS were evaluated for those two datasets, as shown in Table 4.

Table 4 Method robustness comparison with two datasets
Full table

At the same time, the serial number of the lung nodule image in a set of case images calculated by REGAN was also recorded and compared with the diagnostic results of the radiologist. In the 30 images in the first dataset, the radiologist diagnosed 241 images, 17 of the ReGANS results were not in the radiologist's diagnosis, and 12 correct results were not calculated. In the 30 images in the third dataset, the radiologist diagnosed 234 images, 14 of the ReGANS results were not in the radiologist's diagnosis, and 9 correct results were not calculated. The total error rate was 10.95%

Further inspection

To further evaluate the segmentation accuracy of the ReGANS method in other diseases of the lung. We collected some CT images of pulmonary fibrosis for segmentation experiments. The segmentation results are shown in the appendix.

Discussion

Most currently reported algorithms are not satisfactory for clinical application. First, automatic segmentation operations usually must be performed on 5–10 images, sometimes 20, and the time spent for published algorithms is several minutes (29-31). Second, other methods have many complicated pre-processing steps and cumbersome manual operations (37), including determining approximate ranges, building similar shapes, and enhancing image contrast. Moreover, in some algorithms, iterative analysis is required, and professional training is necessary. The automatic segmentation algorithms require a substantial amount of data and manual targets to establish models through algorithms. These models are affected by data volume, data classification, and artificial results. Lastly, in most cases, the reported algorithms cannot process a set of images as a single sample. In other words, the supero-inferior images of a patient cannot be processed automatically. A few researchers have developed automatic supero-inferior algorithms, but at a low speed with more than 10 min needed.

ReGANS requires only 6–8 s, even for complicated lung nodules. Only one-click operation is required for the ReGANS, and the radiologist does not need to pre-process the images and select the pulmonary nodule one image at a time. ReGANS is a very simple operation that offers a radiologist with a continuous feature image of a pulmonary nodule quickly and accurately. This can effectively help radiologists by providing them with reliable diagnostic information. To a certain extent, they are freed from a whole day of tedious work.

Segmentation algorithms based on machine learning or deep learning do have significant advantages. A mature model that completes training can also quickly and accurately segment the lesion area in practical applications. In some cases where hardware devices are better, they can even do segmentation faster than ReGANS, but, limited by the design of the model itself, the quality of the training image directly affects the capabilities of the model. With high speed and accuracy, ReGANS can provide high-quality annotation images for model training and effectively compress the pre-preparation time.

ReGANS also has its limitations. The threshold calculation process for image binarization is a bit conservative, which leads to the missing of terminal images when segmenting. The automatic boundary calculation of the lung nodule starts from an initial manual point. In some extremely complicated situations, such as multiple blood vessels passing through the lung nodules, to avoid the blockage of blood vessels and other tissues, the segmentation of the lung nodules will always leave some areas behind. In addition, determining the extent of pulmonary nodules for radiologists is only part of the diagnostic process, and subsequent analysis of signs such as lobulation, bronchus sign, and pleural indentation requires a more accurate diagnosis. This requires the subsequent segmentation method to be able to not only accurately determine the lesion area, but also to identify each sign. What is more, the segmentation results of pulmonary fibrosis also reflect the limitations of ReGANS. Although the “maximum intensity projection” method applied in ReGANS can define the lesion area of the lung nodule image well, it can also cause overlapping features of multiple lesions in some more complex images, which instead interferes with the algorithm’s determination of the suspected area of the lesion and also interferes with the “segmentation threshold calculation” and “segmentation result optimization” in the later stages of the algorithm.

An evolved method is then required to convert the image information into medical clinical description, so as to identify the lesion signs from the perspective of image semantics, and finally, provide more accurate diagnostic information for radiologists. This is also the future research direction of ReGANS.

Conclusions

The experiments showed that this semi-automatic lung nodule segmentation method can accurately and quickly segment all the lung nodule images in a set of CT data while also preserving the original characteristics of lesions. It can effectively assist the radiologist’s diagnosis and provide reliable training data for intelligent algorithms.