Abstract
Mammography is the gold standard screening technique in breast cancer, but it has some limitations for women with dense breasts. In such cases, sonography is usually recommended as an additional imaging technique. A traditional sonogram produces a two-dimensional (2D) visualization of the breast and is highly operator dependent. Automated breast ultrasound (ABUS) has also been proposed to produce a full 3D scan of the breast automatically with reduced operator dependency, facilitating double reading and comparison with past exams. When using ABUS, lesion segmentation and tracking changes over time are challenging tasks, as the three-dimensional (3D) nature of the images makes the analysis difficult and tedious for radiologists. The goal of this work is to develop a semi-automatic framework for breast lesion segmentation in ABUS volumes which is based on the Watershed algorithm. The effect of different de-noising methods on segmentation is studied showing a significant impact (
Keywords
Introduction
Breast cancer is the most common form of cancer in the female population. As shown in the World Health Organization’s cancer report, 1 for all cancer diagnosed in women worldwide, 25.1% were from breast cancer with an estimated 14.7% of reported deaths. In the United States, it is estimated that approximately 12.4% of women could be diagnosed with breast cancer at some point during their lifetime. 2 Moreover, it has been demonstrated that breast cancer survival rate is dependent on the stage at which cancer is diagnosed. Therefore, appropriate imaging techniques are necessary to diagnose this disease at an early stage.
X-ray mammography is the gold standard imaging modality used for breast cancer screening. Although it is a fast and cost-effective way for screening large populations, it has a few limitations for imaging dense tissues as the compression of glandular breast structures during mammography can either simulate or occult lesion patterns, reducing diagnosis performance.3,4 Hence, the diagnosis of cancer using mammography for women with dense breasts becomes challenging and difficult, leading to more missed cancers 5 and less sensitivity. 6 In addition, according to the study of Boyd et al., 7 women with dense breasts are at a higher risk (4-6 times) of having a cancerous tumor compared with fatty or non-dense breasts.
A possible solution to improve cancer detection in dense breasts is personalized breast cancer screening where different imaging modalities are used depending on the individual patient risk. These modalities typically include Ultrasound, Digital Breast Tomosynthesis, and Magnetic Resonance Imaging. 8
Conventional Ultrasound or Hand-held Ultrasound (HHUS) is performed using a hand-held probe and produces two-dimensional (2D) scans. This modality does not systematically cover the whole breast and is highly operator dependent. These limitations can be addressed using ABUS, which generates three-dimensional (3D) scans of the breast and has a standardized acquisition protocol. ABUS is a recent imaging technology that is also suggested as a complementary imaging modality to mammography, especially in women with dense breasts who are at a higher risk and represent approximately 35% of the screened population. 9 With the patient lying in a supine position, ABUS follows a process in which ultrasound gel is applied to the breast surface and then a scanner with a much larger transducer compared with HHUS is placed on the breast to scan the whole breast. The transducer is in contact with the skin through a thin membrane, which adapts itself with the shape of the breast to capture a full 3D view. 10 ABUS overcomes the limitations of the HHUS in terms of obtaining a standardized full 3D volume of the breast instead of arbitrarily scanning only the suspected area in 2D. The volumes produced could be viewed in different planes (axial, sagittal, or coronal), and also three to five different scans can be acquired in different orientations to cover the whole breast. In addition, ABUS volumes can be systematically acquired by a technician, and the radiologist can later analyze them in different batches.
Previous studies on ABUS for breast cancer detection showed an increase in the diagnostic yield from 3.6 per 1000 with mammography alone to 7.2 per 1000 women screened with automated ultrasound. 11 In recent years, there has been a lot of development in ABUS technology, both clinically and also in the field of Computer Aided Diagnosis (CAD). The main focus of CAD tools developed for ABUS has been to improve reading efficiency of large breast volumes and reducing the inter- and intra-reader variability. During diagnosis, when scans over time are available (temporal studies), it is also possible to track the lesion changes and see the effect of biopsy (if performed). However, this comes at the cost of longer reading and evaluation times, given the larger amount of images to be interpreted.
In breast imaging, segmentation is widely used as an intermediate step to discriminate between normal and abnormal tissues based on certain image properties (e.g., intensity variation, texture, etc.). However, due to inherent speckle noise and low contrast of breast ultrasound images, automatic lesion segmentation is still a challenging task.
Although many CAD algorithms have been explored in 2D HHUS,12–18 only a few studies have been proposed for 3D ABUS volumes. For instance, Diaz et al.
19
extended the segmented method developed by Pons et al.
15
for 2D HHUS based on Markov Random Field-Maximum a Posteriori to 3D volumes, showing a decrease in performance from 0.75 to 0.55, in terms of Dice Similarity Coefficient (DSC). Chen et al.
20
used an active contour model to segment breast tumors in 3D images reconstructed from 2D ultrasound, while Moon et al.
21
used speckle and morphological features to classify breast masses in ABUS volumes showing a classification accuracy of 84.4%. Tao et al.
22
developed a breast lesion segmentation approach directly for ABUS using dynamic programming in combination with a spiral scanning technique (
Similar to Tao et al.,22,23 in our previous work, 19 a semi-automatic framework was presented for the applicability of different segmentation algorithms from state of the art on the lesion segmentation of ABUS volumes. The framework was semi-automatic in a way that the expert (radiologist) selects a point in the suspected region. Results suggested the use of the watershed algorithm for ABUS lesion segmentation obtained the best results with an average DSC of 0.74 on a dataset of 20 ABUS volumes (16 benign and 4 malignant lesions). In this work, we further investigate the impact of different de-noising algorithms on segmentation results, using a larger and more challenging dataset of 28 temporal pairs leading to a total dataset of 56 ABUS volumes (30 benign and 26 malignant lesions).
According to Response Evaluation Criteria in Solid Tumors (RECIST) 24 guidelines, lesion change is typically analyzed by measuring the longest lesion axis in three dimensions, and the lesion volume is estimated using this information, without accounting for more accurate volumetric calculation. In this work, we segment the lesion to estimate its volume, which is compared with ground truth (GT) volume to measure the accuracy using Pearson correlation coefficient. The lesion volume analysis has been used in other areas of medical imaging such as Multiple Sclerosis lesion in brain25–27 and lung computed tomography (CT) images. 28 It has been also shown in the literature28,29 that the tumor (lesion) volume measurements can be used to quantify the disease progression.
The main contribution of this work is a semi-automatic framework that performs lesion segmentation in 3D ABUS volumes, discusses the impact of different de-noising methods, and provides the temporal volumetric assessment of breast lesion changes using a temporal dataset. From the clinical perspective, this could be used to provide valuable information to radiologists about the changes in lesions before and after a surgery or therapy is performed. In the following sections, the details of the formulated framework are outlined, where the performance of different de-noising algorithms is evaluated with the watershed segmentation (WAT), followed by temporal analysis of histologically proven benign and malignant lesions.
Material and Method
Dataset
The datasets used in this work are collected from the high-risk ABUS screening trials at Radboud University Medical Centre, Nijmegen (Netherlands), between 2011 and 2014 using a Siemens ACUSON S2000 ABVS (Siemens Medical Solutions, Mountain View, California). This retrospective study is approved by the local institutional review board, and the requirement for informed consent is waived.
The 28 ABUS volumes in three to five different views: anterior-posterior, medial and lateral (left and right), were acquired from 15 patients (average age
Summary of the ABUS Dataset.
ABUS = automated breast ultrasound.
To evaluate the performance of the proposed framework, an experienced radiologist with more than 4 years of experience in ABUS performed localization and manual segmentation on a coronal 2D slice containing the lesion. This initial segmentation was expanded to a 3D volume using the open source Insight Segmentation and Registration Tool-Kit (ITK)-Snap 30 software by medical imaging experts. Furthermore, the 3D volumes were verified (and modified when needed) by another experienced radiologist (25 years in breast imaging) to produce the final GT volume. All the lesions were histologically proven from the biopsy and the categorization is used to evaluate the performance of the segmentation algorithm. It is to be noted that the classification of the lesion is beyond the scope of this work.
Methodology
The formulated framework for lesion segmentation is described in Figure 1. The input to the framework is the 3D ABUS volume and a seed point (in Cartesian coordinates) that corresponds to a voxel within the lesion region. In this work, the seed points are located manually by selecting one of the most inner pixels in the central slice of the lesion following an expert’s annotations. The output of the framework is the binary 3D lesion segmentation and its estimated volume. The framework is discussed in the following sub-sections.

Developed segmentation framework, the corresponding seed point is shown with an “x” marker. Analyzed de-noising methods are listed under the de-noising block. AD = anisotropic diffusion; AD-LBR = anisotropic diffusion-lattice basis reduction.
Masking
The acquired ABUS volumes are high-resolution 3D volumes, which make the segmentation process computationally expensive. Hence, all ABUS volumes are down-sampled from their original resolution
As the volume of the lesion is small compared with the entire ABUS volume, a masking step is added to speed up the processing algorithms. This is performed to include enough distance from the lesion’s border to the edge of the mask to avoid edge artefacts in the volume of interest. Assuming an elliptical shape, the central slice of the prior and current lesion showed an average major axis of

ABUS volume (0.6
De-noising
Similar to conventional HHUS, ABUS volumes suffer from low contrast and speckle noise patterns, which limit the efficacy of posterior analysis steps. Therefore, it would be advantageous to perform suitable de-noising of ABUS volumes before applying any segmentation algorithm. In this work, we investigate the impact of different de-noising techniques as referred from the state of the art, including median filtering, Gaussian filtering, Anisotropic Diffusion (AD), and Anisotropic Diffusion-Lattice Basis Reduction (AD-LBR). Because the ABUS images are in 3D, all the de-noising algorithms used are for 3D images using implementations in ITK. 31 A brief description of these methods is as follows:
In this work, the feature scale
The results for different de-noising methods are shown in Figure 3.

A qualitative example of the different image de-noising methods. (a) Original image without de-noising, (b) Median filter (r = 3 × 3 × 3 pixels), (c) Gaussian smoothing (
Segmentation
In this work, the WAT segmentation algorithm is used, which is a morphological method used extensively in the field of image processing. The underlying theory of the approach is well documented in literature,39–41 and a brief overview is provided here. The implementation of watershed includes creation of a topological map based on the pixels’ gray-level value, for example, a white pixel represents peak, black pixel represents valley, and all other intermediate pixels can be distributed accordingly. As a result, several “catchment basins” are formed corresponding to different local minima. The two adjacent basins are separated by rigid lines to form two separate regions in the image called watershed regions, as shown in Figure 4.

A schematic overview of watershed in one dimension (1D).
In this work, ITK implementation of WAT segmentation is used with the following parameters:
The final 3D segmentation corresponds to the binary region of the WAT algorithm, which contains the seed point. The code for developed framework is available online (GitHub: http://www.github.com/richa624252/Tool-For-ABUS-lesion-segmentation).
Evaluation Measures
DSC is a commonly used measure to provide information regarding the extent of overlap between two areas or volumes42,43 and is typically measured on the scale of 0 to 1 where 1 represents complete overlap with GT. In this work, DSC is measured using False Positives (FP): voxels belonging to the lesion in the segmentation but not part of lesion in GT volume; True Positive (TP): voxels corresponding to lesion in both segmentation and GT volume; and False Negatives (FN): voxels belonging to the lesion in GT but not included in the segmented volume.
The volumetric measure is another metric used to analyze the performance of the segmentation framework. During the acquisition of ABUS volumes, because a small compression is used to stabilize the breast, which is much less than X-ray mammography, it can be assumed that the lesion volumes remain fairly constant during the process. This makes the lesion volume an important parameter from the radiologist’s point of view, especially for analyzing the temporal evolution of the lesion.
The lesion volumes (in
where
The presented framework is developed using the open source ITK libraries. 31 All the image processing routines are implemented in C++, and the computations are performed on a Microsoft Windows 10 machine with an Intel Core i7-4790 processor at 3.6 GHz with 32GB RAM. All the statistical analysis is performed using one-pair sample test (t test) in Matlab 2016a (MathWorks, Massachusetts).
Results
Segmentation Results
The lesion segmentation is performed on 56 ABUS volumes of 0.6 mm3 isotropic voxel spacing using the proposed framework. As a first step, the performance of different de-noising algorithms is evaluated by comparing segmentation outputs (in terms of DSC, FP, FN). The obtained results are shown in Table 2, where μ and σ refer to the mean and standard deviation for all the volumes in the dataset, respectively.
Results Using Different De-noising Filtering with Watershed Segmentation Where μ and σ refer to the Mean and Standard Deviation, respectively.
The bold values signify the best results obtained from the different experiments. DSC = dice similarity coefficient; FP = false positives; FN = false negatives; AD = anisotropic diffusion; AD-LBR = anisotropic diffusion-lattice basis reduction.
In this table, the impact of de-noising on segmentation results is illustrated, and it is observed that AD-LBR outperforms other de-noising algorithms based on the evaluation measures used. This can be explained by the fact that AD-LBR performs smoothing over the volume considering the structure tensor, that is, location and direction of the gradient value, resulting in optimum smoothing and preserving the edge information. These edge information sources complement the WAT algorithm in segmenting 3D volumes. WAT segmentation using AD-LBR also resulted in lower FPs and FNs compared with other de-noising methods. These differences are statistically significant (
The box plot in Figure 5 shows that the DSC values obtained for the majority of cases using AD-LBR are in the range

Box plot summarizing the performance of different de-noising algorithms on lesion segmentation. DSC = dice similarity coefficient; AD = anisotropic diffusion; AD-LBR = anisotropic diffusion-lattice basis reduction.
As a qualitative result, lesion segmentation performed using the developed framework for benign and malignant ABUS cases (coronal view) are shown in Figure 6.

Lesion segmentation: (a) Benign (DSC = 0.90; FP = 0.04; FN = 0.14), (b) Malign (DSC = 0.85; FP = 0.11; FN = 0.18). DSC = dice similarity coefficient; FP = false positives; FN = false negatives.
Because all lesions are rated by an expert radiologist according to the BI-RADS
44
classification, a separate analysis for benign (30 volumes) and malignant (26 volumes) cases is also performed to see if there are any segmentation limitations according to the malignancy of the lesions in the segmentation results. For benign cases, the
Volumetric analysis of the output of the proposed segmentation framework with the

Volumetric correlation between Segmentation (
To quantify the agreement between the

Bland-Altman plot of
A volumetric temporal analysis is then performed on the temporal ABUS volumes (prior and current) to evaluate the changes in volume for both

Volumetric correlation between the
We observe here that for some lesions, the framework showed a decrease in volume in the temporal study. These cases are individually analyzed, and it is found that these are malignant cases where a biopsy was performed between the consecutive scans. Therefore, the framework is successfully able to capture the decrease in lesion volume from prior to current, but due to unavailability of the pathological biopsy sample, the volumetric differences could not be verified.
Finally, the applicability of the developed framework for the temporal analysis of breast lesion segmentation is substantiated by performing an error analysis for the developed framework. The obtained results showed that the temporal variation between the changes in GT and segmented volumes is not statistically significant (
Discussion and Conclusion
In this paper, we proposed a lesion segmentation framework for ABUS volumes for breast cancer, and provide support for the radiologist’s diagnosis in breast cancer examinations. The main contribution of this work is the development of a semi-automatic framework to segment lesion in ABUS that could also be used for temporal analysis of breast volumes. This is clearly shown by the fact that the temporal variation in the segmented volumes (
Although lesion volume is used as a measure of the accuracy of the proposed segmentation algorithms, the potential applications of knowing such 3D information of the lesion are large: volumetric analysis for accurate knowledge of temporal lesion changes, feature extraction (e.g., texture) for lesion characterization, and so forth. The effect of different de-noising algorithms on lesion segmentation for ABUS volumes has also been explored. The results showed a significant difference in the segmentation output when the WAT segmentation algorithm is used with different de-noising methods, in which AD-LBR outperformed (
The results of the volumetric analysis performed on the lesion segmentation showed a high correlation between GT and segmented volumes (
In general, the temporal lesion volume change is similar for both the GT and the segmentation. In two out of 28 volume pairs, the trend between prior and current cases for GT and segmentation is different, and it is found that the segmentation showed a decrease in the volume, whereas the GT showed an increase in the volume. This issue requires further analysis but could be attributed to specific lesion properties, which could not be captured by the used segmentation algorithm.
As the proposed tool is semi-automatic, it requires a manual selection of a voxel within the lesion (i.e., seed point) prior to the WAT segmentation. For this reason, an additional analysis is performed to check the sensitivity of the seed point selection with the proposed method (AD-LBR + WAT). Segmentation results in terms of DSC are evaluated for 15 different seed points randomly located within the lesion area, avoiding the boundaries for all the volumes. The resulting average DSC is
One of the limitations of this work is the lack of an ABUS database to test the repeatability of the proposed segmentation tool. ABUS is a relatively new technique and its clinical use is still not widely spread. Thus, there is a lack of a good public database. Furthermore, inter- and intra-reader variability is a well-known issue in medical imaging that can influence the performance evaluation of the automatic segmentation frameworks. Considering all these issues, the analysis presented here may not be fully extrapolated to clinical practice. However, this work presents an important step toward the development of an automatic tool for breast lesion quantification.
As part of future work, it would be beneficial to work with a larger and repeatable dataset to test the repeatability of the developed framework. Another advantage of having a large dataset would be the possible use of machine learning approaches on ABUS volumes. It would be also useful to make the segmentation process automatic, which could be done either by using another detection tool on the top of the proposed framework or to improve the current framework for automatic lesion detection. An application of the tool in a CAD workstation can be developed by combining the registration of the temporal volumes with the tool to perform segmentation of the lesion. The code developed for this framework is made open source for the members of the scientific community to try the developed approach on their datasets.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is partially supported by the SMARTER project funded by the Ministry of Economy and Competitiveness of Spain, under project reference DPI2015-68442-R. O.D. is funded by the SCARtool project (H2020-MSCA-IF-2014, reference 657875), a research funded by the European Union within the Marie Sklodowska-Curie Innovative Training Networks. R.A. is funded by the support of the Secretariat of Universities and Research, Ministry of Economy and Knowledge, Government of Catalonia Ref. ECO/1794/2015 FIDGR-2016.
