Detection of malignant tissue in mammography image using morphology based segmentation technique

Sharma N 1, Manjhi J 2

1Neha Sharma, M Tech Biomedical, Shobhit institute of Engg. and Technology Meerut, India, 2Jayanand Manjhi, Professor and co-ordinator Biomedical Department, Shobhit university, Meerut, UP, India

Address for correspondence: Neha Sharma, Email: nehasharma29bm@gmail.com



Abstract

Breast cancer is the leading cause of the death among the women. Mammography is the best diagnostic technique for the breast cancer. But not all breast cancer can be seen by mammogram. Although breast cancer can be mortal, people have the highest chances to survive if cancer could be detected at the early stages. But there are certain limitations of the segmentation technique it is difficult to find the effected region perfectly. The proposed work deals with an approach for extracting the malignant masses in the mammography image for the detection of earlier breast cancer. The steps involve in this work are removal of noise from the background information, thresholding and retrieving the largest region of interest, performing morphological operations and extracting the ROI and identifying the malignant masses from the image. This method is compared with Enhancement, edge detection, Region Growing, Watershed Transformation techniques and found more accurate, sensitive, and precise in comparison to the others.

Keywords- Mammogram, Malignancy, Morphological, Median filter, ROI, Segmentation, Thresholding



Manuscript received: 12th February 2016, Reviewed: 22nd February 2016
Author Corrected: 3rd March 2016, Accepted for Publication: 19th March 2016

Introduction

Breast cancer is a malignant tumor that starts in the cells of breast. A malignant tumor is a group of cancer cells that can grow into surrounding tissues or spread to distant areas of the body. The disease occurs almost entirely in women. Signs of breast cancer may include a lump in the breast, a change in breast shape, dimpling of the skin, fluid coming from the nipple, or a red scaly patch of skin. In those with distant spread of the disease, there may be bone pain, swollen lymph nodes, and shortness of breath or yellow skin. Breast cancer can be separated into different types based on the way the cancer cells look under the microscope [1]. Most breast cancer are carcinomas, a type of cancer that start in the cells(epithelial cells) that line organs and tissues like the breast. In fact breast cancers are often a type of carcinoma called adenocarcinoma, which is carcinoma that starts in glandular tissue. Other types of cancers can occur in breast too, such as sarcoma which starts in the cells of muscle, fat or connective tissue. In some cases a single breast tumour can be a combination of different types or be a mixture of invasive and in situ cancer. And in some rare types of breast cancer, the cancer cells may not form a tumour at all. Breast cancer is the most common cancer among women. About 1 in 8(12%) women in the world will devlop invasive breast cancer during lifetime [2]. Mammography is one of the best and commonly used technique in the diagnosis of breast cancer. It is a specific type of imaging that uses a low- dose x-ray system to examine breasts but sometimes it is difficult for the radiologist to detect tumor or cancerous mass so CAD techniques are used for breast masses detection. To detect the malignant region the image is segmented or partioned into different region [3].

The techniques availabe for segmentation of mammography images include: Image enhancement technique, histogram equalisation technique, watershed marker based technique, region growing technique, K- means clustering technique, Edge detection, transformation techniques etc. The drawbacks of these techniques are that this generally lacks in providing robustness and imperceptibility requirement. It usually enhances the whole image in a uniform manner, which in many cases produces undesirable results. It is not possible to selectively enhance edges or other required information effectively. Gaussian filters are mostly used in the removal of white noise. So it is not possible to remove noise completely or properly [4]. Some techniques reduce the details of the image or image information. watershed method did not give that much accurate result as the region extracted from the image is the final extracted result that is not correct.

All the drawbacks of existing techniques are completely overcome with the proposed method for segmentation of mammography image i.e. morphological technique and thresholding of image. This technique not only overcomes all the limitations but also increase image quality which is measured with the help of image quality parameters.

Materials and Methods

For image segmentation the MATLAB software of 7.1 version is used. And one sample of mammography image is taken from MIAS database. The image is a true color image with size 1O24x1O24. The proposed research intends to develop an image processing algorithm to extract the malignant mass from the mammography image for the detection of breast cancer. The work proposed is based on the following procedure: (a) removing the noise with median filter and the background information. (b)Applying thresholding and retrieving the largest region of interest (ROI). (c) Performing the morphological operations and extracting the ROI and identifying the malignant mass from the screened images of the breast. This method was tested on the mammography image taken from the MIAS database and implemented using MATLAB code. The technique is compared with other existing technique i.e. enhancement technique, watershed, region-growing This technique consist of two processes:

A. Pre-processing- Pre-processing is an important issue in low-level image processing. The underlying principle of pre-processing is to enlarge the intensity difference between objects and background and to produce reliable representations of breast tissue structure [5]. The main objective of this process is to improve the quality of the image to make it ready to further processing by removing the unrelated and surplus parts in the back ground of the mammogram. Breast border extraction and pectoral muscle suppression is also a part of pre-processing. Pre- processing include following steps:

i. Initial cropping: cropping is one of the most important part of pre processing step. Imcrop command is used for cropping the image. Cropping is done on the raw mammogram image taken from the MIAS data base. In MIAS database each image is clipped or padded so that it is 1024x 1024 pixels which is the size of all raw mammographic image. Therefore the dark padding and the region outside the breast are excluded from the image. After cropping only the region of interest is kept for further processing of image.

ii. Intensity adjustment: Image enhancement techniques are used to improve an image, where "improve" is sometimes defined objectively (e.g., increase the signal-to-noise ratio), and sometimes subjectively (e.g., make certain features easier to see by modifying the colors or image's intensity values).[5]

iii. CLAHE (Contrast limited adaptive histogram equalization): CLAHE is a modification of the original histogram equalization operating on small regions in the image, called tiles, rather than the entire image. Each tile's contrast is enhanced, then the neighbouring tiles are combined using bilinear interpolation (directly implemented by MATLAB) to eliminate artificially induced boundaries.

iv. Noise reduction: After contrast enhancement noise is reduced from the image using median filter. A median filter is a nonlinear filter efficient in removing salt and pepper noise median tends to keep the sharpness of image edges while removing noise. Median filtering corresponds to non- linear smoothening which is used to reduce random noise. This smoothening filter allows edge preservation when compared to other smoothening filter. This filtering ranks the image pixels in a certain neighbourhood of certain pixel and replaces the value of this by median of neighbourhood.

B. Image segmentation- The most important stage of mass detection CAD schemes is to separate the suspicious regions that may contain masses from the background parenchyma i.e., to partition the mammogram into several non-overlapping regions, then extract regions of interests (ROIs), and locate the suspicious mass candidates from ROIs. The suspicious area is an area that is brighter than its surroundings, has almost uniform density, has a regular shape with varying size, and has fuzzy boundaries. Segmentation of masses from the rest of the image can be a complex process due to the diversity of mass characteristics from one image to the other.[5,6]

C. Remove background information- In order to remove the background information such as wedges and labels in the mammogram images, first convert the grayscale or color image to binary image by using threshold technique and morphological operations. Thresholding is the simplest method of image segmentation. From a grayscale image, thresholding can be used to create binary images. After the gray scale mammogram images are converted into binary, remove the noise or background information from the binary image by removing all the smaller objects except the largest mammography part.

D. Thresholding- Here Histogram thresholding and slicing techniques are used to segment the image. They may be applied directly to an image, but can also be combined with pre- and post-processing techniques. Thresholding is also known as binarization of image. Thresholding approaches segment scalar images by creating a binary partitioning of the image intensities.

E. Removing connected components corresponding to the pectoral muscles- This is a supplemental stage to remove the high intensity areas that are known to be non-mass regions like the pectoral muscle and any artifact labels that are attached to the borders of the image. White neighborhoods all over the image that are connected to the image's borders are removed during this stage, resulting in a darker image.

F. Elimination of noise by locating connected segment larger area- Once the mammogram image has obtained, result after the series of steps of the proposed method, images. In obtained images noise is observed. To eliminate this noise will seek the connected component with the greatest area, and the rest is eliminated, so that with this process, the resulting image is segmentation of abnormality present in mammography. With this process the smaller binary objects are removed from the mammography image leaving behind the largest part.

G. Perform subtraction- Image subtraction or pixel subtraction is a process whereby the digital numeric value of one pixel or whole image is subtracted from another image which makes the background disappear leaving only the target. It simply compares the previous frame image with the current one. Image subtraction is a tool for transient object discovery and characterization, Z = imsubtract(X, Y) subtracts each element in array Y from the corresponding element in array X and returns the difference in the corresponding element of the output array Z. X and Y are real, non sparse numeric arrays of the same size and class, or Y is a double scalar. In this paper subtraction is done on two images, the final segmented image and the gray scale image. After subtracting segmented image from the gray scale image the final image of the tumor is obtained.

H. Erosion- Erosion is one of two fundamental operations in morphological image processing from which all other morphological operations are based. It was originally defined for binary images, later being extended to grayscale images, and subsequently to complete lattices. The erosion, of f by a flat structuring element b at any location (x, y) is defined as the minimum value of the image in the region coincident with b when the origin of b is at (x, y). Erosion removes small-scale details from a binary image but simultaneously reduces the size of regions of interest, too.

Result and Discussion

figure01

figure02

Figure 3 (a) Original mammography image (b) Gray- scale image (c) Cropped image (d) intensity adjusted image (e) CLAHE image (f) median filtered image (g) entropy image (h) standard deviation image (i) local range of an image (j) binary image ( k) ROI region (l) pectoral muscle removed image (m) background subtracted image (n) final segmented image

The quality of the image is obtained by calculating image quality parameters shown in table below: The overall accuracy is the ratio between the total number of correctly classified regions and the test set size (total number of region s). Hitting denotes the ratio of correct segmentation while missing denotes the ratio of missing mass. Over hitting denotes the ratio of false mass segmented, Relative hitting denotes the relative correct ratio against segmentation result, relative missing denotes relative missing ratio against segmentation result. Sensitivity is a true positive measure that refers to the proportion of image that contains cancerous mass which have been classified correctly. Specificity is a true negative measure that refers to the proportion of image containing a cancerous mass that have been incorrectly classified [7].  

Table- 3.1 Diff Image Performance Parameters of Improved Morphological Technique

Accuracy

Sensitivity

Precision

specificity

Missing

Over hitting

Relative missing

Kapps

77%

89%

82%

29%

10%

19%

9%

85%


When the values of improved morphological technique is compared with the values obtained from other technique i.e. Enhancement, edge detection, Region Growing, Watershed Transformation techniques it is concluded that this technique is better than other technique and removes all the drawbacks and limitation of existing technique. This technique provide robustness and with this technique it is possible to enhance edges and the technique is efficient for noisy images.

Conclusion and future work: Identifying breast cancer is a challenging problem in medical image processing and medical fields. Digital mammogram on a particular segmentation algorithm makes it difficult to identify the breast cancer accurately. The acquisition parameters also influence the quality of the image. Mammography image segmentation using this technique is efficient in getting the malignant breast region and extracting the malignant mass. For future work it may planned to develop an algorithm to acquire a smoother breast region for image processing, and it can also be used to study the behavior of the breast, to improve the edge detection, to classify between breast regions and masses, and the algorithm can also detect abnormalities in segmented breast and which can produce more accurate results by increasing image quality parameters than proposed method.

Appendices

Formulae:

Accuracy =(True Positive Region+True Negative Regions)/(False Positive+true positive+true negative+False Negative )................ i

sensitivity =(True Positive Region)/(True Positive Regions+False Negative Regions )..............................ii
specificity =(True negative Region)/(True negative Regions+False positive Regions ).............................iii

Precision =(True positive Region)/(True positive Regions+False positive Regions ).................................iv

Missing =(false negative Region)/(True positive Regions+False negative Regions )....................................v

over-hitting =(false positive Region)/(True positive Regions+False negative Regions ).......................vi

Relative missing =(false negative Region)/(True positive Regions+False positive Regions )................vii

Kapps=(2*hitting)/(2*hitting+missing+over hitting )........................................................viii

T.P- True positive region is the region of image that contains cancerous mass.

T.N- True negative region is the region that does not contain malignant mass.

F.P- False positive region refers to portion of image incorrectly classified as cancerous mass.

F.N- False negative region refers to the portion of image incorrectly classified as not a portion of cancerous mass.

Acknowledgement

The work presented in this thesis could not have been done without the help of all of whom I want to thank. First, It gives me immense pleasure to express my deepest gratitude and indebtedness to Dr Jayanand, my thesis supervisor, Associate Professor, and Coordinator Centre for Biomedical Engineering., Shobhit University, Meerut, for his patient guidance and support under whose inspiration, encouragement and guidance I have successfully completed this thesis work. I am truly very fortunate to have the opportunity to work with him. I found his guidance to be extremely valuable. My heartiest thanks goes to my co-supervisor Mrs. Shiva faculty, Assistant professor During my project, I was immensely helped by her and I am thankful to her for all the help she provided. I submit my gratitude to my parents for without their support nothing could be done.

Funding: Nil, Conflict of interest: None initiated.
Permission from IRB: Yes

References


1. Guyton, A.C., & Hall, J.E.(2000), text book of medical physiology(10th ed).

2. American Cancer Society. Breast Cancer Facts & Figs. American Cancer Society, Inc, 2013. Available online at: http://www.cancer.org/acs/groups/content/@epidemiologysurveilance/documents/document/acspc-036845.pdf. cited on 23th March 2016.

3. Bronzino, J. The biomedical engineering handbook (2nd ed., vol.1). US. 2000.CRC press.

4. H.P. Chan, B. Sahiner, K.L. Lam. Computerized analysis of mammographic microcalcifications in morphological and texture feature spaces. Med. Phys. vol -25, no. 10, 2007-2019.


5. N. Petrick, H. P. Chan, B. and D. Wei. An adaptive density weighted contrast enhancement filter for mammographic breast mass detection. IEEE Trans. Med. Image, vol. 15, pp. 59-67, 2000.


6. A. Sahakyan, H. Sarukhanyan. Segmentation of the breast region in digital mammograms and detection of masses”, International journal of advanced computer science and applications, pp.102, vol- 3, no. 2, 2012.


7. Warfield, S. K. Zou and K. H.Wells. Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation. IEEE Transactions on Medical Imaging.vol.23, no.7, July2004.


8. Yasmin, M. Sharif, and S. Mohsin. Survey Paper on Diagnosis of Breast Cancer Using Image Processing Technique. Research Journal Of Recent Sciences, vol. 2, no. 10, pp. 88-98, October 2013.


9. I.K. Maitra, S. Nag and S.K. Bandhopadhyay. Identification of abnormal masses in digital mammography images. International Journal Of Computer Graphics, vol. 2, no. 1, 2011.




How to cite this article?

Sharma N, Manjhi J, Detection of malignant tissue in mammography image using morphology based segmentation technique: Int J Med Res Rev 2016;4(4):635-640.doi: 10.17511/ijmrr.2016.i04.27.