Daniel W McKee
Assistant Professor
Computer Information Science

Home
Office Hours
Curriculum Vitae
Courses
Publications
Astronomy
CIS Dept Web
Publications

2006

Using Computational Intelligence for Computer-Aided Diagnosis of Screen Film Mammograms. (book chapter)

SPIE MI 2006. An adaptive image segmentation process for the classification of lung biopsy images.

2005

CIT 2005. Emerging Technology in Teaching Computational Intelligence for Engineers and Computer Scientists via Distance Learning.

ASEE St. Lawrence Section Conference. Teaching Computational Intelligence for Engineers in the 21st Century via Distance Learning.

2004

SPIE MI 2004. Breast cancer classification improvements using a new kernel function with evolutionary-programming-configured Support Vector Machines.

SPIE MI 2004. New results in Computer Aided Diagnosis (CAD) of breast cancer using a recently developed SVM/GRNN Oracle hybrid.

2003

IEEE CSMC 2003. Breast Cancer Computer Aided Diagnosis (CAD) Using a Recently Developed SVM/GRNN Oracle Hybrid.

SPIE MI 2003. Improving the predictive value of mammography using a specialized evolutionary programming hybrid and fitness functions.

SPIE MI 2003. Application of Support Vector Machines to breast cancer screening using mammogram and clinical history data.

2002

ANNIE '02. Improving mammogram screening using a bank of support vector machines (SVMs).

IEEE CEC 2002. Performance tradeoff between evolutionary computation (EC)/adaptive boosting (AB) hybrid and support vector machine breast cancer classification paradigms.

2001

ISAS 2001. Performance analysis of evolutionary computation (EC)/adaptive boosting (AB) hybrids for breast cancer classification.

Master's Thesis. Boosting evolved artificial neural networks to improve breast cancer classification accuracy.

IEEE SMCia 2001. New results in breast cancer classification obtained from an evolutionary computation/adaptive boosting hybrid using mammogram and history data. Runner up for Best Paper Award.

IEEE CEC 2001. Application of evolutionary computation and neural network hybrids for breast cancer classification using mammogram and history data.

SPIE MI 2001. Application of adaptive boosting to EP-derived multi-layer feedforward neural networks (MLFNs) to improve benign/malignant breast cancer classification.

2000

SCI2000. Using evolutionary computation to develop neural network breast cancer benign/malignant classification models.

Paper Details

  • Using Computational Intelligence for Computer-Aided Diagnosis of Screen Film Mammograms. (book chapter)

    • Land, Walker H., Jr., McKee, Daniel W., Anderson, Frances R., Masters, Timothy, Lo, Joseph Y., Embrechts, Mark and Heine John. “Using Computational Intelligence for Computer-Aided Diagnosis of Screen Film Mammograms.” (book chapter) In Suri, Jasjit S. and Rangayyan, Rangaraj M., eds. Recent Advances in Mammography, Breast Imaging, and Computer-aided Diagnosis of Breast Cancer. SPIE Press, 2006.

    • This chapter describes paradigms and hybrids applied to the Computer Aided Diagnosis (CAD) of breast cancer, using mammogram screen film data. The chapter is organized into the following sections:

      • Evolutionary Programming (EP) / Evolutionary Strategies (ES) family of neural networks (NNs), derived and evaluated using statistical cross validation techniques with results measured by ROC analysis.

      • EP/ Adaptive Boosting (AB) hybrid derived family of NNs, with CAD results.

      • EP approach which evolves not only NN parameters, but also the architecture, with CAD results.

      • EP derived Support Vector Machines (SVMs), with CAD results.

      • SVM/GRNN oracle hybrid formulation, including the development of the Probabilistic Neural network (PNN) and the Generalized Regression Neural Network (GRNN), with CAD performance results.

      • Partial Least Squares (PLS) and Kernel-PLS (K-PLS), with CAD performance results.

      • Modular Artificial Intelligence (AI) system design and the use of Knowledge Engineering (KE) and Knowledge Representation (KR), with CAD performance results.
         

  • SPIE MI 2006. An adaptive image segmentation process for the classification of lung biopsy images.

    • McKee, D. W., Land, W. H., Jr., Zhukov, T., Song, D., and Qian, W. “An adaptive image segmentation process for the classification of lung biopsy images.” SPIE Medical Imaging 2006, San Diego, CA, 11-16 February, 2006.

    • Abstract - The purpose of this study was to develop a computer-based second opinion diagnostic tool that could read microscope images of lung tissue and classify the tissue sample as normal or cancerous. This problem can be broken down into three areas: segmentation, feature extraction and measurement, and classification. We introduce a kernel-based extension of fuzzy c-means to provide a coarse initial segmentation, with heuristically-based mechanisms to improve the accuracy of the segmentation. The segmented image is then processed to extract and quantify features. Finally, the measured features are used by a Support Vector Machine (SVM) to classify the tissue sample. The performance of this approach was tested using a database of 85 images collected at the Moffitt Cancer Center and Research Institute. These images represent a wide variety of normal lung tissue samples, as well as multiple types of lung cancer. When used with a subset of the data containing images from the normal and adenocarcinoma classes, we were able to correctly classify 78% of the images, with a ROC AZ of 0.758.
       

  • CIT 2005. Emerging Technology in Teaching Computational Intelligence for Engineers and Computer Scientists via Distance Learning.

    • Land, W. H., Jr., and McKee, D. W. “Emerging Technology in Teaching Computational Intelligence for Engineers and Computer Scientists via Distance Learning.” SUNY Conference on Instructional Technology, CIT 2005, Binghamton, NY, 23-26 May, 2005.

    • Abstract - The presenter shows the practical benefits of teaching computational intelligence technology by distance learning. Specifically, engineers, computer scientists, medical personnel as well as some of the Liberal Arts disciplines will be using computers driven by intelligent software as a matter of course in the 21st century. This assertion is currently supported by the fact that computer aided diagnostic (CAD) intelligent software is currently being developed as a second opinion diagnostic aid for breast and lung cancer, in addition to being used for diagnosing heart abnormalities and prostate cancer. In addition, intelligent computer detection and classification software has been and is being developed for the detection of deadly bio terrorism nerve agents, such as organophosphates. The primary objective of this paper is to describe how distance learning is currently being employed to teach Support Vector Machine (SVM) concepts as a mechanism to develop these intelligent software packages. In addition a 25 minute movie titled 'Topics in Computational Intelligence' will be shown. It demonstrates the proficiency with which graduate and upper level undergraduate students' learn this CAD technology in the distance learning environment, by reporting on research projects in which they developed a CAD tool to diagnose breast cancer from a mammogram screen film data set using SVM technology. This film was previously shown twice: first as part of an invited paper on teaching Evolutionary Computational Concepts at the 2002 International World Congress for Computational Intelligence, and secondly as a paper for the 2005 St. Lawrence conference. No additional equipment beyond a screen and a connection to a laptop computer will be required to view the film. In this film, the students demonstrate their understanding of SVM theory, its application, development and testing as applied to the development of CAD diagnostic software packages.
       

  • ASEE St. Lawrence Section Conference. Teaching Computational Intelligence for Engineers in the 21st Century via Distance Learning.

    • Land, W. H., Jr., and McKee, D. W. “Teaching Computational Intelligence for Engineers in the 21st Century via Distance Learning.” American Society for Engineering Education (ASEE) St. Lawrence Section Conference, Binghamton. NY, 8-9 April, 2005.

    • Abstract - Engineers, computer scientists, medical personnel as well as some of the Liberal Arts disciplines will be using computers driven by intelligent software as a matter of course in 2020. This assertion is currently supported by the fact that computer aided diagnostic (CAD) intelligent software is currently being developed as a second opinion diagnostic aids for breast and lung cancer in addition to being used for diagnosing heart abnormalities and prostate cancer. In addition, intelligent computer detection and classification software has been and is being developed for the detection of deadly bio terrorism nerve agents, such as organophosphates.

      The primary objective of this paper is to describe how distance learning is currently being employed to teach Support Vector Machine (SVM) concepts as a mechanism to develop these intelligent software packages. This presentation also contains a 25min movie titled "Topics in Computational Intelligence" that demonstrates the proficiency with which graduate and upper level undergraduate students' learn this CAD technology in the distance learning environment, by reporting on research projects in which they developed a CAD tool to diagnose breast cancer from a mammogram screen film data set using SVM technology. This film was first shown as part of an invited paper on teaching Evolutionary Computational Concepts at the 2002 International World Congress for Computational Intelligence. No additional equipment beyond a screen and a connection to a laptop computer will be required to view the film. In this film, the students demonstrate their understanding of SVM theory, its application, development and testing as applied to the development of CAD diagnostic software packages.
       

  • SPIE MI 2004. Breast cancer classification improvements using a new kernel function with evolutionary-programming-configured Support Vector Machines.

    • Land, W. H., Jr., McKee, D. W., Anderson, F. R., and Lo, J. Y. "Breast cancer classification improvements using a new kernel function with evolutionary-programming-configured Support Vector Machines." SPIE Medical Imaging 2004, San Diego, CA, 14-19 February, 2004.

    • Abstract - Mammography is an effective tool for the early detection of breast cancer; however, most women referred for biopsy based on mammographic findings do not, in fact, have cancer. This study is part of an ongoing effort to reduce the number of benign cases referred for biopsy by developing tools to aid physicians in classifying suspicious lesions. Specifically, this study examines the use of an Evolutionary Programming (EP)-derived Support Vector Machine (SVM) with a modified radial basis function (RBF) kernel, and compares this with results using a normal Gaussian radial basis function kernel. Results demonstrate that the modified kernel can provide moderate performance improvements; however, due to its ability to create a more complex decision surface, this kernel can easily begin to memorize the training data resulting in a loss of generalization ability. Nonetheless, these methods could reduce the number of benign cases referred for biopsy by over half, while missing less than 5% of malignancies. Future work will focus on methods to improve the EP process to preserve SVMs which generalize well.

     

  • SPIE MI 2004. New results in Computer Aided Diagnosis (CAD) of breast cancer using a recently developed SVM/GRNN Oracle hybrid.

    • Land, W. H., Jr., Wong, L, McKee, D., Masters, T., Anderson, F., Raturi, A. and Lo, J. Y. "New results in Computer Aided Diagnosis (CAD) of breast cancer using a recently developed SVM/GRNN Oracle hybrid." SPIE Medical Imaging 2004, San Diego, CA, 14-19 February, 2004.

    • Abstract - Breast cancer is second only to lung cancer as a tumor-related cause of death in women. Currently, the method of choice for the early detection of breast cancer is mammography. While sensitive to the detection of non palpable breast lesions, its positive predictive value (PPV) is low, resulting in biopsies that are only 15%-34% likely to reveal malignancy. This paper explores the use of a recently designed Support Vector Machine (SVM)/Generalized Regression Neural Network (GRNN) Oracle hybrid to classify breast lesions and evaluate the software’s performance as an interpretive aid to radiologists. The main objective of the research was to perform an independent analysis, using a new, integrated film screen mammogram data base of approximately 2500 cases from five separate institutions, to verify results obtained previously.  This study demonstrated the following:

      • The DE crossover constant has little, if any,  effect on measures of performance (MOP).

      • A specificity of approximately 5.6% is achieved at 100% sensitivity, which increases to approximately 36% at 95% sensitivity.

      • PPV increases from 51% to 56% as sensitivity is decreased from 100 to 95%, respectively.

  • Breast Cancer Computer Aided Diagnosis (CAD) Using a Recently Developed SVM/GRNN Oracle Hybrid.

    • Land, W. H., Jr., Wong, L., McKee, D. W., Masters, T., and Anderson, F. R. "Breast Cancer Computer Aided Diagnosis (CAD) Using a Recently Developed SVM/GRNN Oracle Hybrid." 2003 IEEE International Conference on Systems, Man & Cybernetics, Washington, DC, 5-8 October, 2003.

    • Abstract - Carcinoma of the breast is second only to lung cancer as a tumor-related cause of death in women. For 2003, it has been reported that 211,300 new cases and 39,800 deaths will occur just in the US. It has been proposed, however, that the mortality from breast cancer could be decreased by up to 25% if all women in appropriate age groups were screened regularly. Currently, the method of choice for the early detection of breast cancer is mammography, due to its general widespread availability, low cost, speed, and non-invasiveness. At the same time, while mammography is sensitive to the detection of breast cancer, its positive predictive value (PPV) is low, resulting in costly, invasive biopsies that are only 15%-34% likely to reveal malignancy at histological examination. This paper explores the use of a newly designed Support Vector Machine (SVM) / Generalized Regression Neural Network (GRNN) Oracle hybrid and evaluates its performance as an interpretive aid to radiologists. The authors demonstrate that this hybrid has the potential to (1) improve both the specificity and PPV of screen film mammography at 95-100% sensitivity, and (2) consistently produce partial AZ values (defined as average specificity over the top 10% of the ROC curve) of greater than 50%, using a data set of ~2000 lesions from four different hospitals. As expected, initial experiments demonstrated that combining age and mass margin (AgeMM) provided the most accurate diagnostic performance. Secondly, the value of the crossover constant, CR = 0.6, provided the best AZ, while CR =0.8 resulted in the most accurate partial AZ, Specificity, and PPV at the lower sensitivities. Finally, the results of the GRNN oracle output were essentially the same those of the SVM suggesting that the SVMs, as anticipated, had optimized the diagnostic performance. Practically, this means that at 100% sensitivity (which means no cancers lesions are misdiagnosed) and using a crossover constant of 0.8, approximately 454 biopsies would be avoided using this SVM/GRNN oracle diagnostic aid when compared to the circumstance where all 1979 samples were biopsied. (Note that the distribution of benign/malignant samples were about 50/50 for this data set.)

     
  • Improving the predictive value of mammography using a specialized evolutionary programming hybrid and fitness functions.

    • Land, W. H., McKee, D. W., Lo, J. Y., and Anderson, F. R. "Improving the predictive value of mammography using a specialized evolutionary programming hybrid and fitness functions." SPIE Medical Imaging 2003, San Diego, CA, 15-20 February, 2003.

    • Abstract - Mammography is an effective tool for the early detection of breast cancer; however, most women referred for biopsy based on mammographic findings do not, have cancer. This study is part of an ongoing effort to reduce the number of benign cases referred for biopsy by developing tools to aid physicians in classifying suspicious lesions. Specifically, this study examines the use of an Evolutionary Programming (EP)/Adaptive Boosting (AB) hybrid, specifically modified to focus on improving the performance of computer-assisted diagnostic (CAD) tools at high specificity levels (missing few or no cancers). An EP/AB hybrid developed by the authors and used in previous studies was modified with two new fitness functions: 1) a function which favored networks with high PPV values at thresholds corresponding to high sensitivities, and 2) a function which favored networks with the highest partial ROC Az (normalized area above 90% sensitivity). The modified hybrid with specialized fitness functions was evaluated using k-fold cross-validation against two real-word mammogram data sets. Results indicate that the number of benign cases referred for biopsy might be reduced by over a third, while missing no cancers. If sensitivity is allowed to decrease to 97% (missing 3% of the cancers), the number of spared biopsies could be raised to over half.
       

  • Application of Support Vector Machines to breast cancer screening using mammogram and clinical history data.

    • Land, W. H., McKee, D. W., Velazquez, R., Wong, L., Lo, J. Y., and Anderson, F. R. "Application of Support Vector Machines to breast cancer screening using mammogram and clinical history data." SPIE Medical Imaging 2003, San Diego, CA, 15-20 February, 2003.

    • Abstract - The objectives of this paper are to discuss: (1) the development and testing of a new Evolutionary Programming (EP) method to optimally configure Support Vector Machine (SVM) parameters for facilitating the diagnosis of breast cancer; (2) evaluation of EP derived learning machines when the number of BI-RADS(TM) and clinical history discriminators are reduced from 16 to 7; (3) establishing system performance for several SVM kernels in addition to the EP/Adaptive Boosting (EP/AB) hybrid using the Digital Database for Screening Mammography, University of South Florida (DDSM USF) and Duke data sets; and (4) obtaining a preliminary evaluation of the measurement of SVM learning machine inter-institutional generalization capability using BI-RADSTM data.  Measuring performance of the SVM designs and EP/AB hybrid against these objectives will provide quantitative evidence that the software packages described can generalize to larger patient data sets from different institutions.  Most iterative methods currently in use to optimize learning machine parameters are time consuming processes, which sometimes yield sub-optimal values resulting in performance degradation.  SVMs are new machine intelligence paradigms, which use the Structural Risk Minimization (SRM) concept to develop learning machines.  These learning machines can always be trained to provide global minima, given that the machine parameters are optimally computed.  In addition, several system performance studies are described which include EP derived SVM performance as a function of: (a) population and generation size as well as a method for generating initial populations and (b) iteratively derived versus EP derived learning machine parameters.  Finally, the authors describe a set of experiments providing preliminary evidence that both the EP/AB hybrid and SVM Computer Aided Diagnostic C++ software packages will work across a large population of patients, based on a data set of approximately 2,500 samples from five different institutions.
       

  • Improving mammogram screening using a bank of support vector machines (SVMs).

    • Land, W. H., Jr., McKee, D. W., Lo, J. Y., & Anderson, F. R. "Improving mammogram screening using a bank of support vector machines (SVMs)." Artificial Neural Networks in Engineering (ANNIE ’02).  St. Louis, MO, 10-13 November, 2002.

    • Abstract - The focus of this study was to build and evaluate a new bank of SVM designs to address the problem of high false positives that currently results from mammogram screening,. The basis of the design is to partition the BIRADS variables into three separate categories based on our understanding of the discriminating information contained in the mammogram BIRADSTM findings. That is, after ascertaining the presence of a suspicious finding on a mammogram that would be recommended for biopsy, the radiologist documents the BIRADS lesion descriptor values. This information, along with the clinical history, would be used as input to this new bank of SVMs an aid to the physician for improving the specificity and positive predictive value PPV of the benign/malignant diagnosis task. Comparing the new SVM mass classifier with the previously configured single SVM that used all data base inputs provided significant classification accuracy improvements for all performance measures. That is, overall Az improved by 11.6%, specificity and PPV improved by 110.6% and 31.6%, respectively, at 100% sensitivity (missing no cancers), while specificity and PPV improved by 54% and 35.9%,respectively, at 95% sensitivity (missing 5% of the cancers).
       

  • Performance tradeoff between evolutionary computation (EC)/adaptive boosting (AB) hybrid and support vector machine breast cancer classification paradigms.

    • Land, W. H., Jr., Bryden, M., Lo, J. Y., McKee, D. W., & Anderson, F. R. "Performance tradeoff between evolutionary computation (EC)/adaptive boosting (AB) hybrid and support vector machine breast cancer classification paradigms."  IEEE Congress on Evolutionary Computation, Honolulu, HI, 12-17 May, 2002.

    • Abstract - This paper describes a breast cancer classification performance trade-off analysis using two computational intelligence paradigms. The first, an Evolutionary Programming (EP) / Adaptive Boosting (AB) based hybrid, intelligently combines the outputs from an iteratively “called” weak learning algorithm (one which performs at least slightly better than random guessing) in order to “boost” the performance of an EP-derived weak learner. The second paradigm is Support Vector Machines (SVMs). SVMs are new and radically different types of classifiers and learning machines that use a hypothesis space of linear functions in a high dimensional feature space. The most important advantage of a SVM, unlike neural networks, is that SVM training always finds a global minimum. Furthermore, the SVM has inherent ability to solve pattern classification without incorporating any problem-domain knowledge. In this study, the both the EP/AB hybrid and SVM were employed as pattern classifiers, operating on mammography data used for breast cancer detection. The main focus of the study was to construct and seek the best EP/AB hybrid and SVM configurations for optimum specificity and positive predictive value at very high sensitivities. Using a mammogram database of 500 biopsy proven samples, the best performing SVM, on average, was able to achieve (under statistical 5-fold cross-validation) a specificity of 45.0% and a positive predictive value (PPV) of 50.1% at 100% sensitivity. At 97% sensitivity, a specificity of 55.8% and a PPV of 55.2% were obtained. The best performing EP/AB hybrid obtained slightly lower, but comparable, results.
       

  • Performance analysis of evolutionary computation (EC)/adaptive boosting (AB) hybrids for breast cancer classification.

    • Land, W. H., Jr., Masters, T., Lo, J. Y., McKee, D. W., & Anderson, F. R. “Performance analysis of evolutionary computation (EC)/adaptive boosting (AB) hybrids for breast cancer classification.”  6th International conference on Information Systems Analysis and Synthesis (ISAS2001), Orlando, FL, 22-25 July, 2001.

    • Abstract - A new neural network technology was developed to improve the diagnosis of breast cancer using mammogram findings. The paradigm, Adaptive Boosting (AB), focuses on finding weak learning algorithm(s) that initially need to provide slightly better than “random” performance (i.e., approximately 55%) when processing a mammogram training set. By successive development of additional architectures (using the mammogram training set), the adaptive boosting process improves performance of the basic Evolutionary Programming derived neural network architectures. The results of these several EP-derived hybrid architectures are then intelligently combined and tested using a similar validation mammogram data set. Because we were particularly interested in maximizing positive predictive value (PPV) and specificity at high sensitivity levels, the error-minimization selection method used by the EP component was replaced with a selection method favoring those solutions with the best predictive value. Preliminary results using the new fitness function were then compared with optimized results using the original fitness function. Preliminary results using the Duke University mammogram database of 500 biopsy proven samples show that this PPV-based hybrid was able to achieve (under statistical 5-fold cross validation), on average, PPV and specificity results comparable to the best results obtained using the error-minimization hybrid.
       

  • Boosting evolved artificial neural networks to improve breast cancer classification accuracy.

    • McKee, D. W. Boosting evolved artificial neural networks to improve breast cancer classification accuracy. Master's Thesis, State University of New York, Binghamton, NY, 2001.

    • Abstract - Two parallel models were developed to predict malignancy based upon mammogram data. Both used evolutionary computation to develop multi-layer feedforward networks, which were later used as the basis for a new adaptive boosting algorithm. Because good performance at very high sensitivity levels (missing few cancers) is more clinically relevant than overall performance, a model was developed specifically to enhance positive predictive value at high sensitivities. The performance of this model, with and without boosting, was compared to the performance of a more traditional error-minimization model. These results of these models were also compared to the results of previous studies that used the same data.

      This study found that boosted evolved networks can improve the predictive value of mammography by as much as 49%, and that the evolutionary and boosting processes can be modified to specifically enhance that predictive value.
       

  • New results in breast cancer classification obtained from an evolutionary computation/adaptive boosting hybrid using mammogram and history data.

    • Land, W. H., Masters, T., Lo, J. Y., McKee, D. W., & Anderson, F. R. “New results in breast cancer classification obtained from an evolutionary computation/adaptive boosting hybrid using mammogram and history data.” IEEE Mountain Workshop on Soft Computing in Industrial Applications, Blackburg, VA, 25-27 June, 2001.

    • Abstract - A new neural network technology was developed to improve the diagnosis of breast cancer using mammogram findings. The paradigm, Adaptive Boosting (AB), uses a markedly different theory in solving Computational Intelligence (CI) problems. AB, a new machine learning paradigm, focuses on finding weak learning algorithm(s) that initially need to provide slightly better than “random” performance (i.e., approximately 55%) when processing a mammogram training set. By successive development of additional architectures (using the mammogram training set), the adaptive boosting process improves performance of the basic Evolutionary Programming derived neural network architectures. The results of these several EP-derived hybrid architectures are then intelligently combined and tested using a similar validation mammogram data set. Optimization, focused on improving specificity and positive predictive value at very high sensitivities, with an analysis of the performance of the hybrid would be most meaningful. Using the DUKE mammogram database of 500 biopsy proven samples, this hybrid, on average, was able to achieve (under statistical 5-fold cross-validation) a specificity of 48.3% and a positive predictive value (PPV) of 51.8% while maintaining 100% sensitivity. At 97% sensitivity, a specificity of 56.6% and a PPV of 55.8% were obtained.
       

  • Application of evolutionary computation and neural network hybrids for breast cancer classification using mammogram and history data.

    • Land, W. H., Jr., Masters, T., Lo, J. Y., and McKee, D. W.  “Application of evolutionary computation and neural network hybrids for breast cancer classification using mammogram and history data.”  IEEE Congress on Evolutionary Computation, Seoul, Korea, 27-29 May 2001.

    • Abstract - Mammography is the modality of choice for early detection of breast cancer, primarily because of its sensitivity. However, mammography has low predictive power, resulting in the biopsy of a large number of benign lesions. This paper explores the use and evaluation of two neural network hybrids as an aid to clinicians. These hybrids have the potential for improving both the sensitivity and specificity of breast cancer screening. The first hybrid, the Generalized Regression Neural Network (GRNN) Oracle, improves the performance output of a set of learning algorithms that operate accurately over the entire (defined) learning space. The second hybrid, an Evolutionary Programming/Adaptive Boosting (EP/AB) based hybrid, intelligently combines the outputs from an iteratively “weak” learning algorithm to “boost” the performance of the weak learner. The second part of this paper discusses modifications to improve the EP/AB hybrid’s performance to show how the use of the EP/AB hybrid is superior to use of an EP only classification system for breast cancer screening.
       

  • Application of adaptive boosting to EP-derived multi-layer feedforward neural networks (MLFNs) to improve benign/malignant breast cancer classification.

    • Land, W. H., Jr., Masters, T., Lo, J. Y., and McKee, D. W.  “Application of adaptive boosting to EP-derived multi-layer feedforward neural networks (MLFNs) to improve benign/malignant breast cancer classification.”  SPIE Medical Imaging Conference, San Diego, CA, 17-23 February 2001.

    • Abstract - A new neural network technology was developed for improving the benign/malignant diagnosis of breast cancer using mammogram findings. A new paradigm, Adaptive Boosting (AB), uses a markedly different theory in solutioning Computational Intelligence (CI) problems. AB, a new machine learning paradigm, focuses on finding weak learning algorithm(s) that initially need to provide slightly better than “random” performance (i.e., approximately 55%) when processing a mammogram training set. Then, by successive development of additional architectures (using the mammogram training set), the adaptive boosting process improves the performance of the basic Evolutionary Programming derived neural network architectures. The results of these several EP-derived hybrid architectures are then intelligently combined and tested using a similar validation mammogram data set. Optimization focused on improving specificity and positive predictive value at very high sensitivities, where an analysis of the performance of the hybrid would be most meaningful. Using the DUKE mammogram database of 500 biopsy proven samples, on average this hybrid was able to achieve (under statistical 5-fold cross-validation) a specificity of 48.3% and a positive predictive value (PPV) of 51.8% while maintaining 100% sensitivity. At 97% sensitivity, a specificity of 56.6% and a PPV of 55.8% were obtained.
       

  • Using evolutionary computation to develop neural network breast cancer benign/malignant classification models.

    • Land, W. H., Jr., Masters, T., Lo, J. Y., and McKee, D.  “Using evolutionary computation to develop neural network breast cancer benign/malignant classification models.” 4th World Conference on Systemics, Cybernetics and Informatics (SCI2000), 10:343-347.

    • Abstract - An Evolutionary Programming (EP) benign/malignant breast cancer neural network classification model was developed and investigated which predicts the outcome of mammography-induced breast biopsy. By combining input variables consisting of mammography lesion descriptors and patient history data, the EP derived neural network predicted whether the lesion was benign or malignant. Computer Aided Diagnostic (CAD) tools such as this model may aid in reducing the number of unnecessary biopsies and thus the cost of mammography screening for breast cancer. The EP process was optimized using a data set of 500 biopsy-proven cases from Duke University Medical Center. Results showed that the best EP derived neural network classifier provided an ROC Az index of 0.843 ± 0.053 when averaging performance over 5-fold cross-validation statistical experiments.

 

Mansfield University
©2006 Mansfield University of Pennsylvania
Last Reviewed on  06/18/2008 .