|
Artificial Neural Networks and the ![]() Abstract of the study in which this neural network was used Background: Ovarian dysplasia has been described in the ovarian surface epithelium by histologic and morphometric studies. This study evaluates ovarian dysplasia in epithelial inclusion cysts both adjacent to overt carcinoma and also incidentally found in ovaries removed for non-neoplastic diseases including oophorectomies for family history of ovarian cancer, using an artificial neural network. Methods: Histologic sections from 37 ovaries of which 26 were diagnosed with dysplasia in epithelial inclusion cysts (10 adjacent to carcinoma and 16 incidental) and 11 with benign epithelial inclusion cysts were evaluated by tracing nuclear profiles and assessing measures of nuclear area, shape and texture. These were analyzed using artificial neural networks and also statistically using the Kruskal-Wallis test with the Dunn procedure to compare the morphologic similarity of incidental and adjacent dysplasia. Results: Neither statistical nor artificial neural network analysis was able to distinguish between incidental and adjacent dysplasia. Both types differed significantly from the control cases. Conclusions: Neural networks are powerful classification tools
when applied to multiple Deligdisch L, Einstein AJ, Guera D, Gil J. Ovarian dysplasia in epithelial inclusion cysts: a morphometric approach using neural networks. Cancer 1995; 76: 1027-1034. CANCER Vol. 76, No. 6, 1995, pp. 1027-1034. Copyright © 1995 American Cancer Society. Reprinted by permission of Wiley-Liss, Inc., a subsidiary of John Wiley & Sons, Inc. This work was supported in part by a traineeship on NIH MSTP Training Grant GM 7280. An overview of neural networks:
The architecture of a typical backpropagation network for use in morphometry is illustrated in the figure below. As is shown, input neurons represent morphometric features, numerical measures of histologic features such as nuclear size and shape, chromatin appearance, and tissue architecture, while output neurons code for the diagnosis. The values taken by output variables are generally constrained to the interval [0, 1]; thus, e.g., a value near one for the "Malignant" neuron is identified with a diagnosis of malignancy.
The classificatory power of neural networks is contained in the weights and biases, known together as the connection matrix. Determination of the optimal connection matrix for a particular set of training facts is a computationally difficult problem; for even very simple networks, the problem has been mathematically demonstrated to be NP-complete (not computable in polynomial time). The approach generally used in ANNs is an adaptive one, in which the weights and biases are "learned" over repeated iterations through the training set. The connection matrix is initially set at random, and the values of hidden and output neurons are computed for a feature vector. Output neuron values are compared with their target values, e.g., (0,1) for malignancy in the example. If the difference between the output value and its target value, known as the error factor, is less than a specified training tolerance, the case represented by the parameter value is regarded to be correctly classified, and the next case is considered. If the error factor is too large, then the connection matrix is modified using some learning rule, most typically the generalized delta rule or a modification thereof. These learning rules reflect factors such as the transfer function, neuron values, error factors, weights, and biases. Initially, output biases and connection weights between output neurons and the last layer of hidden neurons are adjusted. These changes are then backpropagated, one layer at a time, until the whole connection matrix has been modified. Cycling through the training set continues until all cases are correctly classified or, barring this, some other specified stopping point. Using the final connection matrix, a new set of inputs can be used to predict output values. Numerous factors are incorporated into the design of a neural network, and these may affect its training and ability to generalize. Such factors include network topology, noise, order of training facts, choice of transfer function, initial weights, range specification, learning rate, training tolerance, testing tolerance, and the training set. Adapted from: This work was supported in part by a traineeship on NIH MSTP Training Grant GM 7280 and by a Hans Elias Bursary. |