Neural Nets

4. Neural Nets

Background and issues… Neural nets are based on the CAS of the brain; how the human mind functions. Can neural net technology improve health care decision making? Can neural net technology be used by normal clinicians and managers to improve day-to-day decision making?

Can neural net (NN) technology improve health care decision making?… The "Dooley criteria" for use of neural nets over other decision support methods... If health care decisions can be described as: (1) a finite number of categories that represent the "answer," (2) a complex set of characteristics that change across the categories, and (3) there exists a good amount of data to teach what belongs in the categories... then neural nets is a good approach. Neural nets are overkill if the pattern is linear... other traditional methods are better for linear situations. If the "answer" is a continuous variable rather than a category, non-linear statistical methods are just as good.

Suggested area where NNs might be useful: predicting readmission rates in CHF population, cardiology, cardiac surgery, OB.

We have examples of Dooley's three criteria…

Plsek post... A good illustration of Kevin's three conditions is the application of neural nets to the problem of recognizing handwriting. By analogy, we can extend this to the healthcare application of mammography reading.

(1) finite number of discrete "categories" that represent "the answer." In handwriting recognition there are only 26 letters, along with numbers and a few other special marks. Every character written has to be one of those. The question is "given the character in front of me, which one is it?"

So, in mamography reading are there a finite number of "answers" that one can give? A good indication of this would be if there exists automated dictation report generators for it. All the doc has to do is say a keyword and prewritten text appears on the report. I know that these exist for some areas of radiology and pathology.

(2) complex set of characteristics which change across categories. In handwriting, the details of just exactly how I make an "a" and how someone else makes an "a" can be amazingly complex and subtle. I may even have trouble making an "a" exactly the same twice. But, still, there is something recognizable about an "a" versus a "b"

In mamography reading, how much skill is involved in discerning the patterns that lead to a conclusion? Is it a pretty simple task; seen one seen 'em all? Or is there lots of nuance?

(3) good amount of data to teach it what belongs to each category. The neat thing about neural nets is that they are not "programmed" by an expert. They actually learn in much the same way that a child learns (or a med student!). In handwriting analysis, we can feed the neural net 1000 examples of what the letter "a" looks like when written by many different people. This is referred to as "training" the neural net.

In mamography reading, could someone assemble many different examples of each of the possible outcome-answers to be used as a training set for the neural net? An indicator of this might be to look at medical education in a given area. To what extent do we actually train young docs by basically giving them lots of examples and telling them when they get it right and when wrong?

Steve Larned post... I mentioned at our Princeton meeting that we have met with George Davis and Walter Lowell of the Augusta Mental Health Institute (AMHI), who have used ANNs to model psychiatric LOS. This problem, and many clinical problems, meet Kevin's criteria:

1. finite number of discrete categories representing the "answer" or "output" (in their example, LOS in the state hospital) 2. complex set of characteristics which change across categories (in their example diagnostic categories, medication on admission, treatment in the hospital, medication in the hospital, admitting physician, community to which pt discharged, discharge medication, family milieu therapy, etc.) 3. sufficient data (in their example, the entire database pertaining to patients admitted at AMHI in recent years).

And we also have more explanation of NN technology…

Dooley post... Let me be more specific. A NN is a classic "black box". It takes numerical inputs and turns them into a single (typically) numerical output. People are often wowed by such magic. Like other black boxes though, it suffers from two typical problems: you cannot figure out WHY it works (if it does), and if you put the "wrong" data in, you can't get a good answer out (garbage in, garbage out).

The input to the NN is called the feature vector. The choice of features that you choose is paramount--and this task is often shortchanged. You can think of the output as a diagnosis.

Before I get into mechanics, let me clarify the "numbers" issue: even though the output of a NN is a number, NN's in general are not nearly as good at predicting numerical outcomes as they are at predicting classification-type outcomes. If a NN is used for numerical prediction, one is essentially doing nonlinear regression (and one would be better off doing that explicitly).

Let me give three parallel examples:

Outputs/Diagnoses/Predictions: Case A--Does the patient have cancer or not? Case B--Should a low, medium, or high dosage of medicine be given? Case C--Is this EEG "normal"?

As I said, the output of the NN is a number, between 0 and 1. We need a simple interpretive filter that changes the number into a category. Call the number Y:

Case A--If Y<0.5, cancerous; if Y>0.5, not. Case B--If Y<0.33, low; .33<.66 medium, otherwise high Case C--If Y<0.5, normal; if Y>0.5, abnormal

This is what I meant by "finite number of discrete categories".

Now onto the feature vector. These must be numbers also. You can have a lot of features. But the more you use, the longer you have to train the NN.

For case A, what I could do is take an X-ray and represent it as a rectangular grid of pixels. Each pixel would then be assigned a "gray scale" number representing the particular shade: 0 would be completely white, and 1 would be completely black, and numbers in-between would represent different shades of grayness. A 12"x12" x-ray may have thousands of pixels. Thus, the feature vector is very long. NN tend to be very good at such "picture recognition".

For case B, I would have to use my expertise to come up with "features" that matter. Let's say I have (a) white blood cell count, (b) results from yes/no test A, (c) temperature, (d & e) blood pressure. Obviously (a), (c), and (d,e) are numbers already. They simply need to be scaled so that they range between 0 and 1 (no problem). Let (b) be 0 if the test is negative, and 1 if the test is positive. Our feature vector is size 5.

For case C, I have a time series of data. What I'll need to do is somehow characterize that into relevant features. I might take as features (a) the mean, (b) the std. deviation, (c) kurtosis, (d1-d10) the first 10 values of the auto-correlation function, (e) the value of the Lyapunov exponent, (f1-64) the values of a 64-resolution power spectrum, etc. My feature vector would be quite large.

Obviously, in each case, the features allow me to predict the outcome. When I said "wide range of features that vary across outcome", what I meant was (a) if they don't vary, then outputs cannot be associated with inputs, and (b) if their relationship is simple rather than complex, a NN is overkill.

The final step is training. I need to feed the NN several hundred cases in order for it to learn how to predict. Each case in a feature vector and the corresponding "right answer" (output). After it learns, then I can feed it a feature vector (a new case) and it will predict the answer.

Examples of Neural Net Technology Use to Improve Healthcare Decision Making… This is a topic that is already being explored in healthcare. There are several examples. Our role might be simply to review the literature here. Some examples and published papers are cited below. Could probably get more with a literature search.

As mentioned in Steve Larned's post above, The Augusta Mental Health Institute has used NNs to model psychiatric LOS.

Mammogram reading (company: Natural Selections reports that ROC area under curve was >90%; good for a diagnostic test). Reference: Fogel DB, Wasson EC, Boughton EM, Porto VW. A step toward computer-assisted mamography using evolutionary programming and neural networks. Cancer Letters 119(1997), 93-97.

Locating cardiac hypertrophy on the basis of ECG information alone. This is difficult for expert human diagnosticians to do. Reported results are an "Overall diagnostic accuracy rate of about 70%… We consider the neural network superior both to computer-based electrocardiography and to a well-trained cardiologist." Reference: Ouyang N, Yamauchi K. Using a neural network to diagnose the hypertrophic portions of hypertrophic cardiomyopathy. MD Computing 15(2), 1998, 106-109.

Determining objective diagnostic probabilities of insulin-dependent diabetes mellitus. Reported a 9% better prediction accuracy when compared to classical statistical methods (regression). Notes "…our findings corroborate other research showing that neural networks perform better than classical statistical methods." (5 references given to support this) Reference: Armoni A. Use of neural networks in medical diagnosis. MD Computing. 15(2), 1998, 100-104.

Some aspect of COPD or diabetes (company: MedAI, subsidiary of Adventist Health; demonstrated NN was better than traditional multiple regression at predicting patients at risk for hospitalization; Al Herzog and Jacquie Byers getting more info). This may be the same info as the two articles cited previously.

Outside healthcare… There is a story on CNN about a genetic algorithm based computer system at the Vovlo truck plant in Dublin, Virginia. The web version of the report is on the CNN website (where you can also download the entire 2 minute, 30 second report on VXtreme video). Here is the URL for the story: http://www.cnn.com/TECH/science/9802/26/t_t/artificial.intelligence/
(Note: if you're typing that in by hand, there is an underscore character between the two t's as in ....26/t_t/artificial.....)

Holland's new book has good material on NNs...

Potential Demonstration Project… Steve Larned at Maine Medical Center has purchased NN software called "BrainMaker"... will make available to researchers in cardiology, cardiac surgery, OB... hope to build clinical "flight simulators" to explore non-linear connections between observed clinical end-points and factors in clinical data bases. They also hope to involve Gerry O'Connor and the Northern New England Cardiovascular Study Group in this project.

Ultimate Goal of This Line of Thinking… Improved clinical diagnostic and "prediction" tools based on NN technology; leading to better care and better outcomes for patients

.

Next | Previous | Edge-TQM Contents | Applications Contents