-
2006-05-09
10/774,024
2004-02-06
US 7,043,401 B2
2006-05-09
-
-
Michael Nghiem | Tung Lau
2024-04-04
A process involves collecting data relating to a particular condition and parsing the data from an original set of variables into subsets. For each subset defined, Mahalanobis distances are computed for known normal and abnormal values and the square root of these Mahalanobis distances is computed. A multiple Mahalanobis distance is calculated based upon the square root of Mahalanobis distances. Signal to noise ratios are obtained for each run of an orthogonal array in order to identify important subsets. This process has applications in identifying important variables or combinations thereof from a large number of potential contributors to a condition. The multidimensional system is robust and performs predictive data analysis well even when there are incidences of multi-collinearity and variables with zero standard deviations in reference group or unit space. Separate methods are provided: adjoint matrix Gram-Schmidt's method for multi-collinearity problems, and modified Gram-Schmidt method for the cases where there are variables with zero standard deviation to achieve data analysis.
Get notified when new applications in this technology area are published.
H03F1/26 IPC
Details of amplifiers with only discharge tubes, only semiconductor devices or only unspecified devices as amplifying elements Modifications of amplifiers to reduce influence of noise generated by amplifying elements
This application is a continuation-in-part of U.S. patent application Ser. No. 10/293,092 filed Nov. 13, 2002, which claims priority of U.S. Provisional Patent Application Ser. No. 60/338,574 filed Nov. 13, 2001. These applications are incorporated herein by reference.
Design of a good information system based on several characteristics is an important requirement for successfully carrying out any decision-making activity. In many cases though a significant amount of information is available, we fail to use such information in a meaningful way. As we require high quality products in day-to-day life, it is also required to have high quality information systems to make robust decisions or predictions. To produce high quality products, it is well established that the variability in the processes must be reduced first. Variability can be accurately measured and reduced only if we have a suitable measurement system with appropriate measures. Similarly, in the design of information systems, it is essential to develop a measurement scale and use appropriate measures to make accurate predictions or decisions.
Usually, information systems deal with multidimensional characteristics. A multidimensional system could be an inspection system, a medical diagnosis system, a sensor system, a face/voice recognition system (any pattern recognition system), credit card/loan approval system, a weather forecasting system or a university admission system. As we encounter these multidimensional systems in day-to-day life, it is important to have a measurement scale by which degree of abnormality (severity) can be measured to take appropriate decisions. In the case of medical diagnosis, the degree of abnormality refers to the severity of diseases and in the case of credit card/loan approval system it refers to the ability to pay back the balance/loan. If we have a measurement scale based on the characteristics of multidimensional systems, it greatly enhances the decision maker's ability to take judicious decisions. While developing a multidimensional measurement scale, it is essential to keep in mind the following criteria: 1) having a base or reference point to the scale, 2) validation of the scale, and 3) selection of useful subset of variables with suitable measures for future use.
There are several multivariate methods. These methods are being used in multidimensional applications, but still there are incidences of false alarms in applications like weather forecasting, airbag sensor operation, and medical diagnosis. These problems could be because of not having an adequate measurement system with suitable measures to determine or predict the degree of severity accurately.
A process for multivariate data analysis includes the steps of using an adjoint matrix to compute a new distance for a data set in a Mahalanobis space. The relation of a datum relative to the Mahalanobis space is then determined.
A medical diagnosis process includes defining a set of variables relating to a patient condition and collecting a data set of the set of variables for a normal group. Standardized values of the set of variables of the normal group are then computed and used to construct a Mahalanobis space. A distance for an abnormal value outside the Mahalanobis space is then computed. Important variables from the set of variables are identified based on orthogonal arrays and signal to noise ratios. Subsequent monitoring of conditions occurs based upon the important variables.
FIG. 1 is a schematic illustrating a multi-dimensional diagnosis system of the present invention;
FIG. 2 is a graphical representation of a voice recognition pattern according to the present invention parsed into the letter k subsets that correspond to k patterns numbered from 1,2, . . . k where each pattern starts at a low value, reaches a maximum and then again returns to the low value;
FIG. 3 is a graphical representation of MDAs values for normal and abnormal values for nine separate data points;
FIG. 4 is a graphical representation of MDA values for normal versus abnormal values with important variable usage, for the data of FIG. 3;
FIG. 5 is a graphical representation of Gram-Schmidt predicted values as a function of variable number compared with assigned values for a seventeen variable test set; and
FIG. 6 is a graphical representation of Gram-Schmidt predicted values as a function of variable number compared with assigned values for a nineteen variable test set including two variables with zero standard deviation.
The inventive method helps develop multidimensional measurement scale by integrating mathematical and statistical concepts such as Mahalanobis distance and Gram-Schmidt's orthogonalization method, with the principles of quality engineering or Taguchi Methods.
The selection of unit group (Mahalanobis group) is the most important aspect of MTS and its related methods. Every individual observation in this group has a unique pattern. Since the conditions of the observations are measured from this group, it is desirable that observations within this group be as uniform as possible. From this group, the distances (of observations outside of this group) are measured to perform the diagnosis. These distances, which are similar to the Mahalanobis distance, indicate the degree of severities of individual observations. A group of observations is needed (as in the case of the reference group) to measure distances because with one observation a correlation structure cannot be obtained. It should be noted that the correlation matrix corresponding to this reference group is also used to measure distances outside of this group. In MTS, S/N ratios are calculated based on the observations that are outside of the unit space.
In MTS and its related methods, the diagnosis is performed after validating the scale with variables defining the multidimensional system. The validation is done with observations outside of unit group by computing S/N ratios. S/N ratio is the measure of correlation between βinput signalβ and βoutputβ of the system. If there is a good correlation (higher S/N ratio), then the scale is useful for diagnosis.
One of the main objectives of the present invention is to introduce a scale based on all input characteristics to measure the degree of abnormality. In the case of medical diagnosis, for example, the aim is to measure the degree of severity of each disease based on this scale. To construct such a scale, Mahalanobis distance (MD) is used. MD is a squared distance (also denoted as D2) and is calculated for jth observation, in a sample of size n with k variables, by using the following formula:
MDj=Dj2=(1/k)ZijCβ1Zβ²ijββ(1)
Where, j=1 to n
There is also an alternate way to compute MD values using Gram-Schmidt's orthogonalization process. It can be seen that MD in Equation (1) is obtained by scaling, that is by dividing with k, the original Mahalanobis distance. MD can be considered as the mean square deviation (MSD) in multidimensional spaces. The present invention focuses on constructing a normal group, or in the application of medical diagnosis a healthy group, from a data population, called Mahalanobis Space (MS). Defining the normal group or MS is the choice of a specialist conducting the data analysis. In case of medical diagnosis, the MS is constructed only for the people who are healthy and in case of manufacturing inspection system, the MS is constructed for high quality products. Thus, MS is a database for the normal group consisting of the following quantities:
Since MD values are used to define the normal group, this group is designated as the Mahalanobis Space. It can be easily shown, with standardized values, that MS has zero point as the mean vector and the average MD as unity. Because the average MD of MS is unity, MS is also called as the unit space. The zero point and the unit distance are used as reference point for the scale of normalcy relating to inclusion of a subject within MS. This scale is often operative in identifying the conditions outside the Mahalanobis Space. In order to validate the accuracy of the scale, different kinds of known conditions outside MS are used. If the scale is good, these conditions should have MDs that match with decision maker's judgment. In this application, the conditions outside MS are not considered as a separate group (population) because the occurrence of these conditions are unique, for example a patient may be abnormal because of high blood pressure or because of high sugar content. Because of this reason, the same correlation matrix of the MS is used to compute the MD values of each abnormal. MD of an abnormal point is the distance of that point from the center point of MS.
In the next phase of the invention, orthogonal arrays (OAs) and signal-to-noise (S/N) ratios are used to choose the relevant variables. There are different kinds of S/N ratios depending on the prior knowledge about the severity of the abnormals.
A typical multidimensional system used in the present invention is as shown in FIG. 1, where X1,X2, . . . ,Xn correspond to the variables that provide a set of information to make a decision. Using these variables, MS is constructed for the healthy or normal group, which becomes the reference point for the measurement scale. After constructing the MS, the measurement scale is validated by considering the conditions outside MS. These outside conditions are typically checked with the given input signals and in the presence of noise factors (if any). If the noise factors are present, a correct decision has to be made about the state of the system. In the context of multivariate diagnosis system, it would be appropriate to consider two types of noise conditions. They are 1) active noise and 2) criminal noise. Example for active noise condition is change in usage environment such as conditions in different manufacturing environments or different hospitals and the example for criminal noise conditions are unexpected conditions such as terrorist attacks on 11 Sep. 2001 in which the system is operating. It is important to design multivariate information systems considering these two types of noise conditions. In FIG. 1, the input signal is the true value of the state of the system, if known. The output (MD) should have a good correlation with the true state of the system (input signal). In most applications, it is not easy to obtain the true states of the system. In such cases, the working averages of the different classes, where the classes correspond to the different degrees of severity, can be considered as the input signals.
After validating the measurement scale, OAs and S/N ratios are used to identify the variables of importance. OAs are used to minimize the number of variable combinations to be tested. The variables are allocated to the columns of the array. In MTS analysis only two level OAs are used as there are only two levels for the variablesβpresence and absence. To identify the variables of importance, S/N ratios are used.
The inventive process can illustratively be applied to a multidimensional system in four stages. The steps in each exemplary stage are listed below:
Stage I: Construction of a Measurement Scale with Mahalanobis Space (Unit Space) as the Reference
Monitor the conditions using the scale, which is developed with the help of the useful set of variables. Based on the values of MDs, appropriate corrective actions can be taken. The decision to take the necessary actions depends on the value of the threshold.
In case of medical diagnosis application, above steps have to be performed for each kind of disease in the subsequent phases of diagnosis. It is appreciated that many additional applications for the present invention exist as illustratively recited in βThe Mahalanobis Taguchi StrategyβA Pattern Technology Systemβ by G. Taguchi and R. Jugulum, John-Wiley, 2002 and in βThe Mahalanobis Taguchi Systemβ by G. Taguchi et al., McGraw-Hill, 2001.
According to the present invention, an adjoint matrix method is used to calculate MD values.
If A is a square matrix, the inverse can be computed for square matrices only, then its inverse Aβ1 is given as:
Aβ1=(1/det. A) Aadjββ(2)
Where,
Aadj is called adjoint matrix of A. Adjoint matrix is transpose of cofactor matrix, which is obtained by cofactors of all the elements of matrix A, det. A is called determinant of the matrix A. The determinant is a characteristic number (scalar) associated with a square matrix. A matrix is said to be singular if its determinant is zero.
As mentioned before, the determinant is a characteristic number associated with a square matrix. The importance of determinant can be realized when solving a system of linear equations using matrix algebra. The solution to the system of equations contains inverse matrix term, which is obtained by dividing the adjoint matrix by determinant. If the determinant is zero then, the solution does not exist.
Considering a 2Γ2 matrix as shown below:
A
=
[
a
11
a
12
a
21
a
22
]
The determinant of this matrix is a11 a22 βa12 a21.
Considering a 3Γ3 matrix as shown below:
A
=
[
a
11
a
12
a
13
a
21
a
22
a
23
a
31
a
32
a
33
]
The determinant of A can be calculated as:
det. A=a11A11+a12A12+a13A13
Where,
A11=(a22a33βa23a32); A12=β(a21a33βa23a31); A13=(a21a32βa22a31) are called as cofactors of the elements a11,a12, and a13 of matrix A respectively. Along a row or a column, the cofactors will have alternate plus and minus sign with the first cofactor having a positive sign.
The above equation is obtained by using the elements of the first row and the sub matrices obtained by deleting the rows and columns passing through these elements. The same value of determinant can be obtained by using other rows or any column of the matrix. In general, the determinant of a nΓn square matrix can be written as:
det. A=ai1Ai1+ai2Ai2+ . . . +ainAin along any row index i, where, i=1,2, . . . , n or
det. A=a1jA1j+a2jA2j+ . . . +anjAnj along any column index j, where, j=1,2, . . . ,n
Cofactor
From the above discussion, it is clear that the cofactor of Aij of an element aij is the factor remaining after the element aij is factored out. The method of computing the co-factors is explained above for a 3Γ3 matrix. Along a row or a column the cofactors will have alternate signs of positive and negative with the first cofactor having a positive sign.
Adjoint Matrix of a Square Matrix
The adjoint of a square matrix A is obtained by replacing each element of A with its own cofactor and transposing the result.
Considering a 3Γ3 matrix as shown below:
A
=
[
a
11
a
12
a
13
a
21
a
22
a
23
a
31
a
32
a
33
]
The cofactor matrix containing cofactors (Aijs) of the elements of the above matrix can be written as:
A
=
[
a
11
a
12
a
13
a
21
a
22
a
23
a
31
a
32
a
33
]
The adjoint of the matrix A, which is obtained by transposing the cofactor matrix, can be written as:
Adj
.
β’
A
=
[
a
11
a
12
a
13
a
21
a
22
a
23
a
31
a
32
a
33
]
Inverse Matrix
The inverse of matrix A (denoted as Aβ1) can be obtained by dividing the elements of its adjoint by the determinant.
Singular and Non-Singular Matrices
If the determinant of a square matrix is zero then, it is called a singular matrix. Otherwise, the matrix is known as non-singular.
The present invention is applied to solve a number of longstanding data analysis problems. These are exemplified as follows.
Multi-collinearity Problems
Multi-collinearity problems arise out of strong correlations. When there are strong correlations, the determinant of correlation matrix tends to become zero thereby making the matrix singular. In such cases, the inverse matrix will be inaccurate or cannot be computed (because determinant term is in the denominator of Equation (2)). As a result, scaled MDs will also be inaccurate or cannot be computed. Such problems can be avoided if we use a matrix form, which is not affected by determinant term. From Equation (2), it is clear that adjoint matrix satisfies this requirement.
MD values in MTS method are computed by using inverse of the correlation matrix (Cβ1, where C is correlation matrix). In the present invention, the adjoint matrix is used to calculate the distances. If MDA denotes the distances obtained from adjoint matrix method, then equation for MDA can be written as:
MDAj=(1/k)Zij Cadj Zijβ²ββ(3)
Where, j=1 to n
The relationship between the conventional MD and the MDAs in Equation (3) can be written as:
MDj=(1/det.C)MDAjββ(4)
Thus, an MDA value is similar to a MD value with different properties, that is, the average MDA is not unity. Like in the case of MD values, MDA values represent the distances from the normal group and can be used to measure the degree of abnormalities. In adjoint matrix method also, the Mahalanobis space contains means, standard deviations and correlation structure of the normal or healthy group. Here, the Mahalanobis space cannot be called as unit space since the average of MDAs is not unity.
Ξ²-adjustment Method
The present invention has applications in multivariate analysis in the presence of small correlation coefficients in correlation matrix. When there are small correlation coefficients, the adjustment factor Ξ² is calculated as follows.
Ξ²
=
0
β’
β’
if
β’
β’
r
β€
1
/
β
n
Ξ²
=
1
-
1
n
-
1
β’
(
1
r
2
-
1
)
β’
β’
if
β’
β’
r
>
1
/
β
n
(
5
)
where r is correlation coefficient and n is sample size.
After computing Ξ², the elements of the correlation matrix are adjusted by multiplying them with Ξ². This adjusted matrix is used to carry out MTS analysis or analysis with adjoint matrix.
To explain the applicability of Ξ²-adjustment method, Dr. Kanetaka's data on liver disease testing is used. The data contains observations of healthy group as well as of the conditions outside Mahalanobis space (MS). The healthy group (MS) is constructed based on observations on 200 people, who do not have any health problems. There are 17 abnormal conditions. This example is chosen since the correlation matrix in this case contains a few small correlation coefficients. The corresponding Ξ²-adjusted correlation matrix (using Equation (5)) is as shown in Table 1.
| TABLE 1 |
| Ξ²-adjusted correlation matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | |
| X1 | 1.000 | β0.281 | β0.261 | β0.392 | β0.199 | 0.052 | 0.000 | 0.185 | 0.277 |
| X2 | β0.281 | 1.000 | 0.055 | 0.406 | 0.687 | 0.271 | 0.368 | β0.061 | 0.000 |
| X3 | β0.261 | 0.055 | 1.000 | 0.417 | 0.178 | 0.024 | 0.103 | 0.002 | 0.000 |
| X4 | β0.392 | 0.406 | 0.417 | 1.000 | 0.301 | 0.000 | 0.000 | 0.000 | β0.059 |
| X5 | β0.199 | 0.687 | 0.178 | 0.301 | 1.000 | 0.332 | 0.374 | 0.000 | 0.000 |
| X6 | 0.052 | 0.271 | 0.024 | 0.000 | 0.332 | 1.000 | 0.788 | 0.301 | 0.149 |
| X7 | 0.000 | 0.368 | 0.103 | 0.000 | 0.374 | 0.788 | 1.000 | 0.109 | 0.000 |
| X8 | 0.185 | β0.061 | 0.002 | 0.000 | 0.000 | 0.301 | 0.109 | 1.000 | 0.208 |
| X9 | 0.277 | 0.000 | 0.000 | β0.059 | 0.000 | 0.149 | 0.000 | 0.208 | 1.000 |
| X10 | β0.056 | 0.643 | 0.149 | 0.252 | 0.572 | 0.544 | 0.562 | 0.090 | 0.000 |
| X11 | β0.067 | 0.384 | 0.155 | 0.197 | 0.419 | 0.528 | 0.500 | 0.206 | 0.113 |
| X12 | 0.247 | β0.217 | 0.000 | β0.100 | 0.000 | 0.115 | 0.097 | 0.231 | 0.143 |
| X13 | 0.099 | 0.252 | 0.127 | 0.050 | 0.355 | 0.305 | 0.362 | 0.054 | 0.080 |
| X14 | 0.267 | β0.201 | 0.014 | β0.099 | 0.000 | 0.139 | 0.115 | 0.238 | 0.139 |
| X15 | β0.276 | 0.885 | 0.117 | 0.353 | 0.640 | 0.307 | 0.387 | 0.000 | β0.007 |
| X16 | 0.000 | 0.236 | β0.078 | 0.036 | 0.099 | 0.154 | 0.064 | 0.043 | β0.044 |
| X17 | β0.265 | 0.796 | 0.173 | 0.403 | 0.671 | 0.347 | 0.425 | 0.000 | 0.000 |
| X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | ||
| X1 | β0.056 | β0.067 | 0.247 | 0.099 | 0.267 | β0.276 | 0.000 | β0.265 | |
| X2 | 0.643 | 0.384 | β0.217 | 0.252 | β0.201 | 0.885 | 0.236 | 0.796 | |
| X3 | 0.149 | 0.155 | 0.000 | 0.127 | 0.014 | 0.117 | β0.078 | 0.173 | |
| X4 | 0.252 | 0.197 | β0.100 | 0.050 | β0.099 | 0.353 | 0.036 | 0.403 | |
| X5 | 0.572 | 0.419 | 0.000 | 0.355 | 0.000 | 0.640 | 0.099 | 0.671 | |
| X6 | 0.544 | 0.528 | 0.115 | 0.305 | 0.139 | 0.307 | 0.154 | 0.347 | |
| X7 | 0.562 | 0.500 | 0.097 | 0.362 | 0.115 | 0.387 | 0.064 | 0.425 | |
| X8 | 0.090 | 0.206 | 0.231 | 0.054 | 0.238 | 0.000 | 0.043 | 0.000 | |
| X9 | 0.000 | 0.113 | 0.143 | 0.080 | 0.139 | β0.007 | β0.044 | 0.000 | |
| X10 | 1.000 | 0.679 | 0.000 | 0.427 | 0.016 | 0.607 | 0.103 | 0.645 | |
| X11 | 0.679 | 1.000 | 0.128 | 0.329 | 0.120 | 0.436 | 0.000 | 0.457 | |
| X12 | 0.000 | 0.128 | 1.000 | 0.296 | 0.966 | β0.105 | 0.000 | 0.000 | |
| X13 | 0.427 | 0.329 | 0.296 | 1.000 | 0.304 | 0.249 | 0.000 | 0.339 | |
| X14 | 0.016 | 0.120 | 0.966 | 0.304 | 1.000 | β0.077 | 0.000 | 0.000 | |
| X15 | 0.607 | 0.436 | β0.105 | 0.249 | β0.077 | 1.000 | 0.262 | 0.768 | |
| X16 | 0.103 | 0.000 | 0.000 | 0.000 | 0.000 | 0.262 | 1.000 | 0.149 | |
| X17 | 0.645 | 0.457 | 0.000 | 0.339 | 0.000 | 0.768 | 0.149 | 1.000 | |
With this matrix, MTS analysis is carried out with dynamic S/N ratio analysis and as a result the following useful variable combination was obtained: X4-X5- X7-X10-X12-X13-X14-X15-X16-X17 This combination is very similar to the useful variable set obtained without Ξ²-adjustment; the only difference is presence of variables X7 and X16.
With this useful variable set, S/N ratio analysis is carried out to measure improvement in overall system performance. From the Table 2, which shows system performance in the form of S/N ratios, it is clear that there is a gain of 0.91 dB units if useful variables are used instead of entire set of variables.
| TABLE 2 |
| S/N Ratio Analysis (Ξ²-adjustment method) |
| S/N ratio-optimal system | 43.81 dB | |
| S/N ratio-original system | 42.90 dB | |
| Gain | β0.91 dB | |
In an alternate embodiment of the present invention, a Mahalanobis distance is computed using a Gram-Schmidt orthogonalization process (GSP). GSP is often a more robust and sample size insensitive orthogonalization process. Like in MTS, using the inventive MTGS method, the coefficients of orthogonal expansion of unit group are also used to predict the conditions outside this group. The usefulness of this space is tested with signal to noise ratios, like control factors are tested in hardware design. According to the Gram-Schmidt process, original variables are converted to orthogonal and independent variables. The Gram-Schmidt orthogonalization process is particularly well suited to identify the direction of abnormals. While measuring the degree of abnormality of a given value, a longer distance corresponds to higher degree of severity. In some instances, such as stock performance or financial market predictions, longer distance can represent favorable situations if the normal space is constructed based on companies with average performance. In such an instance, both underperforming and outperforming companies will have longer distances. Distinguishment of these diametrically abnormal situations is preferably performed with the Gram-Schmidt orthogonalization process (GSP).
The GSP operates on a set of given linearly independent vectors Z1, Z2, . . . Zk, to determine a corresponding set of mutually perpendicular vectors U1, U2, . . . Uk with the same linear span as shown in Equation (6).
The Gram-Schmidt's vectors are constructed sequentially by setting up Equations (7).
U1=Z.
U2=Z2β((Zβ²2U1)/(Uβ²1U1))U1
Uk=Zkβ((Zβ²kU1)/(Uβ²1U1))U1β. . . β((Zβ²kUkβ1)/(Uβ²kβ1Ukβ1)Ukβ1ββ(7)
Where, β² denotes a vector transpose. While calculating MD using GSP, standardized values of the variables are used. Therefore, in the above set of Equations (7), Z1, Z2, . . . Zk correspond to standardized values.
Calculation of MD Using Gram-Schmidt Process (GSP)
Beginning with a sample of size n, where each sample contains observations on k variables. After standardizing the variables, a set of standardized vectors is obtained. Let these vectors be:
Z1=(z11, z12, . . . , z1n)
Z2=(z21, z22, . . . , z2n)
Zk=(zk1,zk2, . . . , zkn)ββ(8)
After performing GSP, the orthogonal vectors are as follows:
U1=(u11,u12, . . . , u1n)
U2=(u21, u22, . . . , u2n)
Uk=(uk1,uk2, . . . ,ukn)ββ(9)
It is easily shown that mean of vectors U1,U2, . . . ,Uk is zero. Let s1,s2, . . . sk be standard deviations (s.d.s) of U1,U2, . . . ,Uk respectively. Since the sample of size is n, there are n different MDs. MD corresponding to jth observation of the sample is computed using Equation (10).
MDj=(1/k) [(u1j2/s12)+(u2j2/s22)+ . . . +(ukj2/sk2)]ββ(10)
Where, j=1. . . n, the values of MD obtained from Equations (1) and (10) are exactly the same. In MTGS methodology, abnormal MDs are computed from the means, standard deviations and Gram-Schmidt coefficients of the normal group or Mahalanobis space, while the Mahalanobis space is a database including means, standard deviations, Gram-Schmidt coefficients and the Mahalanobis distances.
Predictions Based on Gram-Schmidt Variables
According to the present invention, a method of making predictions using Gram-Schmidt (GS) variables without calculating the Mahalanobis distance is provided. This method is useful in situations where the reference group consists of the variables with small or even zero standard deviation or variance. In the most extreme case where if variables have zero standard deviations then correlations with other variables are not possible and hence calculation of Mahalanobis distances is not possible, although variables with zero standard deviations represent very important patterns. This type of situation is frequently seen in pattern recognition problems.
The method of making predictions according to one embodiment of the present invention is described in the following steps:
S T = Total β’ β’ Sum β’ β’ of β’ β’ Squares = β i = 1 t β’ U ij 2 r = Sum β’ β’ of β’ β’ squares β’ β’ due β’ β’ to β’ β’ input β’ β’ signal = β i = 1 t β’ M i 2 S Ξ² = Sum β’ β’ of β’ β’ Squares β’ β’ due β’ β’ to β’ β’ Slope = ( 1 / r ) β‘ [ β i = 1 t β’ M i β’ β’ U ij ] 2
Ξ² j = [ β i = 1 t β’ M i β’ β’ U ij ] / r ( 12 )
Y i = β j = 1 k β’ ( Ξ· j β’ U ij Ξ² j ) β j = 1 k β’ Ξ· j ( 14 )
Selection of suitable subsets is very important in multivariate diagnosis/pattern recognition activities as it is difficult to handle large datasets with several numbers of variables. The present invention applies a new metric called Multiple Mahalanobis Distance (MMD) for computing S/N ratios to select suitable subsets. This method is useful in complex situations, illustratively including voice recognition or TV picture recognition. In these cases, the number of variables runs into the order of several thousands. Use of MMD method helps in reducing the problem complexity and to make effective decisions in complex situations.
In MMD method, large number of variables is divided into several subsets containing local variables. For example, in a voice recognition pattern (as shown in FIG. 2), let there be k subsets. The subsets correspond to k patterns numbered from 1,2, . . . k. Each pattern starts at a low value, reaches a maximum and then again returns to the low value. These patterns (subsets) are described by a set of respective local variables. In MMD method, for each subset the Mahalanobis distances are calculated. These Mahalanobis distances are used to calculate MMD. Using abnormal MMDs, S/N ratios are calculated to determine useful subsets. In this way the complexity of the problems is reduced.
This method is also useful for identifying the subsets (or variables in the subsets) corresponding to different failure modes or patterns that are responsible for higher values of MDs. For example in the case of final product inspection system, use of MMD method would help to find out variables corresponding to different processes that are responsible for product failure.
If the variables corresponding to different subsets or processes cannot be identified then, decision-maker can select subsets from the original set of variables and identify the best subsets required.
Exemplary Steps in Inventive Process
The adjoint matrix method is applied to liver disease test data considered earlier. For the purpose of better understanding of the discussion, correlation matrix, inverse matrix and adjoint matrix corresponding to the 17 variables are given in Tables 3, 4, and 5 respectively. In this case the determinant of the correlation matrix is 0.00001314.
The Mahalanobis distances calculated by inverse matrix method and adjoint matrix method (MDAs), are given in Table 6 (for normal group) and in Table 7 (for abnormal group). From the Table 6, it is clear that the average MDAs for normals do not converge to 1.0. MDAs and MDs are related according to the Equation (4).
| TABLE 3 |
| Correlation matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | |
| X1 | 1.000 | β0.297 | β0.278 | β0.403 | β0.220 | 0.101 | 0.041 | 0.208 | 0.293 |
| X2 | β0.297 | 1.000 | 0.103 | 0.416 | 0.690 | 0.287 | 0.379 | β0.108 | β0.048 |
| X3 | β0.278 | 0.103 | 1.000 | 0.427 | 0.202 | 0.084 | 0.139 | 0.072 | 0.011 |
| X4 | β0.403 | 0.416 | 0.427 | 1.000 | 0.315 | 0.038 | 0.056 | 0.010 | β0.106 |
| X5 | β0.220 | 0.690 | 0.202 | 0.315 | 1.000 | 0.345 | 0.385 | 0.063 | β0.057 |
| X6 | 0.101 | 0.287 | 0.084 | 0.038 | 0.345 | 1.000 | 0.790 | 0.316 | 0.177 |
| X7 | 0.041 | 0.379 | 0.139 | 0.056 | 0.385 | 0.790 | 1.000 | 0.143 | 0.068 |
| X8 | 0.208 | β0.108 | 0.072 | 0.010 | 0.063 | 0.316 | 0.143 | 1.000 | 0.229 |
| X9 | 0.293 | β0.048 | 0.011 | β0.106 | β0.057 | 0.177 | 0.068 | 0.229 | 1.000 |
| X10 | β0.104 | 0.647 | 0.177 | 0.269 | 0.578 | 0.550 | 0.568 | 0.129 | 0.065 |
| X11 | β0.112 | 0.395 | 0.182 | 0.219 | 0.429 | 0.535 | 0.507 | 0.227 | 0.147 |
| X12 | 0.264 | β0.237 | 0.070 | β0.136 | 0.012 | 0.148 | 0.134 | 0.250 | 0.171 |
| X13 | 0.135 | 0.269 | 0.158 | 0.100 | 0.367 | 0.320 | 0.373 | 0.103 | 0.121 |
| X14 | 0.283 | β0.222 | 0.078 | β0.135 | 0.032 | 0.168 | 0.148 | 0.257 | 0.168 |
| X15 | β0.292 | 0.886 | 0.150 | 0.365 | 0.644 | 0.321 | 0.398 | β0.063 | β0.075 |
| X16 | β0.019 | 0.254 | β0.119 | 0.091 | 0.135 | 0.181 | 0.109 | 0.095 | β0.096 |
| X17 | β0.282 | 0.798 | 0.198 | 0.413 | 0.675 | 0.359 | 0.435 | β0.015 | β0.061 |
| X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | ||
| X1 | β0.104 | β0.112 | 0.264 | 0.135 | 0.283 | β0.292 | β0.019 | β0.282 | |
| X2 | 0.647 | 0.395 | β0.237 | 0.269 | β0.222 | 0.886 | 0.254 | 0.798 | |
| X3 | 0.177 | 0.182 | 0.070 | 0.158 | 0.078 | 0.150 | β0.119 | 0.198 | |
| X4 | 0.269 | 0.219 | β0.136 | 0.100 | β0.135 | 0.365 | 0.091 | 0.413 | |
| X5 | 0.578 | 0.429 | 0.012 | 0.367 | 0.032 | 0.644 | 0.135 | 0.675 | |
| X6 | 0.550 | 0.535 | 0.148 | 0.320 | 0.168 | 0.321 | 0.181 | 0.359 | |
| X7 | 0.568 | 0.507 | 0.134 | 0.373 | 0.148 | 0.398 | 0.109 | 0.435 | |
| X8 | 0.129 | 0.227 | 0.250 | 0.103 | 0.257 | β0.063 | 0.095 | β0.015 | |
| X9 | 0.065 | 0.147 | 0.171 | 0.121 | 0.168 | β0.075 | β0.096 | β0.061 | |
| X10 | 1.000 | 0.683 | 0.052 | 0.437 | 0.079 | 0.612 | 0.138 | 0.649 | |
| X11 | 0.683 | 1.000 | 0.159 | 0.342 | 0.152 | 0.445 | 0.048 | 0.465 | |
| X12 | 0.052 | 0.159 | 1.000 | 0.310 | 0.967 | β0.140 | β0.004 | β0.023 | |
| X13 | 0.437 | 0.342 | 0.310 | 1.000 | 0.318 | 0.267 | β0.041 | 0.352 | |
| X14 | 0.079 | 0.152 | 0.967 | 0.318 | 1.000 | β0.119 | 0.025 | β0.011 | |
| X15 | 0.612 | 0.445 | β0.140 | 0.267 | β0.119 | 1.000 | 0.279 | 0.771 | |
| X16 | 0.138 | 0.048 | β0.004 | β0.041 | 0.025 | 0.279 | 1.000 | 0.177 | |
| X17 | 0.649 | 0.465 | β0.023 | 0.352 | β0.011 | 0.771 | 0.177 | 1.000 | |
| TABLE 4 |
| Inverse matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | |
| X1 | 1.592 | β0.003 | 0.307 | 0.297 | 0.118 | β0.082 | β0.116 | β0.193 | β0.304 |
| X2 | β0.003 | 8.136 | 0.658 | β0.706 | β1.281 | 0.627 | β0.439 | 0.379 | β0.576 |
| X3 | 0.307 | 0.658 | 1.442 | β0.594 | β0.169 | 0.136 | β0.258 | β0.066 | β0.123 |
| X4 | 0.297 | β0.706 | β0.594 | 1.677 | 0.101 | 0.009 | 0.272 | β0.143 | 0.088 |
| X5 | 0.118 | β1.281 | β0.169 | 0.101 | 2.357 | β0.197 | 0.110 | β0.193 | 0.200 |
| X6 | β0.082 | 0.627 | 0.136 | 0.009 | β0.197 | 3.403 | β2.266 | β0.483 | β0.297 |
| X7 | β0.116 | β0.439 | β0.258 | 0.272 | 0.110 | β2.266 | 3.192 | 0.275 | 0.252 |
| X8 | β0.193 | 0.379 | β0.066 | β0.143 | β0.193 | β0.483 | 0.275 | 1.338 | β0.157 |
| X9 | β0.304 | β0.576 | β0.123 | 0.088 | 0.200 | β0.297 | 0.252 | β0.157 | 1.247 |
| X10 | β0.113 | β1.482 | β0.115 | 0.071 | β0.034 | β0.436 | β0.172 | β0.056 | 0.101 |
| X11 | 0.248 | 0.748 | 0.070 | β0.157 | β0.121 | β0.348 | β0.133 | β0.179 | β0.218 |
| X12 | 0.337 | β0.192 | 0.223 | 0.026 | 0.210 | 0.332 | β0.240 | β0.103 | β0.118 |
| X13 | β0.284 | β0.077 | β0.097 | β0.049 | β0.235 | 0.044 | β0.195 | 0.064 | β0.034 |
| X14 | β0.552 | 1.358 | β0.304 | 0.055 | β0.440 | β0.156 | 0.106 | β0.028 | β0.006 |
| X15 | 0.146 | β4.277 | β0.315 | 0.317 | 0.077 | β0.108 | β0.009 | 0.022 | 0.240 |
| X16 | β0.028 | β0.316 | 0.194 | β0.103 | 0.108 | β0.338 | 0.147 | β0.143 | 0.157 |
| X17 | 0.198 | β1.525 | β0.023 | β0.296 | β0.429 | β0.104 | β0.153 | 0.012 | 0.131 |
| X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | ||
| X1 | β0.113 | 0.248 | 0.337 | β0.284 | β0.552 | 0.146 | β0.028 | 0.198 | |
| X2 | β1.482 | 0.748 | β0.192 | β0.077 | 1.358 | β4.277 | β0.316 | β1.525 | |
| X3 | β0.115 | 0.070 | 0.223 | β0.097 | β0.304 | β0.315 | 0.194 | β0.023 | |
| X4 | 0.071 | β0.157 | 0.026 | β0.049 | 0.055 | 0.317 | β0.103 | β0.296 | |
| X5 | β0.034 | β0.121 | 0.210 | β0.235 | β0.440 | 0.077 | 0.108 | β0.429 | |
| X6 | β0.436 | β0.348 | 0.332 | 0.044 | β0.156 | β0.108 | β0.338 | β0.104 | |
| X7 | β0.172 | β0.133 | β0.240 | β0.195 | 0.106 | β0.009 | 0.147 | β0.153 | |
| X8 | β0.056 | β0.179 | β0.103 | 0.064 | β0.028 | 0.022 | β0.143 | 0.012 | |
| X9 | 0.101 | β0.218 | β0.118 | β0.034 | β0.006 | 0.240 | 0.157 | 0.131 | |
| X10 | 3.321 | β1.247 | 0.928 | β0.335 | β1.004 | 0.386 | 0.041 | β0.350 | |
| X11 | β1.247 | 2.302 | β0.880 | β0.001 | 0.754 | β0.637 | 0.151 | β0.036 | |
| X12 | 0.928 | β0.880 | 16.234 | β0.293 | β15.614 | 0.589 | 0.274 | β0.363 | |
| X13 | β0.335 | β0.001 | β0.293 | 1.537 | β0.096 | 0.043 | 0.167 | β0.145 | |
| X14 | β1.004 | 0.754 | β15.614 | β0.096 | 16.526 | β0.826 | β0.463 | β0.018 | |
| X15 | 0.386 | β0.637 | 0.589 | 0.043 | β0.826 | 5.415 | β0.330 | β0.691 | |
| X16 | 0.041 | 0.151 | 0.274 | 0.167 | β0.463 | β0.330 | 1.249 | 0.120 | |
| X17 | β0.350 | β0.036 | β0.363 | β0.145 | β0.018 | β0.691 | 0.120 | 3.599 | |
| TABLE 5 |
| Adjoint matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | |
| X1 | β2.09Eβ05 | ββ3.8Eβ08 | β4.03Eβ06 | β3.9Eβ06 | β1.55Eβ06 | β1.07Eβ06 | β1.52Eβ06 | β2.53Eβ06 | βββ4Eβ06 |
| X2 | ββ3.8Eβ08 | β0.000107 | β8.65Eβ06 | β9.27Eβ06 | β1.68Eβ05 | β8.24Eβ06 | β5.77Eβ06 | β4.98Eβ06 | β7.57Eβ06 |
| X3 | β4.03Eβ06 | β8.65Eβ06 | β1.89Eβ05 | β7.81Eβ06 | β2.22Eβ06 | β1.78Eβ06 | ββ3.4Eβ06 | β8.65Eβ07 | β1.62Eβ06 |
| X4 | β3.9Eβ06 | β9.27Eβ06 | β7.81Eβ06 | β2.2Eβ05 | β1.33Eβ06 | β1.18Eβ07 | β3.57Eβ06 | β1.88Eβ06 | β1.16Eβ06 |
| X5 | β1.55Eβ06 | β1.68Eβ05 | β2.22Eβ06 | β1.33Eβ06 | β3.1Eβ05 | β2.59Eβ06 | β1.44Eβ06 | β2.54Eβ06 | β2.63Eβ06 |
| X6 | β1.07Eβ06 | β8.24Eβ06 | β1.78Eβ06 | β1.18Eβ07 | β2.59Eβ06 | β4.47Eβ05 | β2.98Eβ05 | β6.35Eβ06 | β3.91Eβ06 |
| X7 | β1.52Eβ06 | β5.77Eβ06 | ββ3.4Eβ06 | β3.57Eβ06 | β1.44Eβ06 | β2.98Eβ05 | β4.19Eβ05 | β3.61Eβ06 | β3.31Eβ06 |
| X8 | β2.53Eβ06 | β4.98Eβ06 | β8.65Eβ07 | β1.88Eβ06 | β2.54Eβ06 | β6.35Eβ06 | β3.61Eβ06 | β1.76Eβ05 | β2.07Eβ06 |
| X9 | βββ4Eβ06 | β7.57Eβ06 | β1.62Eβ06 | β1.16Eβ06 | β2.63Eβ06 | β3.91Eβ06 | β3.31Eβ06 | β2.07Eβ06 | β1.64Eβ05 |
| X10 | β1.49Eβ06 | β1.95Eβ05 | β1.51Eβ06 | β9.35Eβ07 | ββ4.5Eβ07 | β5.74Eβ06 | β2.26Eβ06 | β7.31Eβ07 | β1.32Eβ06 |
| X11 | β3.26Eβ06 | β9.83Eβ06 | β9.22Eβ07 | β2.06Eβ06 | ββ1.6Eβ06 | β4.57Eβ06 | β1.75Eβ06 | β2.35Eβ06 | β2.86Eβ06 |
| X12 | β4.43Eβ06 | β2.53Eβ06 | β2.93Eβ06 | β3.41Eβ07 | β2.77Eβ06 | β4.36Eβ06 | β3.16Eβ06 | β1.35Eβ06 | β1.56Eβ06 |
| X13 | β3.73Eβ06 | β1.01Eβ06 | β1.27Eβ06 | β6.46Eβ07 | β3.09Eβ06 | β5.75Eβ07 | β2.56Eβ06 | β8.37Eβ07 | β4.48Eβ07 |
| X14 | β7.25Eβ06 | β1.78Eβ05 | β3.99Eβ06 | β7.2Eβ07 | β5.78Eβ06 | β2.05Eβ06 | β1.4Eβ06 | β3.73Eβ07 | β8.37Eβ08 |
| X15 | β1.92Eβ06 | β5.62Eβ05 | β4.13Eβ06 | β4.17Eβ06 | β1.02Eβ06 | β1.42Eβ06 | β1.18Eβ07 | β2.92Eβ07 | β3.15Eβ06 |
| X16 | β3.63Eβ07 | β4.16Eβ06 | β2.55Eβ06 | β1.36Eβ06 | β1.42Eβ06 | β4.44Eβ06 | β1.94Eβ06 | β1.87Eβ06 | β2.06Eβ06 |
| X17 | β2.6Eβ06 | βββ2Eβ05 | β3.04Eβ07 | β3.89Eβ06 | β5.64Eβ06 | β1.37Eβ06 | β2.01Eβ06 | β1.61Eβ07 | β1.72Eβ06 |
| X10 | X11 | X12 | X13 | X14 | X15 | X16 | X17 | ||
| X1 | β1.49Eβ06 | β3.26Eβ06 | β4.43Eβ06 | β3.73Eβ06 | β7.25Eβ06 | β1.92Eβ06 | β3.63Eβ07 | β2.6Eβ06 | |
| X2 | β1.95Eβ05 | β9.83Eβ06 | β2.53Eβ06 | β1.01Eβ06 | β1.78Eβ05 | β5.62Eβ05 | β4.16Eβ06 | βββ2Eβ05 | |
| X3 | β1.51Eβ06 | β9.22Eβ07 | β2.93Eβ06 | β1.27Eβ06 | β3.99Eβ06 | β4.13Eβ06 | β2.55Eβ06 | β3.04Eβ07 | |
| X4 | β9.35Eβ07 | β2.06Eβ06 | β3.41Eβ07 | β6.46Eβ07 | β7.2Eβ07 | β4.17Eβ06 | β1.36Eβ06 | β3.89Eβ06 | |
| X5 | ββ4.5Eβ07 | ββ1.6Eβ06 | β2.77Eβ06 | β3.09Eβ06 | β5.78Eβ06 | β1.02Eβ06 | β1.42Eβ06 | β5.64Eβ06 | |
| X6 | β5.74Eβ06 | β4.57Eβ06 | β4.36Eβ06 | β5.75Eβ07 | β2.05Eβ06 | β1.42Eβ06 | β4.44Eβ06 | β1.37Eβ06 | |
| X7 | β2.26Eβ06 | β1.75Eβ06 | β3.16Eβ06 | β2.56Eβ06 | β1.4Eβ06 | β1.18Eβ07 | β1.94Eβ06 | β2.01Eβ06 | |
| X8 | β7.31Eβ07 | β2.35Eβ06 | β1.35Eβ06 | β8.37Eβ07 | β3.73Eβ07 | β2.92Eβ07 | β1.87Eβ06 | β1.61Eβ07 | |
| X9 | β1.32Eβ06 | β2.86Eβ06 | β1.56Eβ06 | β4.48Eβ07 | β8.37Eβ08 | β3.15Eβ06 | β2.06Eβ06 | β1.72Eβ06 | |
| X10 | β4.36Eβ05 | β1.64Eβ05 | β1.22Eβ05 | β4.41Eβ06 | β1.32Eβ05 | β5.07Eβ06 | β5.42Eβ07 | β4.59Eβ06 | |
| X11 | β1.64Eβ05 | β3.02Eβ05 | β1.16Eβ05 | β1.73Eβ08 | β9.91Eβ06 | β8.37Eβ06 | β1.98Eβ06 | β4.68Eβ07 | |
| X12 | β1.22Eβ05 | β1.16Eβ05 | β0.000213 | β3.85Eβ06 | β0.000205 | β7.74Eβ06 | β3.6Eβ06 | β4.77Eβ06 | |
| X13 | β4.41Eβ06 | β1.73Eβ08 | β3.85Eβ06 | β2.02Eβ05 | β1.27Eβ06 | β5.62Eβ07 | β2.19Eβ06 | ββ1.9Eβ06 | |
| X14 | β1.32Eβ05 | β9.91Eβ06 | β0.000205 | β1.27Eβ06 | β0.000217 | β1.09Eβ05 | β6.08Eβ06 | β2.41Eβ07 | |
| X15 | β5.07Eβ06 | β8.37Eβ06 | β7.74Eβ06 | β5.62Eβ07 | β1.09Eβ05 | β7.12Eβ05 | β4.34Eβ06 | β9.08Eβ06 | |
| X16 | β5.42Eβ07 | β1.98Eβ06 | β3.6Eβ06 | β2.19Eβ06 | β6.08Eβ06 | β4.34Eβ06 | β1.64Eβ05 | β1.58Eβ06 | |
| X17 | β4.59Eβ06 | β4.68Eβ07 | β4.77Eβ06 | ββ1.9Eβ06 | β2.41Eβ07 | β9.08Eβ06 | β1.58Eβ06 | β4.73Eβ05 | |
| TABLE 6 |
| MDs and MDAs for normal group |
| S. No. |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | |
| MD-inverse | 0.378374 | 0.431373 | 0.403562 | 0.500211 | 0.515396 | 0.495501 | 0.583142 |
| MD-Adjoint | 0.000005 | 0.000006 | 0.000005 | 0.000007 | 0.000007 | 0.000007 | 0.000008 |
| S. No. |
| 8 | . . . | 196 | 197 | 198 | 199 | 200 | Average | |
| MD-inverse | 0.565654 | . . . | 1.74 | 1.75 | 1.78 | 1.76 | 2.36 | 0.995 |
| MD-Adjoint | 0.000007 | . . . | 0.00002 | 0.00002 | 0.00002 | 0.00002 | 0.00003 | 0.000013 |
| TABLE 7 |
| MDs and MDAs for abnormals |
| S. No. |
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
| MD-Inverse | 7.72741 | 8.41629 | 10.29148 | 7.20516 | 10.59075 | 10.55711 | 13.31775 | 14.81278 |
| MD-adjoint | 0.00010 | 0.00011 | 0.00014 | 0.00009 | 0.00014 | 0.00014 | 0.00017 | 0.00019 |
| S. No. |
| . . . | 13 | 14 | 15 | 16 | 17 | Average | ||
| MD-Inverse | . . . | 19.65543 | 43.04050 | 78.64045 | 97.27242 | 135.70578 | 30.39451 | |
| MD-adjoint | . . . | 0.00026 | 0.00057 | 0.00103 | 0.00128 | 0.00178 | 0.00040 | |
L32(231) OA is used to accommodate 17 variables. Table 8 gives dynamic S/N ratios for all the combinations of this array with inverse matrix method and adjoint matrix method. Table 9 shows gain in S/N ratios for both the methods. It is clear that gains in S/N ratios are same for both methods. The important variable combination based on these gains is: X4-X5-X10-X12-X13-X14-X15-X17. From Table 10, which shows system performance in the form of S/N ratios, it is clear that there is a gain of 1.98 dB units if useful variables are used instead of all the variables. This gain is also exactly same as that obtained in inverse matrix method.
Hence, even if an adjoint matrix method is used, the ultimate results would be the same. However, MDA values are advantageous because it will not take into account the determinant of correlation matrix. In case of multi-collinearity problems, as the determinant tend to become zero, the inverse matrix becomes inefficient giving rise to inaccurate MDs. Such problems can be avoided if MDAs are used based on adjoint matrix method.
| TABLE 8 |
| Dynamic S/N ratios for the combinations of L32(231) array |
| S/N ratio | S/N ratio | |
| Run | (Inverse) | (Adjoint) |
| 1 | β6.252 | 42.560 |
| 2 | β6.119 | 42.693 |
| 3 | β10.024 | 38.788 |
| 4 | β10.181 | 38.631 |
| 5 | β10.348 | 38.464 |
| 6 | β10.495 | 38.317 |
| 7 | β7.934 | 40.878 |
| 8 | β8.177 | 40.635 |
| 9 | β9.234 | 39.578 |
| 10 | β9.631 | 39.181 |
| 11 | β3.338 | 45.474 |
| 12 | β3.406 | 45.406 |
| 13 | β10.932 | 37.880 |
| 14 | β11.121 | 37.691 |
| 15 | β6.495 | 42.317 |
| 16 | β7.265 | 41.547 |
| 17 | β7.898 | 40.914 |
| 18 | β7.665 | 41.147 |
| 19 | β10.156 | 38.656 |
| 20 | β9.901 | 38.911 |
| 21 | β5.431 | 43.381 |
| 22 | β5.312 | 43.500 |
| 23 | β7.603 | 41.209 |
| 24 | β7.498 | 41.314 |
| 25 | β11.412 | 37.400 |
| 26 | β11.100 | 37.712 |
| 27 | β5.874 | 42.938 |
| 28 | β4.989 | 43.823 |
| 29 | β9.238 | 39.574 |
| 30 | β8.989 | 39.823 |
| 31 | β5.544 | 43.268 |
| 32 | β5.303 | 43.509 |
| TABLE 9 |
| Gain in S/N Ratios |
| Inverse Method | Adjoint Method | ||||||
| Variable | Level 1 | Level 2 | Gain | Variable | Level 1 | Level 2 | Gain |
| X1 | β8.185 | β7.745 | β0.440 | X1 | 40.627 | 41.067 | β0.440 |
| X2 | β8.187 | β7.742 | β0.445 | X2 | 40.625 | 41.070 | β0.445 |
| X3 | β8.249 | β7.680 | β0.569 | X3 | 40.563 | 41.132 | β0.569 |
| X4 | β7.949 | β7.980 | 0.031 | X4 | 40.863 | 40.832 | 0.031 |
| X5 | β7.069 | β8.860 | 1.791 | X5 | 41.743 | 39.952 | 1.791 |
| X6 | β8.318 | β7.611 | β0.706 | X6 | 40.494 | 41.201 | β0.706 |
| X7 | β7.976 | β7.954 | β0.022 | X7 | 40.836 | 40.858 | β0.022 |
| X8 | β8.824 | β7.105 | β1.718 | X8 | 39.988 | 41.707 | β1.718 |
| X9 | β8.188 | β7.742 | β0.446 | X9 | 40.625 | 41.070 | β0.446 |
| X10 | β6.358 | β9.571 | 3.212 | X10 | 42.454 | 39.241 | 3.212 |
| X11 | β8.101 | β7.828 | β0.273 | X11 | 40.711 | 40.984 | β0.273 |
| X12 | β7.821 | β8.108 | 0.287 | X12 | 40.991 | 40.704 | 0.287 |
| X13 | β7.562 | β8.367 | 0.805 | X13 | 41.250 | 40.445 | 0.805 |
| X14 | β7.315 | β8.615 | 1.300 | X14 | 41.497 | 40.197 | 1.300 |
| X15 | β7.590 | β8.339 | 0.749 | X15 | 41.222 | 40.473 | 0.749 |
| X16 | β7.982 | β7.947 | β0.035 | X16 | 40.830 | 40.865 | β0.035 |
| X17 | β7.832 | β8.097 | 0.265 | X17 | 40.980 | 40.715 | 0.265 |
| TABLE 10 |
| S/N Ratio Analysis |
| S/N ratio-optimal system | 44.54 dB | |
| S/N ratio-original system | 42.56 dB | |
| Gain | β1.98 dB | |
The adjoint matrix method is applied to another case with 12 variables. In this example, there are 58 normals and 30 abnormals. The MDs corresponding to normals are computed by using MTS methodβthe average MD is 0.92. The reason for this discrepancy is the existence of multi-collinearity. This is clear from the correlation matrix (Table 11), which shows that the variables X10, X11 and X12 have high correlations with each other. The determinant of the matrix is also estimated and it is found to be 8.693Γ10β12 (close to zero), indicating that the matrix is almost singular. Presence of multi-collinearity will also affect the other stages of the MTS method. Hence, adjoint matrix method is used to perform the analysis.
Adjoint Matrix Method
The adjoint of correlation matrix is shown in Table 12.
| TABLE 11 |
| Correlation Matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | |
| X1 | 1 | 0.358 | β0.085 | β0.024 | 0.005 | 0.057 | β0.149 | β0.128 | β0.046 | 0.105 | β0.055 | β0.055 |
| X2 | 0.358 | 1 | 0.014 | 0.022 | 0.003 | β0.097 | β0.271 | β0.079 | 0.061 | 0.325 | 0.023 | 0.023 |
| X3 | β0.085 | 0.014 | 1 | 0.0769 | 0.0708 | 0.0577 | 0.3138 | 0.1603 | 0.0815 | 0.4945 | 0.5286 | 0.5333 |
| X4 | β0.024 | 0.022 | 0.0769 | 1 | β0.135 | β0.018 | 0.296 | β0.206 | 0.062 | 0.597 | 0.624 | 0.622 |
| X5 | 0.005 | 0.003 | 0.0708 | β0.135 | 1 | 0.123 | 0.264 | 0.114 | 0.053 | 0.536 | 0.560 | 0.559 |
| X6 | 0.057 | β0.097 | 0.0577 | β0.018 | 0.123 | 1 | 0.353 | 0.055 | 0.056 | 0.063 | 0.096 | 0.096 |
| X7 | β0.149 | β0.271 | 0.3138 | 0.296 | 0.264 | 0.353 | 1 | 0.103 | 0.092 | 0.395 | 0.508 | 0.508 |
| X8 | β0.128 | β0.079 | 0.1603 | β0.206 | 0.114 | 0.055 | 0.103 | 1 | β0.153 | β0.032 | β0.002 | β0.0004 |
| X9 | β0.046 | 0.061 | 0.0815 | 0.062 | 0.053 | 0.056 | 0.092 | β0.153 | 1 | 0.116 | 0.104 | 0.104 |
| X10 | 0.105 | 0.325 | 0.4945 | 0.597 | 0.536 | 0.063 | 0.395 | β0.032 | 0.116 | 1 | 0.951 | 0.951 |
| X11 | β0.055 | 0.023 | 0.5286 | 0.624 | 0.560 | 0.096 | 0.508 | β0.002 | 0.104 | 0.951 | 1 | 0.999 |
| X12 | β0.055 | 0.023 | 0.5333 | 0.622 | 0.559 | 0.096 | 0.508 | β0.0004 | 0.104 | 0.951 | 0.999 | 1 |
| TABLE 12 |
| Adjoint Matrix |
| X1 | X2 | X3 | X4 | X5 | X6 | |
| X1 | β1.00912Eβ10 | β4.70272Eβ10 | β1.61623Eβ10 | β2.76032Eβ10 | β2.57713Eβ10 | β5.48951Eβ12 |
| X2 | β4.70263Eβ10 | β2.50034Eβ09 | β9.18237Eβ10 | β1.55621Eβ09 | β1.45406Eβ09 | β2.10511Eβ11 |
| X3 | β1.61527Eβ10 | β9.17746Eβ10 | β1.06463Eβ09 | β1.63137Eβ09 | β1.50922Eβ09 | β5.28862Eβ13 |
| X4 | β2.7594Eβ10 | β1.55576Eβ09 | β1.63154Eβ09 | β2.56985Eβ09 | β2.37158Eβ09 | β3.57245Eβ13 |
| X5 | β2.57631Eβ10 | β1.45366Eβ09 | β1.50939Eβ09 | β2.37159Eβ09 | β2.20389Eβ09 | β1.73783Eβ12 |
| X6 | ββ5.4903Eβ12 | β2.10556Eβ11 | β5.23064Eβ13 | β3.64155Eβ13 | β1.74411Eβ12 | β1.06058Eβ11 |
| X7 | β5.04604Eβ12 | β2.83284Eβ11 | β2.05079Eβ11 | β3.50574Eβ11 | β3.34989Eβ11 | β4.37759Eβ12 |
| X8 | β7.12086Eβ13 | β3.11071Eβ12 | β9.19606Eβ12 | β1.10978Eβ11 | β1.29962Eβ11 | β1.97598Eβ13 |
| X9 | β1.43722Eβ12 | β8.0730Eβ13 | β1.32908Eβ11 | β1.89556Eβ11 | β1.78591Eβ11 | β5.79657Eβ13 |
| X10 | β1.66565Eβ09 | β8.74446Eβ09 | ββ3.1875Eβ09 | ββ5.4102Eβ09 | β5.05514Eβ09 | β7.53194Eβ11 |
| X11 | β7.60305Eβ10 | β4.38609Eβ09 | β5.67096Eβ09 | β6.22205Eβ09 | β5.62443Eβ09 | β5.56545Eβ13 |
| X12 | β4.14615Eβ10 | β1.61673Eβ09 | β5.08692Eβ09 | β4.90701Eβ09 | β4.36272Eβ09 | β6.98298Eβ11 |
| X7 | X8 | X9 | X10 | X11 | X12 | |
| X1 | ββ5.043Eβ12 | β7.14809Eβ13 | β1.43647Eβ12 | β1.66567Eβ09 | β7.66095Eβ10 | β4.08691Eβ10 |
| X2 | β2.83118Eβ11 | β3.09613Eβ12 | β8.03373Eβ13 | ββ8.7444Eβ09 | β4.41674Eβ09 | β1.58527Eβ09 |
| X3 | β2.04944Eβ11 | β9.18812Eβ12 | ββ1.3292Eβ11 | β3.18575Eβ09 | β5.68418Eβ09 | β5.10159Eβ09 |
| X4 | β3.50392Eβ11 | β1.10855Eβ11 | β1.89581Eβ11 | β5.40857Eβ09 | β6.24469Eβ09 | β4.93127Eβ09 |
| X5 | β3.34823Eβ11 | β1.29848Eβ11 | β1.78615Eβ11 | ββ5.0537Eβ09 | β5.64554Eβ09 | β4.38529Eβ09 |
| X6 | β4.37752Eβ12 | β1.97695Eβ13 | β5.79622Eβ13 | β7.5335Eβ11 | β3.17881Eβ13 | ββ6.9595Eβ11 |
| X7 | β1.58563Eβ11 | β1.42556Eβ12 | β1.00253Eβ12 | β8.62928Eβ11 | β1.25906Eβ10 | ββ1.486Eβ10 |
| X8 | β1.42569Eβ12 | β1.01743Eβ11 | β1.84668Eβ12 | β1.04492Eβ11 | β1.34899Eβ10 | β1.25096Eβ10 |
| X9 | β1.00246Eβ12 | β1.84666Eβ12 | β9.46854Eβ12 | β6.93471Eβ12 | β2.47767Eβ11 | β5.98708Eβ11 |
| X10 | β8.62349Eβ11 | β1.03982Eβ11 | β6.92086Eβ12 | β3.07209Eβ08 | β1.50768Eβ08 | β6.10343Eβ09 |
| X11 | β1.26294Eβ10 | β1.35001Eβ10 | β2.47494Eβ11 | β1.49692Eβ08 | β2.88114Eβ07 | β2.83899Eβ07 |
| X12 | β1.48962Eβ10 | β1.25168Eβ10 | β5.98339Eβ11 | β6.21375Eβ09 | ββ2.8383Eβ07 | β2.97854Eβ07 |
After computing MDA values for normals, the measurement scale is validated by computing abnormal MDA values. FIG. 3 indicates that there is a clear distinction between normals and abnormals.
In the next step, important variables are selected using L16(215) array. The S/N ratio analysis was performed based on larger-the-better criterion in usual way. The gains in S/N ratios are shown in Table 13. From this table, it is clear that the variables X1-X2-X3- X4- X6- X10-X11-X12 have positive gains and hence they are important. The confirmation run with these variables (FIG. 4) indicates that distinction (between normals and abnormals) is very good.
| TABLE 13 |
| Gain in S/N ratio |
| Variable | Level 1 | Level 2 | Gain | |
| X1 | β102.90 | β105.01 | 2.12 | |
| X2 | β103.53 | β104.38 | 0.86 | |
| X3 | β103.84 | β104.07 | 0.22 | |
| X4 | β103.72 | β104.19 | 0.47 | |
| X5 | β104.04 | β103.86 | β0.18 | |
| X6 | β103.87 | β104.04 | 0.16 | |
| X7 | β104.18 | β103.72 | β0.46 | |
| X8 | β104.14 | β103.77 | β0.37 | |
| X9 | β104.33 | β103.58 | β0.76 | |
| X10 | β103.51 | β104.40 | 0.90 | |
| X11 | β103.78 | β104.13 | 0.35 | |
| X12 | β103.43 | β104.48 | 1.05 | |
Therefore, adjoint matrix method can safely replace inverse matrix method as it is as efficient as inverse matrix method in general and more efficient when there are problems of multi-collinearity.
From the 17 variables, eight subsets (as shown in Table 14) are selected. These subsets are selected to illustrate the MMD methodology; there is no rational for this selection. It is to be noted that the number of variables in each subset are not the same.
| TABLE 14 |
| Subsets for MMD analysis |
| Subset | Variables | |
| S1 | X1 - X2 - X3 - X4 | |
| S2 | X5 - X6 - X7 - X8 | |
| S3 | X9 - X10 - X11 - X12 | |
| S4 | X13 - X14 - X15 - X16 - X17 | |
| S5 | X3 - X4 - X5 - X6 | |
| S6 | X10 - X11 - X12 - X13 - X14 - X15 | |
| S7 | X14 - X15 - X16 - X17 | |
| S8 | X2 - X5 - X7 - X10 - X12 - X13 - X14 - X15 | |
For each subset, Mahalanobis distances are computed with the help of correlation matrices of respective variables. Therefore, we have eight sets of MDs (for normals and abnormals) corresponding to the subsets. The βMDs provide data corresponding to the subsets that are considered as control factors. Tables 15 and 16 show sample data (βMDs) for normals and abnormals.
| TABLE 15 |
| βMDs for normals (sample data) |
| S. No | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 |
| β1 | 0.873 | 0.545 | 0.707 | 0.756 | 0.796 | 0.505 | 0.832 | 0.574 |
| β2 | 0.762 | 0.540 | 0.929 | 0.710 | 0.499 | 0.688 | 0.606 | 0.807 |
| β3 | 1.022 | 0.688 | 0.550 | 0.623 | 0.955 | 0.479 | 0.697 | 0.613 |
| β4 | 1.102 | 0.544 | 0.769 | 0.740 | 1.225 | 0.648 | 0.827 | 0.681 |
| β5 | 1.022 | 0.640 | 0.602 | 0.888 | 0.815 | 0.782 | 0.934 | 0.695 |
| . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | . | . | . |
| 196 | 1.041 | 0.786 | 1.691 | 1.513 | 0.500 | 1.550 | 1.539 | 1.411 |
| 197 | 1.467 | 1.310 | 2.101 | 1.201 | 1.457 | 1.481 | 0.611 | 1.373 |
| 198 | 1.086 | 1.278 | 0.974 | 1.406 | 1.410 | 1.834 | 0.994 | 1.648 |
| 199 | 1.238 | 0.999 | 1.107 | 1.061 | 1.206 | 1.132 | 0.964 | 1.700 |
| 200 | 1.391 | 0.924 | 0.979 | 0.680 | 1.094 | 2.156 | 0.750 | 1.844 |
| TABLE 16 |
| βMDs for abnormals (sample data) |
| S. No | S1 | S2 | S3 | S4 | S5 | S6 | S7 | S8 |
| β1 | 1.339 | 2.930 | 2.610 | 3.428 | 2.574 | 3.277 | 2.913 | 3.734 |
| β2 | 1.491 | 3.469 | 1.931 | 1.511 | 3.267 | 3.388 | 1.687 | 3.932 |
| β3 | 1.251 | 2.700 | 0.742 | 2.631 | 2.447 | 3.322 | 2.660 | 4.365 |
| β4 | 2.124 | 2.507 | 2.041 | 3.240 | 2.518 | 3.058 | 2.009 | 3.395 |
| β5 | 1.010 | 2.182 | 2.867 | 1.279 | 1.861 | 4.035 | 1.090 | 4.440 |
| . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | . | . | . |
| . | . | . | . | . | . | . | . | . |
| 13 | 1.769 | 2.819 | 6.544 | 2.153 | 2.352 | 6.023 | 2.177 | 5.776 |
| 14 | 1.898 | 2.045 | 3.817 | 4.551 | 2.443 | 10.213 | 1.969 | 9.275 |
| 15 | 1.624 | 12.681 | 2.116 | 3.672 | 12.248 | 9.064 | 1.202 | 11.426 |
| 16 | 5.453 | 13.314 | 3.630 | 1.022 | 13.515 | 10.095 | 1.108 | 12.121 |
| 17 | 4.511 | 16.425 | 5.489 | 3.684 | 12.027 | 11.142 | 2.264 | 10.939 |
After arranging the data (βMDs) in this manner, MMD analysis is carried out. In this analysis, MMDs are Mahalanobis distances obtained from βMDs. Table 17 and 18 provide sample values of MMDs for normals and abnormals respectively.
| TABLE 17 |
| MMDs for normals (sample values) |
| Condition | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | . . . | 198 | 199 | 200 |
| MMD | 0.558 | 0.861 | 0.425 | 0.786 | 0.413 | 1.655 | 0.357 | 0.660 | 0.641 | 0.717 | . . . | 2.243 | 2.243 | 4.979 |
| TABLE 18 |
| MMDs for abnormals (sample values) |
| Condition | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | . . . | 15 | 16 | 17 |
| MMD | 22.52 | 29.86 | 30.61 | 23.47 | 27.05 | 57.12 | 61.61 | 52.64 | 50.77 | 66.15 | . . . | 515.50 | 601.30 | 592.37 |
The next step is to assign the subsets to the columns of a suitable orthogonal array. Since there are eight subsets, L12(211) array was selected. The abnormal MMDs are computed for each run of this array. After performing average response analysis, gains in S/N ratios are computed for all the subsets. These details are shown in Table 19.
| TABLE 19 |
| Gain in S/N ratios |
| Level 1 | Level 2 | Gain | |
| S1 | 15.498 | 18.053 | β2.555 | |
| S2 | 17.463 | 16.089 | 1.374 | |
| S3 | 16.712 | 16.839 | β0.127 | |
| S4 | 15.925 | 17.627 | β1.702 | |
| S5 | 17.626 | 15.926 | 1.700 | |
| S6 | 17.243 | 16.309 | 0.934 | |
| S7 | 15.683 | 17.869 | β2.186 | |
| S8 | 18.556 | 14.996 | 3.560 | |
In order to demonstrate the applicability of Gram-Schmidt process to predict abnormal conditions without computing the Mahalanobis distances, it is applied to the medical diagnosis case example previously discussed with 17 abnormal conditions. Out of 17 conditions, the first ten conditions are considered mild and the remaining seven conditions are considered as medium. This judgment was made by Dr. Kanetaka, who is a liver disease diagnosis specialist in Japan. For the purposes of prediction and since true values of severity are unknown, a value of 3 is assigned for the mild group and a value of 9 is assigned for the medium group. Table 20 provides the summary of data analysis for abnormals in this case example generated by GSP. FIG. 5 shows that there is a good match between actual level of severity and predicted values.
Intentionally, two variables with zero standard deviations are introduced. These variables are considered as the first and second variables and now the total number of variables is 19. Table 21 provides the summary of data analysis for abnormals in this instance. Like the data of FIG. 5, there is a good match between actual level of severity and predicted values as shown in FIG. 6.
| TABLE 20 |
| Summary of data analysis |
| Abnormal | Mi(*) | U1 | U2 | U3 | U4 | U5 | U6 | U7 | U8 | U9 |
| β1 | 3 | 12.3150 | β4.3293 | 0.5390 | β0.1329 | β275.8953 | 8.9867 | 3.6369 | 39.6987 | 3.9237 |
| β2 | 3 | 16.3150 | β3.8353 | β0.5307 | β0.1940 | β319.9262 | 18.5124 | 10.4489 | 8.8425 | 121.4416 |
| β3 | 3 | 7.3150 | β4.9467 | β0.4990 | β0.1476 | β290.7776 | 5.0656 | β2.3387 | 37.3756 | 14.1496 |
| β4 | 3 | 8.3150 | 4.1768 | 0.8948 | β0.3463 | β290.5790 | 7.1602 | 6.2609 | 33.9778 | β6.3051 |
| β5 | 3 | 7.3150 | 4.0533 | 0.1872 | β0.2073 | β343.8372 | β1.7947 | 2.2598 | β13.7225 | 107.8699 |
| β6 | 3 | 6.3150 | 3.9298 | β0.0204 | 0.0298 | β291.4935 | 22.5440 | 42.6023 | 25.0359 | 46.8510 |
| β7 | 3 | 6.3150 | 3.9298 | β0.0204 | 0.0298 | β291.4935 | 22.5440 | 42.6023 | 25.0359 | 46.8510 |
| β8 | 3 | 15.3150 | 5.0412 | 0.0479 | β0.2370 | β341.6418 | 2.4900 | 10.2859 | 24.1717 | β39.7420 |
| β9 | 3 | 10.3150 | 4.4238 | β0.3900 | β0.0740 | β305.7720 | 6.8612 | 25.8275 | 38.7990 | β12.8711 |
| 10 | 3 | 16.3150 | 5.1647 | 0.7555 | 0.5240 | β370.9332 | 1.6020 | 4.0528 | 31.8943 | β47.3453 |
| 11 | 9 | 1.3150 | β5.6876 | 0.1555 | β0.2124 | β392.3597 | 8.9167 | 6.3679 | 72.2190 | 98.6360 |
| 12 | 9 | 11.3150 | 4.5473 | β0.1824 | 0.0890 | β184.5821 | 12.6035 | 7.9285 | 28.6425 | 68.2139 |
| 13 | 9 | 28.3150 | 6.6465 | 0.3466 | β0.1298 | β350.1662 | 8.8457 | 9.9694 | β1.6063 | β39.3698 |
| 14 | 9 | 16.3150 | 5.1647 | β0.8445 | β0.1499 | β214.0392 | 13.3253 | 3.1878 | 4.0448 | 19.3168 |
| 15 | 9 | 7.3150 | 4.0533 | β0.7128 | β0.2239 | β411.5070 | 123.3593 | 28.1580 | β25.7886 | β99.2166 |
| 16 | 9 | 11.3150 | 4.5473 | β1.9824 | β1.1442 | β501.5225 | 129.5946 | 18.9048 | 58.9901 | β172.0809 |
| 17 | 9 | 16.3150 | 5.1647 | β1.2445 | β1.0684 | β529.9412 | 114.1371 | β43.2615 | 278.3264 | β248.7397 |
| SN Ratio | 0.0532 | 0.0103 | 0.0072 | 0.0144 | 0.0936 | 0.0255 | 0.0027 | 0.0113 | 0.0018 | |
| Beta | 1.7478 | 0.4151 | β0.0568 | β0.0424 | β49.6562 | 6.0562 | 1.0932 | 6.8292 | β4.0406 | |
| Abnormal | U10 | U11 | U12 | U13 | U14 | U15 | U16 | U17 | Yi (Predicted) |
| β1 | β4.9805 | β6.0171 | 91.7303 | 185.2230 | 10.9136 | 0.3937 | 2.6036 | β0.0004 | 3.2929 |
| β2 | 38.7007 | 3.1131 | 15.1255 | 22.6190 | 40.5201 | β0.1010 | β1.7837 | 0.1886 | 4.1392 |
| β3 | 13.9428 | 1.1665 | 15.5386 | 146.1268 | 34.6194 | 0.9444 | β2.8997 | 0.3113 | 3.2915 |
| β4 | 9.1650 | β16.1595 | 59.3059 | 190.3586 | 11.0576 | β0.0293 | β0.5074 | β1.1519 | 4.3706 |
| β5 | 100.2455 | β3.5307 | β9.7133 | 59.8334 | 41.1504 | β0.1751 | β2.4316 | 0.3238 | 3.9720 |
| β6 | β26.2873 | β11.2102 | 56.9628 | 12.4162 | 22.6997 | 0.1299 | β3.6248 | 0.2957 | 3.8183 |
| β7 | β26.2873 | β11.2102 | 56.9628 | 12.4162 | 40.6997 | 0.0817 | β4.4633 | 0.2882 | 4.0515 |
| β8 | 11.8856 | β2.4939 | 73.6173 | 306.5702 | 40.4239 | β0.0696 | 1.4407 | 0.3877 | 5.5233 |
| β9 | 14.9592 | β0.2392 | 151.5367 | 257.3016 | β26.2885 | 0.2335 | β1.2968 | 0.1731 | 4.3308 |
| 10 | 47.0552 | β7.1287 | 134.6759 | 27.4059 | 63.9149 | β0.1744 | β4.2087 | β1.3234 | 4.9031 |
| 11 | 111.2423 | 4.9965 | 80.8134 | 41.7231 | β16.1479 | 0.2477 | 1.7410 | β1.4001 | 2.6406 |
| 12 | 130.3151 | β33.2593 | 38.1854 | 7.0779 | β28.0403 | 0.6798 | 0.5761 | β0.3027 | 3.8651 |
| 13 | 197.4488 | β47.7848 | 31.1967 | β16.6519 | 13.2784 | 0.0870 | 0.6182 | β3.0672 | 7.7527 |
| 14 | 106.7722 | β30.7073 | β41.6234 | 316.9091 | 109.3155 | β0.2722 | β1.5941 | 0.7008 | 7.1760 |
| 15 | β111.3313 | β65.5057 | β54.0664 | 274.0644 | 97.0167 | β0.3268 | β12.9302 | β0.3416 | 9.7211 |
| 16 | β60.0760 | β77.8632 | β90.3734 | 51.7804 | 114.7708 | β0.4847 | β19.8192 | 1.6757 | 12.8493 |
| 17 | 62.1727 | β78.6159 | β84.2628 | 304.0971 | 110.2218 | β0.7889 | β22.3489 | 0.8951 | 15.0976 |
| SN Ratio | 0.0087 | 0.0538 | 0.0003 | 0.0197 | 0.0234 | 0.0002 | 0.0150 | 0.0006 | |
| Beta | 6.7947 | β4.7485 | 1.3030 | 18.9830 | 6.7624 | β0.0061 | β0.8148 | β0.0275 | |
| (*)Mi = True level of severity |
| TABLE 21 |
| Summary of data analysis with 19 variables (2 variables with zero variance) |
| Abnormal | Mi(*) | U1 | U2 | U3 | U4 | U5 | U6 | U7 | U8 | U9 | U10 |
| β1 | 3 | 0 | β5 | 12.3150 | β4.8232 | 0.5086 | β0.0050 | β247.2041 | 5.9966 | 9.8345 | 38.4890 |
| β2 | 3 | 0 | β10 | 16.3150 | β5.8111 | β0.6521 | β0.3753 | β339.3995 | 13.2639 | 21.7816 | 3.5642 |
| β3 | 3 | β3 | 0 | 7.3150 | β5.6876 | β0.5445 | β0.3327 | β317.0591 | 0.4772 | β2.7434 | 27.4543 |
| β4 | 3 | 0 | β4 | 8.3150 | 4.6707 | 0.9390 | 0.0301 | β121.7981 | 7.4455 | 24.5579 | 31.1738 |
| β5 | 3 | β4 | 0 | 7.3150 | 3.3124 | 0.1555 | β0.1327 | β222.0592 | β2.6516 | 8.1690 | β27.5355 |
| β6 | 3 | β4 | 0 | 6.3150 | 3.8063 | β0.0142 | 0.1228 | β174.4202 | 20.2183 | 65.1913 | 10.9267 |
| β7 | 3 | β4 | 5 | 6.3150 | 2.0775 | β0.1205 | β0.0104 | β189.7995 | 21.4092 | 69.3419 | 17.3681 |
| β8 | 3 | β4 | 0 | 15.3150 | 5.0412 | 0.0617 | β0.0996 | β217.5968 | 0.0803 | 11.0121 | 5.9138 |
| β9 | 3 | 0 | 7 | 10.3150 | 4.5473 | β0.3686 | β0.0551 | β208.6192 | 3.1314 | 30.5279 | 18.3591 |
| 10 | 3 | 0 | 7 | 16.3150 | 2.2010 | 0.5871 | 0.7156 | β221.5125 | 1.9417 | 11.9757 | 31.8713 |
| 11 | 9 | 0 | 7 | 1.3150 | β6.4285 | 0.1099 | β0.1937 | β379.0782 | 3.8004 | 18.6867 | 69.3825 |
| 12 | 9 | β4 | 7 | 11.3150 | 2.2010 | β0.3129 | 0.0748 | β72.5950 | 16.7374 | 24.9889 | 30.3079 |
| 13 | 9 | 0 | 5 | 28.3150 | 4.9177 | 0.2541 | 0.0171 | β210.4862 | 10.2735 | 16.2116 | β14.0044 |
| 14 | 9 | 0 | β10 | 16.3150 | 2.0775 | β1.0205 | β0.3085 | β122.6345 | 17.0910 | 23.8848 | 8.1721 |
| 15 | 9 | β5 | 0 | 7.3150 | 2.0775 | β0.8205 | β0.2677 | β308.3187 | 119.6389 | 150.9857 | 34.6156 |
| 16 | 9 | 0 | β10 | 11.3150 | 4.6707 | β1.9610 | β1.3495 | β426.0316 | 125.1581 | 152.5561 | 106.4050 |
| 17 | 9 | β6 | 0 | 16.3150 | 4.3003 | β1.2838 | β1.1606 | β429.4657 | 109.5444 | 65.6972 | 335.1143 |
| SN Ratio | 0.014554 | 5.1Eβ06 | 0.0532 | 0.0031 | 0.0103 | 0.0108 | 0.0692 | 0.0254 | 0.0306 | 0.0131 | |
| Beta | β0.29224 | β0.0137 | 1.7478 | 0.2319 | β0.0664 | β0.0443 | β37.0105 | 5.8358 | 7.3456 | 8.5277 | |
| Abnormal | U11 | U12 | U13 | U14 | U15 | U16 | U17 | U18 | U19 | Yi (predicted) |
| β1 | 53.9519 | 3.2878 | β0.3439 | 91.0072 | 218.5704 | 96.9192 | 0.5103 | 3.8723 | 1.2164 | 3.4401 |
| β2 | 159.4430 | 47.6378 | 25.7567 | 19.9424 | 38.6237 | 46.2383 | β0.0509 | β0.9670 | β0.6080 | 4.5581 |
| β3 | 51.3503 | 3.1881 | β0.1598 | 5.8983 | 115.8052 | 36.4650 | 0.9933 | 0.8872 | 0.2391 | 3.3564 |
| β4 | 53.6886 | 53.5027 | 8.1512 | 51.5605 | 235.2134 | 51.7450 | 0.6046 | 0.4109 | 2.1180 | 3.9019 |
| β5 | 138.1347 | 114.7010 | 28.9314 | β20.1444 | 90.6090 | 21.4934 | 0.4055 | β1.6543 | 2.2187 | 5.4607 |
| β6 | 52.9944 | 27.4622 | 7.3102 | 44.3797 | 61.9772 | 61.8750 | 0.7963 | 0.5993 | 3.5203 | 4.4063 |
| β7 | 82.0347 | 36.8372 | 15.0119 | 58.2735 | 81.3859 | 106.1382 | 0.7853 | 1.9013 | 3.6687 | 5.3589 |
| β8 | β24.2147 | 26.4896 | 2.1381 | 46.1576 | 340.1646 | 99.7916 | 0.6016 | 3.2652 | 3.3896 | 4.9194 |
| β9 | β8.2040 | 44.8183 | 18.3955 | 140.4198 | 327.9404 | 87.4870 | 0.7817 | β0.4917 | 4.4366 | 5.4957 |
| 10 | 32.3709 | 77.3551 | 15.0769 | 128.5416 | 125.2338 | 194.5158 | 0.5961 | 2.2735 | 2.5394 | 5.3191 |
| 11 | 187.6399 | 124.2687 | 54.5396 | 103.9712 | 110.3184 | 88.6731 | 0.2902 | 1.7963 | β0.4638 | 6.5812 |
| 12 | 111.9320 | 173.9234 | 28.8658 | 55.6367 | 130.8877 | 50.1456 | 1.1011 | 2.4019 | 4.3666 | 7.7220 |
| 13 | β15.2533 | 222.0768 | 24.0090 | 40.3074 | 119.3488 | 73.5301 | 0.6111 | 0.7587 | 1.1598 | 7.5838 |
| 14 | 64.6856 | 144.6806 | 16.3409 | β40.5239 | 368.8209 | 75.5318 | 0.3851 | 1.9974 | 3.5958 | 7.0952 |
| 15 | 47.0662 | 60.0346 | 6.3440 | β69.0617 | 300.2732 | 41.4337 | 0.5561 | 0.2120 | 3.1967 | 7.2938 |
| 16 | 9.8985 | 117.4264 | 14.7535 | β96.3428 | 45.6021 | 44.7032 | 0.5710 | β1.0853 | 4.1349 | 8.9292 |
| 17 | 107.1847 | 208.8039 | 51.0812 | β54.2971 | 284.1616 | 78.3278 | 0.2070 | β3.1676 | 3.5720 | 12.3893 |
| SN Ratio | 0.0223 | 0.1451 | 0.0545 | 0.0006 | 0.0388 | 0.0288 | 0.0351 | 0.0020 | 0.0384 | |
| Beta | 9.7306 | 16.3878 | 3.2332 | 1.7585 | 26.0902 | 9.8617 | 0.0785 | 0.0860 | 0.3718 | |
| (*)Mi = True level of severity |
Publications mentioned in the specification are indicative of the levels of those skilled in the art to which the invention pertains. These publications are incorporated herein by reference to the same extent as if each individual publication was specifically and individually incorporated herein by reference.
The foregoing description is illustrative of particular embodiments of the invention, but is not meant to be a limitation upon the practice thereof. The following claims, including all equivalents thereof, are intended to define the scope of the invention.
1. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values; and
using said abnormal predicted values for a future prediction.
2. The process of claim 1 further comprising the step of:
computing dynamic signal to noise ratios for said normal Gram-Schmidt vectors and for said abnormal Gram-Schmidt vectors.
3. The process of claim 1 further comprising the step of: comparing said abnormal predicted values to said observable abnormal values of said plurality of abnormal datum.
4. The process of claim 1 wherein said observable abnormal values are assigned.
5. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values;
using said abnormal predicted values for a future prediction; and
computing dynamic signal to noise ratios for said normal Gram-Schmidt vectors and for said abnormal Gram-Schmidt vectors;
wherein said dynamic signal to noise ratio, Ξ·j is equivalent to:
Ξ²j2/Veββ(13)
where
Ξ² j = [ β i = 1 t β’ M i β’ β’ U ij ] / r ,
Mi is the ith value of said plurality of abnormal datum, Uij is selected from the group consisting of: said normal Gram-Schmidt vectors and said abnormal Gram-Schmidt vectors, Ve is
( β i = 1 t β’ U i β’ β’ j 2 - ( 1 / r ) β‘ [ β i = 1 t β’ M i β’ U i β’ β’ j ] 2 ) / ( t - 1 )
where i is an integer between 1 and t, and j is an integer between 1 and k.
6. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values; and
using said abnormal predicted values for a future prediction;
wherein said observations on k variables relates to medical diagnosis.
7. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values;
using said abnormal predicted values for a future prediction;
wherein said observations on k variables relates to quality of a manufactured product.
8. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values;
using said abnormal predicted values for a future prediction;
wherein said observations on k variables relates to financial markets.
9. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values;
using said abnormal predicted values for a future prediction;
wherein said observations on k variables relates to voice recognition.
10. A process for multivariate data analysis comprising the steps of:
using a computer in conjunction with a Gram-Schmidt orthogonalization process to determine normal Gram-Schmidt vectors defining a set of normal Gram-Schmidt coefficients corresponding to observable normal values of a plurality of normal datum where at least one of said plurality of normal datum has non-zero standard deviation;
computing abnormal Gram-Schmidt vectors corresponding to observable abnormal values of a plurality of abnormal datum from said set of normal Gram-Schmidt coefficients;
computing a signal to noise ratio for said abnormal Gram-Schmidt vectors to obtain abnormal predicted values;
using said abnormal predicted values for a future prediction;
wherein said observations on k variables relates to TV picture recognition.
11. A process for multivariate analysis comprising the steps of:
using a computer to calculate Gram-Schmidt orthogonal vectors satisfying the equation:
Ui=(u11, u12, . . . , u1n)
U2=(u21, u22, . . . , u2n)
Uk=(uk1, uk2, . . . , ukn)
for a sample size n and observations on k variables, wherein the mean of said Gram-Schmidt orthogonal vectors is zero;
calculating for each of said Gram-Schmidt vectors a standard deviation, where at least one of said Gram-Schmidt vectors has a non-zero standard deviation; and
calculating a Mahalanobis distance corresponding to each of the k observations that satisfies the equation:
MDj=(1/k)[(u1j2/s12)+(u2j2/s22)+ . . . +(ukj2/sk2)]
where j is an integer from 1 . . . n.
12. The process of claim 11 further comprising creating a Mahalanobis space database comprising Gram-Schmidt vector means, Gram-Schmidt standard deviations, Gram-Schmidt coefficients, and Mahalanobis distances corresponding to the k observations.
13. The process of claim 11 wherein said observations on k variables relates to medical diagnosis.
14. The process of claim 11 wherein said observations on k variables relates to quality of a manufactured product.
15. The process of claim 11 wherein said observations on k variables relates to financial markets.
16. The process of claim 11 wherein said observations on k variables relates to voice recognition.
17. The process of claim 11 wherein said observations on k variables relates to TV picture recognition.