Health & Environmental Research Online (HERO)


Print Feedback Export to File
86998 
Journal Article 
Positive matrix factorization: a non-negative factor model with optimal utilization of error estimates of data values 
Paatero, P; Tapper, U 
1994 
Yes 
Environmetrics
ISSN: 1180-4009
EISSN: 1099-095X 
111-126 
Maj and Tor Nessling Foundation. #A new variant 'PM F' of factor analysis is described. It is assumed that X is a matrix of observed data and "sigma" is the known matrix of standard deviations of elements of X. Both X and "sigma" are of dimensions n x m. The method solves the bilinear matrix problem X = CF + E where G is the unknown left hand factor matrix (scores) of dimensions n x p. F is the unknown right hand factor matrix (loadings) of dimensions p x m, and E is the matrix of residuals. The problem is solved in the weighted least squares sense: G and F are determined so that the Frobenius norm of E divided (element-by-element) by u is minimized. Furthermore, the solution is constrained so that all the elements of C and F are required to be non-negative. It is shown that the solutions by PMF are usually different from any solutions produced by the customary factor analysis (FA, i.e. principal component analysis (PCA) followed by rotations). Usually PMF pro-duces a better fit to the data than FA. Also, the result of PF is guaranteed to be non-negative, while the result of FA often cannot be rotated so that all negative entries would be eliminated. Different possible application areas of the new method are briefly discussed. In environmental data, the error estimates of data can be widely varying and non-negativity is often an essential feature of the underlying models. Thus it is concluded that PMF is better suited than FA or PCA in many environmental applications. Examples of successful applications of PMF are shown in companion papers. 
FACTOR ANALYSIS; PRINCIPAL COMPONENT ANALYSIS; WEIGHTED LEAST SQUARES; ALTERNATING REGRESSION; ERROR ESTIMATES; SCALING; REPETITIVE MEASUREMENTS