On the determination of appropriate dimensionality in data with error

The study deals with the problem of determining true dimensionality of data-with-error scaled by Kruskal's multidimensional scaling technique. Artificial data was constructed for 6, 8, 12, 16, and 30 point configurations of 1, 2, or 3 true dimensions by adding varying amounts of error to the true distances. Results show how stress is affected by error, number of points, and number of dimensions, and indicate that stress and the “elbow” criterion are inadequate for purposes of identifying true dimensionality when there is error in the data. The Wagenaar-Padmos procedure for identifying true dimensionality and error level is discussed. A simplified technique, involving a measure calledConstraint, is suggested.