Cross Validated Asked by user3358740 on November 21, 2021
I am using R princomp
function (from stats
package) to run a PCA on a data set and I want to compare its output to that of the nonlinear dimensionality reduction method ISOMAP, which I am using under matlab through this toolbox: http://isomap.stanford.edu on the same dataset.
What I am interested in is the intrinsic dimensionality of the dataset as determined according to PCA and to ISOMAP separately, the ultimate goal being to check whether nonlinear dimensionality reduction works better on this dataset than PCA.
With princomp
I get the standard deviations associated to each component, while the ISOMAP package returns residual variances as a function of the manifold dimensionality. How do I compare these two quantities? In other words, how is the residual variance defined in ISOMAP?
This should be irrelevant, but the dataset is 54 points in 5 dimensions.
The original Isomap paper defined "residual variance" as follows (reference 42):
$$text{residual variance} = 1 - R^2(hat D_M, D_y)$$
where $R$ is the Pearson correlation coefficient over all entries of $hat D_M$ and $D_Y$. $hat D_M$ is the euclidean distance matrix for PCA and the geodesic distance matrix for Isomap. $D_Y$ is the euclidean distance matrix of the low dimensional embedding, this matrix changes with the number of dimensions you use for the embedding.
EDIT: This is numerically different than the explained variance of PCA derived from the eigenvalues and I don't know if there is a direct connection between the two of them.
EDIT: I asked here for the connection between residual and explained variance.
Answered by gdkrmr on November 21, 2021
Get help from others!
Recent Questions
Recent Answers
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP