Cross Validated Asked by foobar on January 6, 2021
For PCA, we can print out the number of components vs % variance explained, like in the following picture:
And as human practitioners, we’re typically instructed to choose the number of components at the inflection point of the curve close to explaining all the variance.
Is there an algorithm that looks at the variance explained, and just automatically choose where that inflection point should be?
Parallel Analysis is the standard way to choose the number of components algorithmically. It creates a sampling distribution for each of the eigenvalues and performs a series of hypothesis testing.
Note that this is not the exact conceptualization you mentioned because PA is based on eigenvalues, not the proportions of explained variances.
Answered by LambdaPsi on January 6, 2021
Get help from others!
Recent Answers
Recent Questions
© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP