-
-
Notifications
You must be signed in to change notification settings - Fork 562
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Scree Plot for Pricipal Component Analysis #804
Comments
@pawaldia Awesome suggestion! We are catching up on issues and PRs since we just returned from a short hiatus. We will respond properly asap. |
I've done a mockup of some code to make this style of Scree plot:
|
@lschumm - fyi - we have a visualizer to do this - it can be found here: https://github.com/DistrictDataLabs/yellowbrick/blob/develop/yellowbrick/features/decomposition.py |
As directed by @pdamodaran , there's already a scree plot visualizer present in yellowbrick for this purpose. I am closing this issue for now. |
Are there any Elbow-like tool for Determining optimal PCA count? |
@BrandonKMLee I had been working on an explained variance plot for this - the WIP PR is here: #1037 -- it's been a while since I've taken a look at it, but if you want to help get it across the line or just use the code from that PR, I think that's what you're looking for. |
Describe the solution you'd like
I would love to have a plot that shows the amount of variance explained by the prinicipal components. It can help to reduce the dimensionality of the features to the dimensions that explains the variance in the dataset the most. It will help to find the optimal principal components without extensive searching.
Examples
This plot shows that around 350 features out of total 784 explains about 95% of the variance.
The text was updated successfully, but these errors were encountered: