Compute PCA from a data matrix \(X\) (rows = samples, columns = features). The tool centers (and optionally standardizes) the data, builds the covariance matrix, finds eigenvalues/eigenvectors (principal components), reports variance explained, and projects to 2D/3D.
Principal Component Analysis (pca) Preview
Math Linear Algebra • Applications and Advanced Linear Algebra (capstone)
Frequently Asked Questions
Why do we center the data before PCA?
Centering makes the covariance describe variability around the mean. Without centering, the first component can be dominated by the mean offset rather than spread.
When should I use z-score standardization?
Use it when features are measured in different units or scales. Z-scoring prevents large-scale features from dominating the covariance.
What does “explained variance” mean?
It is the fraction of total variance captured by each principal component. Large explained variance for PC1 indicates a strong dominant direction of spread.
Why might some eigenvalues be near zero?
Near-zero eigenvalues indicate directions with little variability, often meaning the data lies close to a lower-dimensional subspace.