0 votes

[Figure: PCA pseudocode referenced below; d replaces D at step 5, Eq. 29 is the projection and Eq. 30 the reconstruction]

In machine learning, PCA is used to reduce the dimensionality of training data. However, from the above picture, I can't see where the reduction actually happens.

The input data x_i has D dimensions: x_i ∈ R^D.

The output data x still has D dimensions: x ∈ R^D.

Comment from sascha: Be sure to check where d replaces D, especially in step 5! So the idea is: instead of using all D components, use only d < D.
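For reference, in the standard PCA formulation the two equations the answers below refer to usually take a form like this (a sketch in the usual notation, which may not match the figure's exact symbols; here $m$ is the sample mean and $E_d$ is the $D \times d$ matrix whose columns are the $d$ dominant eigenvectors of the covariance matrix):

$$y = E_d^\top (x_i - m) \in \mathbb{R}^d \qquad \text{(projection, Eq. 29)}$$
$$\hat{x}_i = E_d\, y + m \in \mathbb{R}^D \qquad \text{(reconstruction, Eq. 30)}$$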

2 Answers

2 votes

The crucial element here is a misunderstanding of what the output is. In this pseudocode the output is y (equation 29), not x (equation 30), so you do reduce your data to d dimensions. The final equation just shows that, if you would like to move back to the original space, you can do so (obviously the data is recovered with some error, since a lot of information was dropped when going down to d dimensions).
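A minimal runnable sketch (using scikit-learn on a made-up data matrix X; the names X, Y and d are illustrative, not the pseudocode's) makes the dimension bookkeeping concrete: fit_transform returns the d-dimensional y of equation 29, and inverse_transform maps it back to D dimensions as in equation 30, with a nonzero reconstruction error.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))      # 200 samples, D = 10 dimensions (toy data)

d = 3                               # keep only d < D components
pca = PCA(n_components=d)

Y = pca.fit_transform(X)            # the reduced output (equation 29)
X_hat = pca.inverse_transform(Y)    # back-projection to the original space (equation 30)

print(Y.shape)                      # (200, 3)  -> this is where the reduction happens
print(X_hat.shape)                  # (200, 10) -> same shape as X, but only an approximation
print(((X - X_hat) ** 2).mean())    # reconstruction error from the dropped components
```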

1 vote

The important thing to understand when using PCA is the covariance matrix C(x) and its spectral decomposition. The eigenvalues and eigenvectors obtained from that decomposition are what is used to reduce the dimensionality.

For a D-dimensional training set, we get D eigenvalues and their corresponding eigenvectors. But in practice (especially in image-related applications) the data dimensions are highly correlated; in other words, many eigenvalues are close to zero and their eigenvectors are effectively redundant basis vectors. So discarding those vectors from the basis doesn't result in significant information loss.

Now, if you want to reduce the dimension of your input data from the original D to d < D, you project the input data onto the d dominant eigenvectors (those with the d largest eigenvalues). Eq. 29 gives the projection of the input data into the d-dimensional space, and Eq. 30 is used to reconstruct the original data; the reconstruction error depends on d (the number of eigenvectors kept).
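Here is a short NumPy sketch of that pipeline (centering, covariance matrix, spectral decomposition, projection onto the d dominant eigenvectors, reconstruction); the function name and data are made up for illustration.

```python
import numpy as np

def pca_project_reconstruct(X, d):
    """Project X (n_samples x D) onto its d dominant eigenvectors, then reconstruct."""
    m = X.mean(axis=0)                    # sample mean
    Xc = X - m                            # centered data
    C = np.cov(Xc, rowvar=False)          # D x D covariance matrix C(x)
    eigvals, eigvecs = np.linalg.eigh(C)  # spectral decomposition (eigenvalues ascending)
    order = np.argsort(eigvals)[::-1]     # re-order eigenvalues descending
    E_d = eigvecs[:, order[:d]]           # D x d basis: the d dominant eigenvectors
    Y = Xc @ E_d                          # projection into d dimensions (Eq. 29)
    X_hat = Y @ E_d.T + m                 # reconstruction in the original D dimensions (Eq. 30)
    retained = eigvals[order[:d]].sum() / eigvals.sum()   # fraction of variance kept
    return Y, X_hat, retained

# Illustrative usage: 500 correlated samples in D = 20 dimensions, reduced to d = 5.
rng = np.random.default_rng(1)
X = rng.normal(size=(500, 20)) @ rng.normal(size=(20, 20))
Y, X_hat, retained = pca_project_reconstruct(X, d=5)
print(Y.shape, X_hat.shape, round(retained, 3))   # (500, 5) (500, 20) <variance fraction>
```

The reconstruction error shrinks as d grows and vanishes at d = D, which is exactly the trade-off described above.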