Independant Component Analysis

PCA works to maximize variance, ICA assumes that the features are mixtures of independant sources. It tries to isolate these independant sources that are mixed in this dataset.

Ex: say you only have 3 recordings of a piano, cello and tv that were played but not playing the same musique:

With ICA, it is possible to retrieve each tune independantly.

Blind source separation

Algorithm

g is tanh function, but can be other functions, we have different options. The decorrelation matrix is W.

ICA Assumptions:

The components are statistically independant.

Components have non Gaussian distributions. Very important.

The algo tries to maximise the non gaussianity. One way to calculate this is the w+ above. It is called negentropy. w+ is how we approximate it.

Applications

Brain scan data, retail earnings to find patterns.

PreviousDimensionality Reduction NextRandom Projection

Last updated 5 years ago

Was this helpful?