Whitening transformation
Decorrelation method that converts the covariance matrix of a set of samples into an identity matrix
A whitening transformation or sphering transformation is a linear transformation that transforms a vector of random variables with a known covariance matrix into a set of new variables whose covariance is the identity matrix, meaning that they are uncorrelated and each have variance 1.[1] The transformation is called "whitening" because it changes the input vector into a white noise vector.
Several other transformations are closely related to whitening:
- the decorrelation transform removes only the correlations but leaves variances intact,
- the standardization transform sets variances to 1 but leaves correlations intact,
- a coloring transformation transforms a vector of white random variables into a random vector with a specified covariance matrix.[2]
Definition
Suppose $X$ is a random (column) vector with non-singular covariance matrix $\Sigma$ and mean $0$. Then the transformation $Y = WX$ with a whitening matrix $W$ satisfying the condition $W^{\mathrm{T}} W = \Sigma^{-1}$ yields the whitened random vector $Y$ with unit diagonal covariance.
If $X$ has non-zero mean $\mu$, then whitening can be performed by $Y = W(X - \mu)$.
There are infinitely many possible whitening matrices $W$ that all satisfy the above condition. Commonly used choices are $W = \Sigma^{-1/2}$ (Mahalanobis or ZCA whitening), $W = L^{\mathrm{T}}$ where $L$ is the Cholesky decomposition of $\Sigma^{-1}$ (Cholesky whitening),[3] or the eigen-system of $\Sigma$ (PCA whitening).[4]
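The three common whitening matrices can be sketched in a few lines of numpy. This is an illustrative example (the covariance matrix and variable names are invented for the demonstration, not taken from the article): each construction yields a $W$ with $W \Sigma W^{\mathrm{T}} = I$.

```python
import numpy as np

Sigma = np.array([[4.0, 1.2],
                  [1.2, 1.0]])  # example positive-definite covariance

# ZCA (Mahalanobis) whitening: W = Sigma^{-1/2}, via the eigendecomposition
# Sigma = E diag(evals) E^T of the symmetric matrix Sigma
evals, E = np.linalg.eigh(Sigma)
W_zca = E @ np.diag(evals ** -0.5) @ E.T

# PCA whitening: W = diag(evals)^{-1/2} E^T (rotate to principal axes, then scale)
W_pca = np.diag(evals ** -0.5) @ E.T

# Cholesky whitening: W = L^T, where L L^T = Sigma^{-1}
L = np.linalg.cholesky(np.linalg.inv(Sigma))
W_chol = L.T

# All three satisfy the whitening condition W Sigma W^T = I
for W in (W_zca, W_pca, W_chol):
    assert np.allclose(W @ Sigma @ W.T, np.eye(2))
```

The three matrices differ by an orthogonal rotation: ZCA keeps the whitened variables as close as possible to the originals, PCA aligns them with the principal axes, and Cholesky produces a triangular transform.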
Optimal whitening transforms can be singled out by investigating the cross-covariance and cross-correlation of $X$ and $Y$.[3] For example, the unique optimal whitening transformation achieving maximal component-wise correlation between the original $X$ and the whitened $Y$ is produced by the whitening matrix $W = P^{-1/2} V^{-1/2}$, where $P$ is the correlation matrix and $V$ the diagonal variance matrix.
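The correlation-preserving construction above can also be sketched directly: factor the covariance into a correlation matrix $P$ and diagonal variances $V$, then form $W = P^{-1/2} V^{-1/2}$. Again a minimal numpy sketch with an invented example covariance:

```python
import numpy as np

Sigma = np.array([[4.0, 1.2],
                  [1.2, 1.0]])

V_isqrt = np.diag(np.diag(Sigma) ** -0.5)  # V^{-1/2}, diagonal variances
P = V_isqrt @ Sigma @ V_isqrt              # correlation matrix P

# P^{-1/2} via eigendecomposition (P is symmetric positive definite)
evals, G = np.linalg.eigh(P)
P_isqrt = G @ np.diag(evals ** -0.5) @ G.T

W = P_isqrt @ V_isqrt                      # W = P^{-1/2} V^{-1/2}

# W still satisfies the whitening condition:
# W Sigma W^T = P^{-1/2} (V^{-1/2} Sigma V^{-1/2}) P^{-1/2} = P^{-1/2} P P^{-1/2} = I
assert np.allclose(W @ Sigma @ W.T, np.eye(2))
```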
Whitening a data matrix
Whitening a data matrix follows the same transformation as for random variables. An empirical whitening transform is obtained by estimating the covariance (e.g. by maximum likelihood) and subsequently constructing a corresponding estimated whitening matrix (e.g. by Cholesky decomposition).
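The empirical procedure can be sketched end to end: center the data, estimate the covariance, build a whitening matrix from it, and apply it to the rows. A minimal numpy sketch, using Cholesky whitening on simulated data (the sample size, mean, and covariance are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.multivariate_normal(mean=[2.0, -1.0],
                            cov=[[4.0, 1.2], [1.2, 1.0]],
                            size=5000)          # data matrix, one row per sample

Xc = X - X.mean(axis=0)                         # center each column
S = Xc.T @ Xc / (len(Xc) - 1)                   # empirical covariance estimate

L = np.linalg.cholesky(np.linalg.inv(S))        # L L^T = S^{-1}
Z = Xc @ L                                      # whitened rows: z_i = L^T x_i

# The whitened data have (exactly) identity sample covariance
S_white = Z.T @ Z / (len(Z) - 1)
assert np.allclose(S_white, np.eye(2))
```

Because the whitening matrix is built from the same sample covariance it is applied to, the whitened sample covariance is the identity up to numerical precision; on fresh data from the same distribution it would only be approximately so.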
High-dimensional whitening
This procedure generalizes whitening to more general spaces, where $X$ is usually assumed to be a random function or another random object in a Hilbert space $H$. One of the main issues of extending whitening to infinite dimensions is that the covariance operator has an unbounded inverse in $H$. Nevertheless, if one assumes that the Picard condition holds for $X$ in the range space of the covariance operator, whitening becomes possible.[5] A whitening operator can then be defined from the factorization of the Moore–Penrose inverse of the covariance operator, which acts effectively on Karhunen–Loève type expansions of $X$. The advantage of these whitening transformations is that they can be optimized according to the underlying topological properties of the data, thus producing more robust whitening representations. High-dimensional features of the data can be exploited through kernel regressors or basis function systems.[6]
R implementation
An implementation of several whitening procedures in R, including ZCA, PCA, and CCA whitening, is available in the "whitening" R package[7] published on CRAN. The R package "pfica"[8] allows the computation of high-dimensional whitening representations using basis function systems (B-splines, Fourier basis, etc.).
See also
- Decorrelation
- Principal component analysis
- Weighted least squares
- Canonical correlation
- Mahalanobis distance (reduces to Euclidean distance after a whitening transformation)
References