Data preprocessing
Data preprocessing typically involves some or all of the steps below.
The "Efficient BackProp" paper by LeCun et al. describes the preprocessing strategy as:
- shift the data so that its mean is 0
- de-correlate the data
- normalize the data so each input has a variance of ~1
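The three steps above can be sketched in plain C++ for the simplest non-trivial case of two input features. This is only an illustration under stated assumptions, not the stats++ implementation: the names `Sample`, `center`, `decorrelate`, `scale` and `preprocess` are hypothetical, the decorrelation step uses a closed-form rotation that is valid only for two features (a general implementation would use an eigendecomposition of the covariance matrix), and the variance is the population variance (divided by the number of samples).

```cpp
#include <array>
#include <cmath>
#include <cstddef>
#include <vector>

// One sample with two input features (hypothetical type for this sketch).
using Sample = std::array<double, 2>;

// Step 1: shift each feature so that its mean over the data set is 0.
void center(std::vector<Sample>& data) {
    if (data.empty()) return;
    const double n = static_cast<double>(data.size());
    Sample mean{0.0, 0.0};
    for (const Sample& s : data) {
        mean[0] += s[0];
        mean[1] += s[1];
    }
    mean[0] /= n;
    mean[1] /= n;
    for (Sample& s : data) {
        s[0] -= mean[0];
        s[1] -= mean[1];
    }
}

// Step 2: rotate already-centered data onto the principal axes of its
// 2x2 covariance matrix, so the covariance between the two features is 0.
void decorrelate(std::vector<Sample>& data) {
    double cxx = 0.0, cyy = 0.0, cxy = 0.0;
    for (const Sample& s : data) {
        cxx += s[0] * s[0];
        cyy += s[1] * s[1];
        cxy += s[0] * s[1];
    }
    // Rotation angle that diagonalizes [[cxx, cxy], [cxy, cyy]].
    // The common 1/n factor is omitted because it does not change the angle.
    const double theta = 0.5 * std::atan2(2.0 * cxy, cxx - cyy);
    const double c = std::cos(theta);
    const double sn = std::sin(theta);
    for (Sample& s : data) {
        const double x = s[0];
        const double y = s[1];
        s[0] = c * x + sn * y;
        s[1] = -sn * x + c * y;
    }
}

// Step 3: divide each already-centered feature by its standard deviation,
// so that each feature has a variance of 1. Constant features are left as-is.
void scale(std::vector<Sample>& data) {
    if (data.empty()) return;
    const double n = static_cast<double>(data.size());
    for (std::size_t j = 0; j < 2; ++j) {
        double var = 0.0;
        for (const Sample& s : data) var += s[j] * s[j];
        var /= n;
        if (var > 0.0) {
            const double sd = std::sqrt(var);
            for (Sample& s : data) s[j] /= sd;
        }
    }
}

// Apply the three steps in the order listed above.
void preprocess(std::vector<Sample>& data) {
    center(data);
    decorrelate(data);
    scale(data);
}
```

Calling `preprocess(data)` on a `std::vector<Sample>` modifies it in place. The order matters in this sketch: `decorrelate` and `scale` both assume the data has already been centered.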
stats++
In stats++, data preprocessing is handled through a Preprocessor object:
Preprocessor