


StandardScaler utility class, which is a quick andĮasy way to perform the following operation on an array-like Than others, it might dominate the objective function and make theĮstimator unable to learn from other features correctly as expected. If a feature has a variance that is orders of magnitude larger Machines or the l1 and l2 regularizers of linear models) may assume thatĪll features are centered around zero or have variance in the same Transform the data to center it by removing the mean value of eachįeature, then scale it by dividing non-constant features by theirįor instance, many elements used in the objective function ofĪ learning algorithm (such as the RBF kernel of Support Vector

In practice we often ignore the shape of the distribution and just Normally distributed data: Gaussian with zero mean and unit variance. Machine learning estimators implemented in scikit-learn they might behaveīadly if the individual features do not more or less look like standard Standardization of datasets is a common requirement for many Standardization, or mean removal and variance scaling ¶ Normalizers on a dataset containing marginal outliers is highlighted inĬompare the effect of different scalers on data with outliers. The behaviors of the different scalers, transformers, and Some outliers are present in the set, robust scalers or transformers are moreĪppropriate. In general, learning algorithms benefit from standardization of the data set. Into a representation that is more suitable for the downstream estimators. Utility functions and transformer classes to change raw feature vectors The sklearn.preprocessing package provides several common
