bqlearn.corruptions.make_sampling_biais¶

bqlearn.corruptions.make_sampling_biais(X, *arrays, a=3, b=8, random_state=None)[source]¶

Synthetic covariate shift by creating a sampling biais using the first axis of a PCA learned of the input features [1].

Parameters:

Xarray-like of shape (n_samples, n_features): The samples.
*arrays: sequence of indexables with length / shape[0] equals to n_samples: Allowed inputs are lists, numpy arrays, scipy-sparse matrices or pandas dataframes.
afloat, default = 3.0
bfloat, default = 8.0
random_stateint or RandomState, default=None: Controls the randomness of the PCA.

Returns:

X_corruptedndarray of shape (n_samples, n_features): The corrupted samples.
*arrays_imbalancedlist, length=len(arrays): The corresponding imbalanced arrays.

References

[1]

Gretton, Arthur, et al. “Covariate shift by kernel mean matching.” Dataset shift in machine learning 3.4 (2009): 5.