bqlearn.corruptions.make_weak_labels¶

bqlearn.corruptions.make_weak_labels(X, y, estimator=None, *, train_size=0.1, stratify=None, discrete=True, random_state=None)[source]¶

Generate weak labels for a given dataset.

Parameters:

Xarray-like of shape (n_samples, n_features): The samples.
yarray-like of shape (n_samples, ): The targets.
estimatorobject, default=None: The estimator used to generate weak labels. If None, LogisticRegression is used as the estimator.
train_sizefloat or int, default=0.1: If float, should be between 0.0 and 1.0 and represent the proportion of the dataset that the estimator will be fitted on. If int, represents the absolute number of samples that the estimator will be fitted on.
stratifyarray-like, default=None: If not None, data is split in a stratified fashion, using this as the class labels.
discreteboolean, default=True: Determines if corrupted labels are the predicted label or sampled from the predicted probability distribution. If False, estimator must support predict_proba.
random_stateint or RandomState, default=None: Controls the random_state of the estimator.

Returns:

y_corruptndarray of shape (n_samples, ): The untrusted targets as predicitions from the fitted estimator.