Skip to content

Commit

Permalink
🎨 improve transformer docstrings
Browse files Browse the repository at this point in the history
  • Loading branch information
Henry committed Jun 30, 2024
1 parent 6e77612 commit d5b707c
Show file tree
Hide file tree
Showing 2 changed files with 14 additions and 9 deletions.
5 changes: 3 additions & 2 deletions vaep/sklearn/ae_transformer.py
Original file line number Diff line number Diff line change
Expand Up @@ -36,9 +36,10 @@ class AETransformer(TransformerMixin, BaseEstimator):
"""Autoencoder transformer (Denoising or Variational).
Autoencoder transformer which can be used to impute missing values
in a dataset it is fitted to. Currently the data is standard normalized
for fitting the model, but imputations are provided the original scale
in a dataset it is fitted to. The data is standard normalized
for fitting the model, but imputations are provided on the original scale
after internally fitting the model.
The data uses the wide data format with samples as rows and features as columns.
Expand Down
18 changes: 11 additions & 7 deletions vaep/sklearn/cf_transformer.py
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,8 @@ class CollaborativeFilteringTransformer(TransformerMixin, BaseEstimator):
three columns. The sample and feature identifiers are embedded into a space which is
then used to predict the quantitative value.
The data is expected as a Series with a MultiIndex of the sample and feature identifiers,
and the quantitative value as its values.
Parameters
----------
Expand Down Expand Up @@ -78,15 +80,17 @@ def fit(self, X: pd.Series, y: pd.Series = None,
Parameters
----------
X : Series, shape (n_samples, )
The training data as a Series with the target_column as entries and name,
which has the item_column and sample_column set in a MultiIndex.
Is of shape (n_samples, )
X : Series, shape (n_values, )
The training data as a Series with the target_column as it values
and target_column as its name. The Series has a MultiIndex defined by the
item_column and sample_column.
Is of shape (n_values, )
y : Series, optional
The validation data as a Series with the target_column as entries and name,
which has the item_column and sample_column set in a MultiIndex.
Is of shape (n_samples, ), by default None
The validation data as a Series with the target_column as it values
and target_column as its name. The Series has a MultiIndex defined by the
item_column and sample_column.
Is of shape (n_values, ), by default None
epochs_max : int, optional
Maximal number of epochs to train, by default 100
Expand Down

0 comments on commit d5b707c

Please sign in to comment.