Migrate away from tf1 #125

Closed
timokau opened this issue May 29, 2020 · 6 comments · Fixed by #191
Labels: enhancement (New feature or request), Priority: Medium

Comments


timokau commented May 29, 2020

There has been some internal discussion about this, but I think it's time to also open an issue about it. We are still using tensorflow 1, which has been outdated for a while now. Switching to tensorflow 2 would be a significant effort, since the underlying programming model fundamentally changed (there is no explicit graph construction anymore). Given that, it may be worth evaluating a switch to pytorch instead. pytorch is a newer, very popular autodiff framework.
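To illustrate the shift in programming model: TF1 separates graph construction from execution (placeholders, sessions), while an eager framework like PyTorch records operations as they run and differentiates them directly. A minimal sketch (not project code, just the general idea):

```python
import torch

# In TF1 you would first build a static graph with placeholders and then
# evaluate it inside a session. In an eager framework like PyTorch the
# computation runs immediately and autodiff works on the recorded ops:
x = torch.tensor(3.0, requires_grad=True)
y = x ** 2 + 2 * x   # executed eagerly, no graph/session boilerplate
y.backward()         # backpropagate through the recorded operations
print(x.grad)        # d/dx (x^2 + 2x) at x = 3 -> tensor(8.)
```

This "define-by-run" style is also what tf2 adopted with eager execution, which is why both options imply a rewrite for us.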

This article comes to the conclusion that

TensorFlow is still mentioned in many more job listings than PyTorch, but the gap is closing. PyTorch has taken the lead in usage in research papers at top conferences and almost closed the gap in Google search results. TensorFlow remains three times more common in usage according to the most recent Stack Overflow Developer Survey.

Here's another relevant article. Overall it seems to me that pytorch is the more future-proof choice, and if we're going to have to rewrite a lot of the code anyway, we might as well switch. I don't have any practical experience with pytorch yet though; that's just what I could gather from others' opinions and first impressions.

We should also think about how we want to do the transition. This is a major undertaking and will probably take a while. Should we support tf1 and the new framework in parallel? Gradually move models over (thereby having mixed support)? Fork the project? Work on one big PR/branch, effectively blocking most other work for the time being due to potential conflicts?


timokau commented May 30, 2020

@kiudee pointed me to https://github.com/skorch-dev/skorch on #119. We could also build on top of that library, which may do some of the work for us so that we only have to specify the model structure. I'm not entirely sure how much it gives us over directly interacting with scikit-learn though.


timokau commented Oct 2, 2020

I feel like we can't put this off much longer. tf1 is becoming more and more outdated. It doesn't make much sense to continue with #116, since most of the changes are now model-specific details that might change anyway with a partial rewrite.

I have been looking into pytorch and tf2 a bit, and I'm tending slightly towards pytorch. I could try to rewrite a part of our codebase (I'm thinking feta_linear and the derived learners) in both pytorch and tf2, and then we decide on one. What do you think about that course of action @kiudee?


kiudee commented Oct 13, 2020

I agree - the dependency restrictions also become cumbersome to work with (other libraries may require more recent versions).
Would you say we are ready to make a release for this version of the library?

I would also tend towards pytorch. In addition, we can simplify some of the learners quite a bit using convolutions. Here is an experiment I did with FATE:

import torch
import torch.nn as nn
from torch.nn import init


class FATE(nn.Module):
    def __init__(
        self,
        set_encoder,
        n_input_features,
        n_set_features=16,
        n_scorer_layers=1,
        n_scorer_dim=16,
        n_scorer_output=2,
        set_encoder_args=None,
        **kwargs
    ):
        super().__init__()
        if set_encoder_args is None:
            set_encoder_args = dict()
        self.set_encoder = set_encoder(
            input_channels=n_input_features,
            output_channels=n_set_features,
            **set_encoder_args
        )
        full_dim = n_input_features + n_set_features
        layers = [nn.Conv1d(full_dim, n_scorer_dim, 1), nn.ReLU(inplace=True)]
        for _ in range(n_scorer_layers - 1):
            layers.extend(
                (nn.Conv1d(n_scorer_dim, n_scorer_dim, 1), nn.ReLU(inplace=True))
            )
        self.scorer = nn.Sequential(
            *layers, nn.Conv1d(n_scorer_dim, n_scorer_output, 1)
        )

        # Xavier-initialize all linear and convolutional layers.
        for m in self.modules():
            if isinstance(m, (nn.Linear, nn.Conv1d, nn.Conv2d)):
                init.xavier_uniform_(m.weight)
                if m.bias is not None:
                    m.bias.data.zero_()

    def forward(self, x, n_points=None):
        # x: (n_batch, n_objects, n_features)
        n_batch, n_obj, n_feat = x.shape
        new_x = x.transpose(1, 2).contiguous()  # (n_batch, n_features, n_objects)
        embedding = self.set_encoder(new_x, n_points)
        if isinstance(embedding, tuple):
            embedding = embedding[0]
        # Broadcast the set embedding to every object and concatenate it
        # with the per-object features along the channel dimension.
        emb_repeat = embedding.view(*embedding.shape, 1).repeat(1, 1, n_obj)
        new_x = torch.cat([new_x, emb_repeat], dim=1)
        scores = self.scorer(new_x)
        return scores.transpose(1, 2).contiguous()  # (n_batch, n_objects, n_scorer_output)
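The module above assumes a set encoder mapping (batch, features, n_objects) to a (batch, n_set_features) embedding. For completeness, a minimal, hypothetical encoder satisfying that contract (a 1x1 projection followed by mean-pooling over the object axis) could look like:

```python
import torch
import torch.nn as nn

class MeanSetEncoder(nn.Module):
    """Minimal set encoder: project per-object features with a 1x1
    convolution, then mean-pool over the object axis to obtain a
    permutation-invariant set embedding."""
    def __init__(self, input_channels, output_channels):
        super().__init__()
        self.proj = nn.Conv1d(input_channels, output_channels, 1)

    def forward(self, x, n_points=None):
        # x: (batch, input_channels, n_objects)
        return self.proj(x).mean(dim=2)  # (batch, output_channels)

enc = MeanSetEncoder(4, 16)
emb = enc(torch.rand(2, 4, 5))
print(emb.shape)  # torch.Size([2, 16])
```

Any permutation-invariant pooling (mean, max, sum) works here; the actual set encoders would of course be more elaborate.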


timokau commented Oct 15, 2020

Would you say we are ready to make a release for this version of the library?

Yes, but there have been breaking changes so it would have to be a major release.

In addition we can simplify some of the learners quite a bit using convolutions.

Yeah, there is a lot of potential for simplification. Your experimental code is interesting, I'll see if I can incorporate some of that into my work. In #164 I'm experimenting with a composition style, where you can specify different kinds of (rank-n) utility functions and have different classes to compose them. I think that might make things simpler and end up with a nicer structure than the current inheritance-based one.
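A rough sketch of that composition idea (all names here are hypothetical, not the actual #164 code): a learner receives a utility module as a component instead of inheriting model-specific behavior, so utility functions and ranking/choice heads can be mixed freely.

```python
import torch
import torch.nn as nn

class LinearUtility(nn.Module):
    """First-order (per-object) utility function."""
    def __init__(self, n_features):
        super().__init__()
        self.linear = nn.Linear(n_features, 1)

    def forward(self, x):                  # x: (batch, n_objects, n_features)
        return self.linear(x).squeeze(-1)  # (batch, n_objects) scores

class Ranker(nn.Module):
    """Composes any utility module; a ranking is the argsort of its scores."""
    def __init__(self, utility):
        super().__init__()
        self.utility = utility

    def forward(self, x):
        return self.utility(x)

# Swapping in a different utility (e.g. a set-aware one) requires no
# changes to Ranker, unlike an inheritance hierarchy.
model = Ranker(LinearUtility(n_features=3))
scores = model(torch.rand(2, 4, 3))
print(scores.shape)  # torch.Size([2, 4])
```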


kiudee commented Oct 16, 2020

Yes, but there have been breaking changes so it would have to be a major release.

Yes, I think this is warranted. I would say we should create a release candidate and test it before finalizing.

kiudee removed this from the 2.0 milestone on Oct 16, 2020

timokau commented Nov 5, 2020

I made a mistake in the annotation of #170; this is not fixed yet.
