add minibatch subsampling (doubly stochastic) objective #84

Red-Portal · 2024-08-11T21:40:51Z

This is a draft for the subsampling variational objective, which addresses #38 . Any perspectives/concerns/comments are welcome! The current plan is to only implement random reshuffling. As I recently showed that there is no point in implementing independent subsampling. Although importance sampling of datapoints could be an interesting addition, it will require custom DynamicPPL.Contexts.

The key design decisions is the following function:

# This function/signature will be moved to src/AdvancedVI.jl
"""
    subsample(model, batch)

# Arguments
- `model`: Model subject to subsampling. Could be the target model or the variational approximation.
- `batch`: Data points or indices corresponding to the subsampled "batch."

# Returns 
- `sub`: Subsampled model.
"""
prob_sub = subsample(prob, batch)
q_sub = subsample(q, batch)

Given a previous slack DM thread with @yebai , this interface could be straightforwardly implemented by Turing models as

@model function pPCA(X::AbstractMatrix{<:Real}, k::Int; data_or_indices = 1:size(X,1))
    N, D = size(X)
    N_sub = length(batch_idx)

    W ~ filldist(Normal(), D, k)
    Z ~ filldist(Normal(), k, N)

    # Subsampling
    # Some AD backends are not happy about `view`.
    # In that case, this step will commit a copy and, therefore, shall not be considered free.
    Z_sub = view(Z, :, idx)
    X_sub = view(X, :, idx)

    genes_mean = W * Z_sub
    return X_sub ~ arraydist([MvNormal(m, Eye(N_sub)) for m in eachcol(genes_mean')])
end;

where data_or_indices could be made a reserved keyword argument for Turing models. Then, I think

using Accessors

function subsample(m::DynamicPPL.Model, batch)
    n, b = length(m.defaults), length(batch)
    m = @set m.defaults = batch
    m = @set m.context = MiniBatchContext(context=m.context; b, n)
    m
end

should generally work?

My current guess would be that subsample(m::DynamicPPL.Model, batch) would have to end up in the main Turing repository.

codecov · 2024-08-11T22:02:25Z

Codecov Report

Attention: Patch coverage is 0% with 29 lines in your changes missing coverage. Please review.

Project coverage is 82.92%. Comparing base (1b36c6e) to head (644d314).
Report is 12 commits behind head on master.

Files	Patch %	Lines
src/objectives/subsampling.jl	0.00%	29 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           master      #84       +/-   ##
===========================================
- Coverage   96.09%   82.92%   -13.18%     
===========================================
  Files          11       12        +1     
  Lines         205      246       +41     
===========================================
+ Hits          197      204        +7     
- Misses          8       42       +34

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Red-Portal · 2024-09-10T00:29:27Z

related: TuringLang/DynamicPPL.jl#633

…ibatch

Red-Portal · 2024-09-10T07:40:57Z

Here is a comparison of the convergence speed of full-batch v.s. subsampling with respect to wall-clock time.

yebai · 2024-09-10T09:18:49Z

I suggest delaying this to AdvancedVI v0.4 so that the syntax in DynamicPPL is implemented.

Red-Portal · 2024-09-10T19:57:33Z

@yebai Sounds good to me.

mhauru

I don't understand the theory nor the wider context of AdvancedVI, so my review is quite shallow, but I don't see any significant problems here. I left a few small local proposals.

mhauru · 2024-09-18T15:10:54Z

docs/src/subsampling.md

+end
+```
+
+Notice that, when computing the log-density, we multiple by a constant `likeadj`.


Suggested change

Notice that, when computing the log-density, we multiple by a constant `likeadj`.

Notice that, when computing the log-density, we multiply by a constant `likeadj`.

mhauru · 2024-09-18T15:17:42Z

docs/src/subsampling.md

+```
+Let's first compare the convergence of full-batch `RepGradELBO` versus subsampled `RepGradELBO` with respect to the number of iterations:
+
+![](subsampling_iteration.svg)


Are these .svg files missing from the repo? I haven't looked at the built docs, just don't see them in the PR.

mhauru · 2024-09-18T15:23:53Z

src/AdvancedVI.jl

+# Returns 
+- `sub`: Subsampled model.
+"""
+subsample(model::Any, ::Any) = model


Do I understand correctly that subsampling is a more general operation than just a VI thing? If that's the case, could this be moved to DynamicPPL, or even AbstractPPL?

Also, I wonder if an empty function without methods would make more sense. Is returning the unmodified original model a reasonable fallback? I could imagine it confusing users, who would call it and get a return value, not realising it's actually just the original model.

@yebai Any comments on the current direction on the PPL side?

mhauru · 2024-09-18T15:24:38Z

src/AdvancedVI.jl

 """
-    estimate_gradient!(rng, obj, adtype, out, prob, λ, restructure, obj_state)
+    estimate_gradient!(rng, obj, adtype, out, prob, λ, restructure, obj_state, objargs...; kwargs...)


Could the varargs be explained in the docstring?

mhauru · 2024-09-18T15:29:42Z

src/objectives/subsampled.jl

+    Subsampled(objective, batchsize, data)
+
+Subsample `objective` over the dataset represented by `data` with minibatches of size `batchsize`.
+


Could you comment on what happens if batchsize does not divide length(data), or whether that's significant at all?

add subsampling objective

548a281

Red-Portal marked this pull request as draft August 11, 2024 21:41

Red-Portal added this to the v0.3.0 milestone Aug 11, 2024

fix wrong function name

644d314

yebai mentioned this pull request Aug 12, 2024

Where are the benchmarks posted? #82

Closed

Red-Portal added 3 commits September 9, 2024 21:03

Merge branch 'master' of github.com:TuringLang/AdvancedVI.jl into min…

0d90755

…ibatch

add subsampled objective with tests

da01ffc

add optional arguments and keyword arguments to RepGradELBO

aa5fd83

Red-Portal marked this pull request as ready for review September 10, 2024 05:46

Red-Portal added 2 commits September 10, 2024 00:38

add elapsed time measurement in stat

1c1b289

add docs for subsampling

d8d6f56

Red-Portal requested review from sunxd3, yebai and mhauru and removed request for sunxd3 September 10, 2024 07:39

fix subsampling example to work with DoG (tweaked scale_eps)

2fe0061

Red-Portal modified the milestones: v0.3.0, v0.4.0 Sep 10, 2024

Red-Portal added the enhancement New feature or request label Sep 10, 2024

mhauru reviewed Sep 18, 2024

View reviewed changes

Red-Portal mentioned this pull request Sep 30, 2024

Migrate to DifferentiationInterface #98

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add minibatch subsampling (doubly stochastic) objective #84

add minibatch subsampling (doubly stochastic) objective #84

Red-Portal commented Aug 11, 2024 •

edited

Loading

codecov bot commented Aug 11, 2024 •

edited

Loading

Red-Portal commented Sep 10, 2024

Red-Portal commented Sep 10, 2024

yebai commented Sep 10, 2024 •

edited

Loading

Red-Portal commented Sep 10, 2024

mhauru left a comment

mhauru Sep 18, 2024

mhauru Sep 18, 2024

mhauru Sep 18, 2024

Red-Portal Sep 18, 2024

mhauru Sep 18, 2024

mhauru Sep 18, 2024

	Notice that, when computing the log-density, we multiple by a constant `likeadj`.
	Notice that, when computing the log-density, we multiply by a constant `likeadj`.

		Subsampled(objective, batchsize, data)

		Subsample `objective` over the dataset represented by `data` with minibatches of size `batchsize`.

add minibatch subsampling (doubly stochastic) objective #84

Are you sure you want to change the base?

add minibatch subsampling (doubly stochastic) objective #84

Conversation

Red-Portal commented Aug 11, 2024 • edited Loading

codecov bot commented Aug 11, 2024 • edited Loading

Codecov Report

Red-Portal commented Sep 10, 2024

Red-Portal commented Sep 10, 2024

yebai commented Sep 10, 2024 • edited Loading

Red-Portal commented Sep 10, 2024

mhauru left a comment

Choose a reason for hiding this comment

mhauru Sep 18, 2024

Choose a reason for hiding this comment

mhauru Sep 18, 2024

Choose a reason for hiding this comment

mhauru Sep 18, 2024

Choose a reason for hiding this comment

Red-Portal Sep 18, 2024

Choose a reason for hiding this comment

mhauru Sep 18, 2024

Choose a reason for hiding this comment

mhauru Sep 18, 2024

Choose a reason for hiding this comment

Red-Portal commented Aug 11, 2024 •

edited

Loading

codecov bot commented Aug 11, 2024 •

edited

Loading

yebai commented Sep 10, 2024 •

edited

Loading