Stochastic FW: optional batched user-provided functions #40

matbesancon · 2021-01-18T22:28:01Z

With the current SFW interface, users provide a function that processes one data point, batching happens a level higher when we call the provided functions.

One possibility would be to make users provide batched functions by default:

f_batched(θ, xs) = sum(f(θ, x_i) for x_i in xs)
g_batched(θ, xs) = sum(g(θ, x_i) for x_i in xs)

What they provide now is the equivalent of the functions f and g above.

The text was updated successfully, but these errors were encountered:

pokutta · 2021-01-18T22:48:51Z

good question - i would say we leave as is for now. the reason is that we need to thing how to best map e.g., variance-reduced methods as they need special batch sizes depending on the iteration.

matbesancon · 2021-01-18T22:57:03Z

OK yes. Even with the alternative version, each iteration can control the batch size by picking the size of the xs list that is passed to {f/g}_batched

pokutta · 2021-01-18T23:04:14Z

ok i will need some extra explanation tomorrow to discuss.

matbesancon · 2021-01-19T11:55:04Z

So for now at the FW function level we have this:

compute_gradient(f, x, rng=rng, batch_size=batch_size)

At the compute_gradient level for f::StochasticObjective:

    rand_indices = if full_evaluation
        eachindex(f.xs)
    else
        rand(rng, eachindex(f.xs), batch_size)
    end
    return sum(f.grad(θ, f.xs[idx]) for idx in rand_indices)

So compute_gradient is the place where the default batching behaviour is defined, and calls f.grad on individual data points. The change in behaviour would be that compute_gradient passes down the rng, batch_size and full_evaluation arguments to f.grad, that is, the user-defined function, which itself implements the batching.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stochastic FW: optional batched user-provided functions #40

Stochastic FW: optional batched user-provided functions #40

matbesancon commented Jan 18, 2021

pokutta commented Jan 18, 2021

matbesancon commented Jan 18, 2021

pokutta commented Jan 18, 2021

matbesancon commented Jan 19, 2021

Stochastic FW: optional batched user-provided functions #40

Stochastic FW: optional batched user-provided functions #40

Comments

matbesancon commented Jan 18, 2021

pokutta commented Jan 18, 2021

matbesancon commented Jan 18, 2021

pokutta commented Jan 18, 2021

matbesancon commented Jan 19, 2021