Add L2/L1 regularization #179
Conversation
Commits:
- oops
- oops
- correct order of move and collate
- fix missing chain move
- fix forgotten addition to cache
- update doc-string
Codecov Report

```
@@            Coverage Diff             @@
##              dev     #179      +/-   ##
==========================================
- Coverage   90.74%   90.59%   -0.15%
==========================================
  Files           8        9       +1
  Lines         216      234      +18
==========================================
+ Hits          196      212      +16
- Misses         20       22       +2
```

Continue to review the full report at Codecov.
@ayush-1506 Would you like to review this PR? I've tested it locally on a GPU. @DilumAluthge How do I put the GPU tests back for PRs onto dev?
@ablaom Sure, please give me a day.
Just one small question; everything else looks great.
```diff
@@ -43,48 +43,53 @@ end
 true_rng(model) = model.rng isa Integer ? MersenneTwister(model.rng) : model.rng

 function MLJModelInterface.fit(model::MLJFluxModel,
-                               verbosity::Int,
+                               verbosity,
```
Please correct me if I'm wrong, but `verbosity` should still be an `Int`, right?
Yes, but you no longer need to explicitly type it. There used to be a type ambiguity that required the type annotation, but that is now long gone.
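For what it's worth, a toy illustration of the point (the `ToyModel`, `fit_old`, and `fit_new` names are made up, not MLJFlux code): with a single applicable method, dropping the annotation changes nothing for callers that pass an `Int`.

```julia
# Toy stand-ins for the real `fit` method, purely to illustrate the dispatch point:
abstract type ToyModel end
struct ToyRegressor <: ToyModel end

fit_old(model::ToyModel, verbosity::Int, X, y) = verbosity  # annotated
fit_new(model::ToyModel, verbosity, X, y) = verbosity       # annotation dropped

X, y = rand(3, 2), rand(3)
@assert fit_old(ToyRegressor(), 1, X, y) == fit_new(ToyRegressor(), 1, X, y) == 1
```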
This PR:

- addresses "`alpha` and `lambda` are not used" #169
- makes a change to the `fit!` method that was natural in addressing the above and anticipates further transfer of responsibility to the planned data front-end (see https://alan-turing-institute.github.io/MLJ.jl/dev/adding_models_for_general_use/#Implementing-a-data-front-end and [Design discussion] Batches and resampling #97).

@ToucheSir Further to the Julia Discourse discussion, no change was actually necessary to the core `train!` loop to add regularization. It's just that the loss function passed to this loop now depends on the `chain` (the Flux model) when the L2/L1 regularisation parameters are non-trivial. To avoid the array-mutation error I needed to avoid broadcasting in the computation of the penalty here. Any performance suggestions regarding these two bits of code would be appreciated.
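To make that last point concrete, here is a minimal sketch, assuming Flux's `params` and standard non-broadcasting reductions, of a chain-dependent penalized loss; the names `penalty` and `penalized_loss` are illustrative and this is not the exact code in the PR.

```julia
using Flux

# A minimal sketch (not the PR's code) of a chain-dependent penalty with
# strength `lambda` and L1/L2 mixing parameter `alpha`. The reductions
# `sum(abs, p)` and `sum(abs2, p)` avoid broadcasting over the parameters,
# which is what sidesteps the array-mutation error mentioned above.
function penalty(chain, lambda, alpha)
    ps = Flux.params(chain)
    isempty(ps) && return zero(float(lambda))
    l1 = sum(p -> sum(abs, p), ps)
    l2 = sum(p -> sum(abs2, p), ps)
    return lambda * (alpha * l1 + (1 - alpha) * l2)
end

# The loss handed to the training loop closes over `chain` when `lambda`
# is non-zero, so the penalty tracks the chain's current parameters:
penalized_loss(loss, chain, lambda, alpha) =
    (x, y) -> loss(chain(x), y) + penalty(chain, lambda, alpha)

# Usage sketch (hypothetical model and data):
chain = Chain(Dense(4, 8, relu), Dense(8, 1))
loss = penalized_loss(Flux.mse, chain, 0.01, 0.5)
x, y = rand(Float32, 4, 16), rand(Float32, 1, 16)
loss(x, y)  # scalar: mse plus the mixed L1/L2 penalty
```

Writing the penalty with `sum(abs, p)` / `sum(abs2, p)` rather than `sum(abs.(p))` / `sum(p .^ 2)` also avoids allocating intermediate arrays, which may help on GPU.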