Handle commutativity correctly in scalar rules #275

sethaxen · 2020-10-07T05:30:07Z

Many scalar rules defined on arguments of Number type assume the scalar commutes under multiplication, which fails for non-commutative numbers like quaternions. This PR restricts the type of such rules to Union{Real,Complex}. Where possible, it also adds generic rules defined for Number that don't assume commutativity.

If a user had implemented their own commutative number type that was not a Real, then before this PR, the rules may have worked for them, but now they will not. Hence, this is marked as a breaking change.
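
To illustrate the breaking change, here is a minimal sketch of how dispatch behaves under the restriction. `MyNumber` and `has_commutative_rule` are hypothetical names used only for this example; they are not part of ChainRules, and the real rules are defined through `@scalar_rule` rather than a helper like this.

```julia
# The alias introduced by this PR.
const CommutativeMulNumber = Union{Real, Complex}

# Hypothetical user-defined number type that is neither Real nor Complex,
# even though its multiplication might well commute.
struct MyNumber <: Number
    value::Float64
end

# Hypothetical stand-in for "a restricted scalar rule applies to this argument".
has_commutative_rule(::CommutativeMulNumber) = true
has_commutative_rule(::Number) = false

println(has_commutative_rule(1.0))            # Real: restricted rule applies
println(has_commutative_rule(1.0 + 2.0im))    # Complex: restricted rule applies
println(has_commutative_rule(MyNumber(1.0)))  # custom Number: no longer matched
```

A user in that situation would presumably need to define their own rules for the custom type.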

This PR requires JuliaDiff/ChainRulesTestUtils.jl#61.

oxinabox · 2020-10-07T10:48:03Z

will review tomorrow

oxinabox

Looks good, though I don't fully understand it.
Needs a version bump.

My only major comment is that we should move more of the testing code into ChainRulesTestUtils.jl
(and/or FiniteDifferences.jl).

Approving this now so that it can be merged once we have that sorted out in
JuliaDiff/ChainRulesTestUtils.jl#61.

oxinabox · 2020-10-08T13:12:14Z

src/ChainRules.jl

@@ -22,6 +22,8 @@ if VERSION < v"1.3.0-DEV.142"
    import LinearAlgebra: dot
 end

+# numbers that we know commute under multiplication
+const CommutativeMulNumber = Union{Real,Complex}


I thought the style guide said to put a space after the comma here,
but now that I look, I am not sure it mentions it (JuliaDiff/BlueStyle#77).
Still, it is what we do elsewhere.

Suggested change

-const CommutativeMulNumber = Union{Real,Complex}
+const CommutativeMulNumber = Union{Real, Complex}

oxinabox · 2020-10-08T13:27:54Z

src/rulesets/Base/base.jl

-@scalar_rule acosh(x) inv(sqrt(x - 1) * sqrt(x + 1))
-@scalar_rule acoth(x) inv(1 - x ^ 2)
-@scalar_rule acsch(x) -(inv(x ^ 2 * sqrt(1 + x ^ -2)))
+@scalar_rule acosh(x::CommutativeMulNumber) inv(sqrt(x - 1) * sqrt(x + 1))


What logic determines whether a univariate function needs to be restricted to numbers with commutative multiplication?

sethaxen · 2020-10-08

If a function is extended to the complex numbers and to matrices through a power series, then non-commutativity becomes a problem for non-commutative numbers, so I restrict the rule.
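
As a rough illustration of this point, take `x^2`, the simplest nontrivial power-series term. The sketch below uses 2×2 integer matrices as a stand-in for a non-commutative number type, and the variable names are made up for the example. The first-order change in `X^2` along `H` is `H*X + X*H`, which the commutative scalar rule `2X*H` only reproduces when `X` and `H` commute.

```julia
using LinearAlgebra  # stdlib; provides matrix multiplication and powers

# Two matrices that do not commute: X*H != H*X.
X = [0 1; 0 0]
H = [0 0; 1 0]

# Exact first-order part of (X + H)^2 - X^2; the remainder H^2 is second order.
first_order = (X + H)^2 - X^2 - H^2

# Derivative that keeps the order of the factors.
noncommutative_rule = H * X + X * H

# What a scalar rule assuming commutativity would compute.
commutative_rule = 2 * X * H

println(first_order == noncommutative_rule)  # the order-preserving rule matches
println(first_order == commutative_rule)     # the commutative rule does not
```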

oxinabox · 2020-10-08T13:29:11Z

src/rulesets/Base/base.jl

@@ -140,7 +140,36 @@ end
    ),
    (!(islow | ishigh), islow, ishigh),
 )
-@scalar_rule x \ y (-(Ω / x), one(y) / x)
+


Should we move the other scalar rule for muladd to be here also?

oxinabox · 2020-10-08T13:34:20Z

src/rulesets/Base/base.jl

+
+# product rule requires special care for arguments where `muladd` is non-commutative
+function frule((_, Δx, Δy, Δz), ::typeof(muladd), x::Number, y::Number, z::Number)
+    ∂xyz = muladd(Δx, y, muladd(x, Δy, Δz))


Should we use MuladdMacro.jl here?
It's already a dependency of ChainRulesCore.
I think then we can just write:

Suggested change

-∂xyz = muladd(Δx, y, muladd(x, Δy, Δz))
+@muladd ∂xyz = Δx*y + x*Δy + Δz

and then I think the macro takes care of the rearranging.

I don't know whether that really adds clarity or not. What do you think?
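
For reference, a minimal sketch of why the argument order in the nested `muladd` matters. `Quat` is a hypothetical integer quaternion type written only for this example (it is not the type from Quaternions.jl), and `muladd_tangent` is a made-up name for the tangent expression in the diff above. Integer components keep the arithmetic exact, so the comparison needs no tolerance.

```julia
# Hypothetical minimal quaternion with integer components (illustration only).
struct Quat <: Number
    s::Int
    v1::Int
    v2::Int
    v3::Int
end

Base.:+(a::Quat, b::Quat) = Quat(a.s + b.s, a.v1 + b.v1, a.v2 + b.v2, a.v3 + b.v3)
Base.:-(a::Quat, b::Quat) = Quat(a.s - b.s, a.v1 - b.v1, a.v2 - b.v2, a.v3 - b.v3)

# Hamilton product, which is not commutative.
function Base.:*(a::Quat, b::Quat)
    return Quat(
        a.s * b.s - a.v1 * b.v1 - a.v2 * b.v2 - a.v3 * b.v3,
        a.s * b.v1 + a.v1 * b.s + a.v2 * b.v3 - a.v3 * b.v2,
        a.s * b.v2 - a.v1 * b.v3 + a.v2 * b.s + a.v3 * b.v1,
        a.s * b.v3 + a.v1 * b.v2 - a.v2 * b.v1 + a.v3 * b.s,
    )
end

# Tangent of muladd(x, y, z) = x*y + z with the factor order preserved:
# Δx*y + x*Δy + Δz.
muladd_tangent(x, y, Δx, Δy, Δz) = muladd(Δx, y, muladd(x, Δy, Δz))

x, y, z = Quat(1, 1, 0, 0), Quat(0, 0, 1, 0), Quat(3, 0, 0, 0)
Δx, Δy, Δz = Quat(0, 0, 0, 1), Quat(0, 1, 0, 0), Quat(2, 0, 0, 1)

tangent = muladd_tangent(x, y, Δx, Δy, Δz)

# Exact change in the output minus the second-order term Δx*Δy.
exact_first_order = muladd(x + Δx, y + Δy, z + Δz) - muladd(x, y, z) - Δx * Δy

# The same expression with the factors swapped, as a commutative rule might write it.
swapped = y * Δx + Δy * x + Δz

println(tangent == exact_first_order)  # order-preserving tangent is exact to first order
println(swapped == exact_first_order)  # swapped factors give a different quaternion
```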

oxinabox · 2020-10-08T14:03:42Z

test/rulesets/Base/base.jl

+        function FiniteDifferences.to_vec(q::Quaternion)
+            function Quaternion_from_vec(q_vec)
+                return Quaternion(q_vec[1], q_vec[2], q_vec[3], q_vec[4])
+            end
+            return [q.s, q.v1, q.v2, q.v3], Quaternion_from_vec
+        end


We should just have this defined in ChainRulesTestUtils.jl.
The whole point of that package is to avoid defining reusable stuff inside the tests.

It's still type piracy there, but it makes sense for us to define proper functionality for testing this in ChainRulesTestUtils:
JuliaDiff/ChainRulesTestUtils.jl#61

Or we could move it to FiniteDifferences.jl; that would also be acceptable, and would not be type piracy.

oxinabox · 2020-10-08T14:05:06Z

test/rulesets/Base/base.jl

+            end
+            @testset "/(::Quaternion, ::Real)" begin
+                x, ẋ = quatrand(), quatrand(), quatrand()
+                y, ẏ = randn(3)


Typo?

Suggested change

-y, ẏ = randn(3)
+y, ẏ = randn(2)

oxinabox · 2020-10-08T14:06:19Z

test/rulesets/Base/base.jl

+                rrule_test(f, ΔΩ, (x, x̄), (y, ȳ))
+            end
+            @testset "/(::Quaternion, ::Real)" begin
+                x, ẋ = quatrand(), quatrand(), quatrand()


Suggested change

-x, ẋ = quatrand(), quatrand(), quatrand()
+x, ẋ = quatrand(), quatrand()

oxinabox · 2020-10-08T14:07:21Z

test/rulesets/Base/base.jl

+                x, ẋ, x̄ = randn(3)
+                y, ẏ, ȳ = quatrand(), quatrand(), quatrand()
+                ΔΩ = quatrand()


Since we are not testing the rrule, we don't need these.

Suggested change

-x, ẋ, x̄ = randn(3)
-y, ẏ, ȳ = quatrand(), quatrand(), quatrand()
-ΔΩ = quatrand()
+x, ẋ = randn(2)
+y, ẏ = quatrand(), quatrand()

oxinabox · 2020-10-08T14:08:15Z

test/rulesets/Base/base.jl

+                y, ẏ = randn(3)
+                frule_test(/, (x, ẋ), (y, ẏ))
+                # don't test rrule, because it doesn't project adjoint of y to the reals
+                # so fd won't agree


Do we have a way to test these?

Why don't we run into this problem for Complex numbers?

sethaxen added 8 commits October 6, 2020 22:17

Add commutative numbers const

dbc3d08

Restrict rules assuming commutativity

8758584

Restrict angle to real/complexes

4add85d

Current use of imag and im assumes real or complex

Restrict sign to Number

9401209

Add rules for non-commuting numbers

51180ca

Use Quaternions in test

431fe8c

Test non-commutative rules with Quaternion

2f14f73

Increment version number

d0d1954

sethaxen requested a review from oxinabox October 7, 2020 05:30

Revert "Increment version number"

967a1bc

This reverts commit d0d1954.

oxinabox approved these changes Oct 8, 2020

mcabbott mentioned this pull request Feb 22, 2022

Assume commutative multiplication exactly when necessary #540

Draft
