Benchmarking AD tools has come up a lot recently, and this seems like a good place to implement some benchmarks, in addition to "correctness" testing.
I was thinking that they should be micro-benchmarks, and that the benchmarks themselves shouldn't depend on any functionality outside of Base and the standard libraries, with the possible exception of things needed to test support for accelerators, e.g. CuArrays.jl. Equally, these could be supported by typing things sufficiently abstractly 🤷 (see the sketch below).
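To make that concrete, here's a minimal sketch of the kind of benchmark target I have in mind. The function name `f` and the sizes are just for illustration; the point is that the definition uses only Base and stdlib functionality, and is typed abstractly enough that the same code could be benchmarked on CPU arrays or, with CuArrays.jl loaded, on the GPU:

```julia
using LinearAlgebra

# Broadcasting + linear algebra, no dependencies outside Base / stdlib.
# AbstractMatrix / AbstractVector keep it generic over array backends.
f(A::AbstractMatrix, x::AbstractVector) = sum(tanh.(A * x))

# On the CPU:
A, x = randn(100, 100), randn(100)
f(A, x)

# With CuArrays.jl loaded, the same definition should run on the GPU:
# using CuArrays
# f(cu(A), cu(x))
```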
The first thing to do is figure out what people actually care about the performance of. For example, I really care about broadcasting and operations involving linear algebra, but not so much about control flow; I know that the Turing team has a different set of priorities, though. So perhaps if everyone could suggest what sorts of things they're interested in benchmarking, we can start to think about how to chop up the tests. For example, from the perspective of reverse-mode AD there's a distinction between control flow that depends on values and control flow that doesn't, so we should probably be testing that kind of thing (sketched below).
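A quick sketch of that distinction, with hypothetical function names of my own choosing. In the first function the loop structure is fixed by `n`, so the trace a reverse-mode tool records is the same for every input; in the second, the branch taken changes with `x`, which some tools handle very differently:

```julia
# Control flow that does NOT depend on the values being differentiated:
# the number of iterations is determined by `n`, not by `x`.
function fixed_loop(x, n)
    s = zero(eltype(x))
    for i in 1:n
        s += sin(x[mod1(i, length(x))])
    end
    return s
end

# Control flow that DOES depend on the values: which branch executes
# varies with the input, so the computational graph is input-dependent.
function value_dependent(x)
    s = zero(eltype(x))
    for xi in x
        s += xi > 0 ? sin(xi) : cos(xi)
    end
    return s
end
```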
cc @vchuravy @yebai @oxinabox