
Add support for per-module max_methods. #43370

Merged
merged 12 commits into JuliaLang:master from chriselrod:permodulemaxmethods on Dec 17, 2021

Conversation

chriselrod
Contributor

Adding

if isdefined(Base, :Experimental) && isdefined(Base.Experimental, Symbol("@max_methods"))
    # Only opt in on Julia versions where the macro exists
    @eval Base.Experimental.@max_methods 1
end

to Plots.jl:
Master:

julia> using Plots, SnoopCompile, BenchmarkTools

julia> tinf = @snoopi_deep plot(1:10, 1:10)
InferenceTimingNode: 2.085727/12.130569 on Core.Compiler.Timings.ROOT() with 98 direct children

julia> @benchmark plot(1:10, 1:10)
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  171.533 μs …   6.345 ms  ┊ GC (min … max): 0.00% … 91.09%
 Time  (median):     177.000 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   187.454 μs ± 173.048 μs  ┊ GC (mean ± σ):  2.97% ±  3.13%

    █▆
  ▂▆██▆▃▃▃▃▃▃▂▂▂▂▂▂▂▂▂▁▂▁▂▁▁▁▁▂▁▂▂▁▁▁▁▂▁▁▁▁▁▁▂▁▂▁▁▁▁▁▁▁▁▁▁▁▂▂▃▂ ▂
  172 μs           Histogram: frequency by time          275 μs <

 Memory estimate: 63.12 KiB, allocs estimate: 976.

This PR:

julia> using Plots, SnoopCompile, BenchmarkTools

julia> tinf = @snoopi_deep plot(1:10, 1:10)
InferenceTimingNode: 1.709860/6.303342 on Core.Compiler.Timings.ROOT() with 109 direct children

julia> @benchmark plot(1:10, 1:10)
BenchmarkTools.Trial: 10000 samples with 1 evaluation.
 Range (min … max):  226.586 μs …   5.654 ms  ┊ GC (min … max): 0.00% … 92.18%
 Time  (median):     232.976 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   243.641 μs ± 191.944 μs  ┊ GC (mean ± σ):  2.78% ±  3.39%

  ▂▇█▆▅▄▄▃▂▁                                                    ▂
  ███████████▆▅▅▅▅▄▄▄▄▅▄▃▄▄▁▅▁▁▁▄▄▄▁▄▁▁▅▃▄▃▁▁▃▄▁▁▄▄▃▅▃▄▃▃▃▄▅▅▇▇ █
  227 μs        Histogram: log(frequency) by time        378 μs <

 Memory estimate: 63.54 KiB, allocs estimate: 987.

Adding @max_methods 1 to OrdinaryDiffEq, DiffEqBase, DifferentialEquations, VectorizationBase, LoopVectorization, TriangularSolve, and RecursiveFactorization, I get:
Master:

julia> using DifferentialEquations, SnoopCompile, BenchmarkTools

julia> lorenz = (du,u,p,t) -> begin
        du[1] = 10.0(u[2]-u[1])
        du[2] = u[1]*(28.0-u[3]) - u[2]
        du[3] = u[1]*u[2] - (8/3)*u[3]
       end
#1 (generic function with 1 method)

julia> u0 = [1.0;0.0;0.0]
3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> tspan = (0.0,100.0)
(0.0, 100.0)

julia> prob = ODEProblem(lorenz,u0,tspan)
ODEProblem with uType Vector{Float64} and tType Float64. In-place: true
timespan: (0.0, 100.0)
u0: 3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> alg = Rodas5()
Rodas5{0, true, DefaultLinSolve, Val{:forward}}(DefaultLinSolve(nothing, nothing, nothing))

julia> tinf = @snoopi_deep solve(prob,alg)
InferenceTimingNode: 4.206624/23.152936 on Core.Compiler.Timings.ROOT() with 6 direct children

julia> @benchmark solve($prob, $alg)
BenchmarkTools.Trial: 2299 samples with 1 evaluation.
 Range (min … max):  2.070 ms …  10.632 ms  ┊ GC (min … max): 0.00% … 78.31%
 Time  (median):     2.110 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.171 ms ± 646.150 μs  ┊ GC (mean ± σ):  2.27% ±  6.05%

         ▄▅█▇▅▄▁▁▁
  ▂▃▅▆▆▇▇██████████▆▅▄▃▃▃▃▃▃▃▂▂▂▂▁▂▂▁▁▂▂▃▄▃▄▃▃▃▂▃▃▃▂▂▂▂▂▂▂▂▁▂ ▃
  2.07 ms         Histogram: frequency by time        2.28 ms <

 Memory estimate: 666.94 KiB, allocs estimate: 8955.

julia> lorenz = (du,u,p,t) -> begin
        du[1] = 10.0(u[2]-u[1])
        du[2] = u[1]*(28.0-u[3]) - u[2]
        du[3] = u[1]*u[2] - (8/3)*u[3]
       end
#3 (generic function with 1 method)

julia> prob = ODEProblem(lorenz,u0,tspan)
ODEProblem with uType Vector{Float64} and tType Float64. In-place: true
timespan: (0.0, 100.0)
u0: 3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> tinf = @snoopi_deep solve(prob,alg)
InferenceTimingNode: 1.233243/2.736248 on Core.Compiler.Timings.ROOT() with 3 direct children

This PR:

julia> using DifferentialEquations, SnoopCompile, BenchmarkTools

julia> lorenz = (du,u,p,t) -> begin
        du[1] = 10.0(u[2]-u[1])
        du[2] = u[1]*(28.0-u[3]) - u[2]
        du[3] = u[1]*u[2] - (8/3)*u[3]
       end
#1 (generic function with 1 method)

julia> u0 = [1.0;0.0;0.0]
3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> tspan = (0.0,100.0)
(0.0, 100.0)

julia> prob = ODEProblem(lorenz,u0,tspan)
ODEProblem with uType Vector{Float64} and tType Float64. In-place: true
timespan: (0.0, 100.0)
u0: 3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> alg = Rodas5()
Rodas5{0, true, DefaultLinSolve, Val{:forward}}(DefaultLinSolve(nothing, nothing, nothing))

julia> tinf = @snoopi_deep solve(prob,alg)
InferenceTimingNode: 1.595283/3.539406 on Core.Compiler.Timings.ROOT() with 8 direct children

julia> @benchmark solve($prob, $alg)
BenchmarkTools.Trial: 2025 samples with 1 evaluation.
 Range (min … max):  2.343 ms …  10.539 ms  ┊ GC (min … max): 0.00% … 76.21%
 Time  (median):     2.400 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.465 ms ± 682.168 μs  ┊ GC (mean ± σ):  2.34% ±  6.45%

             ▁▄▅▆█▆▇▆▂▃
  ▂▁▂▂▂▃▃▃▄▅▇██████████▇▇▅▅▅▄▄▃▃▃▃▂▂▃▂▃▂▂▂▂▂▂▃▂▂▃▃▃▃▃▂▂▁▂▂▂▁▂ ▄
  2.34 ms         Histogram: frequency by time        2.54 ms <

 Memory estimate: 717.47 KiB, allocs estimate: 10572.

julia> lorenz = (du,u,p,t) -> begin
        du[1] = 10.0(u[2]-u[1])
        du[2] = u[1]*(28.0-u[3]) - u[2]
        du[3] = u[1]*u[2] - (8/3)*u[3]
       end
#3 (generic function with 1 method)

julia> prob = ODEProblem(lorenz,u0,tspan)
ODEProblem with uType Vector{Float64} and tType Float64. In-place: true
timespan: (0.0, 100.0)
u0: 3-element Vector{Float64}:
 1.0
 0.0
 0.0

julia> tinf = @snoopi_deep solve(prob,alg)
InferenceTimingNode: 1.197684/2.313140 on Core.Compiler.Timings.ROOT() with 3 direct children

We'll have to look at which of the type instabilities that appear here are causing the performance regression before adding the option to these packages, but the possible latency improvements are enticing.

@YingboMa
Contributor

YingboMa commented Dec 8, 2021

What's the implication of @max_methods 1? For instance, will Union{Int,Vector{Int}} be specialized? What about the iteration protocol?

@KristofferC
Member

This is what the default max_methods = 3 represents:

julia> f(x::Int) = 1;

julia> g(x) = f(x)
g (generic function with 1 method)

julia> code_warntype(g, Tuple{Any})
MethodInstance for g(::Any)
  from g(x) in Main at REPL[2]:1
Arguments
  #self#::Core.Const(g)
  x::Any
Body::Int64
1 ─ %1 = Main.f(x)::Core.Const(1)
└──      return %1


julia> f(x::Float64) = 2.0;

julia> code_warntype(g, Tuple{Any})
MethodInstance for g(::Any)
  from g(x) in Main at REPL[2]:1
Arguments
  #self#::Core.Const(g)
  x::Any
Body::Union{Float64, Int64}
1 ─ %1 = Main.f(x)::Union{Float64, Int64}
└──      return %1


julia> f(x::Float32) = 3.0f0;

julia> code_warntype(g, Tuple{Any})
MethodInstance for g(::Any)
  from g(x) in Main at REPL[2]:1
Arguments
  #self#::Core.Const(g)
  x::Any
Body::Union{Float32, Float64, Int64}
1 ─ %1 = Main.f(x)::Union{Float32, Float64, Int64}
└──      return %1


julia> f(x::String) = "4"
f (generic function with 4 methods)

julia> code_warntype(g, Tuple{Any})
MethodInstance for g(::Any)
  from g(x) in Main at REPL[2]:1
Arguments
  #self#::Core.Const(g)
  x::Any
Body::Any
1 ─ %1 = Main.f(x)::Any
└──      return %1
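Under this PR, the same widening can be made to kick in earlier on a per-module basis. A minimal sketch (module name made up; assumes a Julia build where Base.Experimental.@max_methods exists):

```julia
module Tight
Base.Experimental.@max_methods 1  # per-module limit added by this PR

f(x::Int) = 1
f(x::Float64) = 2.0
g(x) = f(x)
end

# With two matching methods and a limit of 1, inference should give up
# on g(::Any) immediately and widen to Any (not Union{Float64, Int64}):
Base.return_types(Tight.g, (Any,))
```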

@chriselrod
Contributor Author

What's the implication of @max_methods 1? For instance, will Union{Int,Vector{Int}} be specialized? What about the iteration protocol?

It will not cause problems for the iteration protocol (it isn't applicable there), because lowered iteration does not rely on method matching:

valuestate = iterate(iter, state)
valuestate === nothing && break
value, state = valuestate

Union splits should (likewise) generally be fine.
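For example (function names made up): when the argument type is a small concrete Union, inference splits on the argument, and each branch sees a single matching method, so a limit of 1 should not be exceeded:

```julia
f(x::Int) = 1
f(x::Vector{Int}) = 2
g(x::Union{Int, Vector{Int}}) = f(x)

# Each union member matches exactly one method of f, so union splitting
# still yields a concrete result; a per-module max_methods of 1 should
# not change this:
Base.return_types(g, (Union{Int, Vector{Int}},))  # [Int64]
```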

@timholy (Member) left a comment

This LGTM, but it would be ideal if someone who spends more time hacking on inference than me took a look.

Overall, I agree there are very good reasons to target this parameter, and I suspect many packages should set it.

@vchuravy vchuravy requested a review from aviatesk December 10, 2021 03:35
@aviatesk aviatesk added merge me PR is reviewed. Merge when all tests are passing latency Latency labels Dec 16, 2021
@aviatesk aviatesk merged commit a957953 into JuliaLang:master Dec 17, 2021
@chriselrod chriselrod deleted the permodulemaxmethods branch December 17, 2021 11:59
@DilumAluthge DilumAluthge removed the merge me PR is reviewed. Merge when all tests are passing label Dec 17, 2021
LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Feb 22, 2022
LilithHafner pushed a commit to LilithHafner/julia that referenced this pull request Mar 8, 2022
@KristofferC
Member

Shouldn't this have some tests?

@chriselrod
Contributor Author

What should tests look like?
If you want a usage example:
https://github.com/JuliaPlots/Plots.jl/blob/ce961acec433d34be00470e2c07500e9be894080/src/Plots.jl#L6-L8

I should probably add it to LoopVectorization, too.

@KristofferC
Member

KristofferC commented Jan 16, 2023

What should tests look like?

Like using the @max_methods functionality and checking that it has the intended effect: that inference stops once it reaches the specified max methods instead of the default 3, that the macro errors when applied to the wrong kind of expression, that the module version works on all methods in the module, etc. I don't really get the question.
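A hedged sketch of what such a test might look like (module and function names made up; assumes Base.Experimental.@max_methods is available):

```julia
using Test

module MaxMethodsOne
Base.Experimental.@max_methods 1

f(x::Int) = 1
f(x::Float64) = 2.0
g(x) = f(x)
end

@testset "per-module max_methods" begin
    # Two methods match the f(x) call; with the module-wide limit of 1,
    # inference of g(::Any) should give up and widen to Any rather than
    # Union{Float64, Int64}.
    @test Base.return_types(MaxMethodsOne.g, (Any,)) == Any[Any]
end
```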

@KristofferC
Member

Okay, this PR did less than I thought and I didn't find the tests of the original max_methods PR due to #48316

@chriselrod
Contributor Author

I think those tests are for #44426
This PR still needs tests.

@KristofferC
Member

Yes, but I didn't find any tests at all :p
