Implementation of Iterative ProductMode() #184

Open · wants to merge 14 commits into base: master

Changes from 7 commits
1 change: 1 addition & 0 deletions Project.toml
@@ -8,6 +8,7 @@ Dualization = "191a621a-6537-11e9-281d-650236a99e60"
JuMP = "4076af6c-e467-56ae-b986-b466b2749572"
LinearAlgebra = "37e2e46d-f89d-539d-b4ee-838fcccc9c8e"
MathOptInterface = "b8f27783-ece8-5eb3-8dc8-9495eed66fee"
Printf = "de0858da-6303-5e67-8744-51eddeeeb8d7"

[compat]
Dualization = "0.5.4"
2 changes: 2 additions & 0 deletions README.md
@@ -81,6 +81,8 @@ expansions
Note that binary expansions require variables to have upper and lower bounds.
Also, note that the `Gurobi` solver supports products, but requires [setting the
`"NonConvex"` options](https://github.com/jump-dev/Gurobi.jl#using-gurobi-v90-and-you-got-an-error-like-q-not-psd).
Since, in general, constraint qualification does not hold for the lower-level
complementarity conditions, these conditions may be regularized by relaxing them to an inequality with a small positive constant. For more challenging models, an additional solution procedure is implemented, in which the regularization is reduced iteratively and the NLP solver is warmstarted with the primal/dual values of the previous run. For more information and some recommendations, see the examples provided in the docs.
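
For illustration, a minimal sketch of the iterative procedure (option values as in the examples; they are suggestions, not requirements):

```julia
using BilevelJuMP, Ipopt

# Solve once with regularization 1e-0, then re-solve for each entry of IterEps,
# warmstarting Ipopt from the previous primal/dual solution.
IterEps = [1e-0 * 0.99^i for i in 1:1500]
IterAttr = Dict("mu_init" => 1e-9, "warm_start_init_point" => "yes")
model = BilevelModel(Ipopt.Optimizer,
    mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))
```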

Finally, one can use `BilevelJuMP.MixedMode(default = mode)` where `mode` is one
of the other modes described above. With this method it is possible to set
129 changes: 129 additions & 0 deletions docs/src/examples/Iterative_example1.jl
@@ -0,0 +1,129 @@
# # Example 1 on Iterative Solution for ProductMode
# This example is again taken from the book Decomposition Techniques in Mathematical Programming,
# Chapter 7.2, page 281, [url](https://www.springer.com/gp/book/9783540276852).
# The formulation is the same as used before, but we now focus on more practical aspects of the iterative product mode.
# In particular, a few tricks for solving iteratively with Ipopt are provided.

# Model of the problem
# First level
# ```math
# \min -x + 4y + z,\\
# \notag s.t.\\
# y + 2x + z \leq 9,\\
# z = 1,\\
# ```
# Second level
# ```math
# \min -x - y + w,\\
# \notag s.t.\\
# y \geq 0,\\
# x + y + w \leq 8,\\
# x \geq 0,\\
# x \leq 4,\\
# w = 1.\\
# ```

using JuMP, BilevelJuMP, Ipopt

# First, we need to specify an iterable with our desired regularization values.
# The problem is solved once for each element of this series, and each step starts from the optimal values of the previous iteration.
# Note that this iterable specifies the regularization parameters used after a first solve with the initial value (set to 1e-0 below).
IterEps = [1e-0 * 0.99^i for i in 1:1500]

# It is also possible to pass solver attributes that are not applied to the initial solve, but only to subsequent ones.
# Warmstarting interior point algorithms is difficult, but the following options can be recommended for Ipopt.
# Ipopt requires warmstarts to be enabled explicitly; this may vary with other solvers.
# In addition, we suppress solver output for all but the first iteration (and could instead write a log to a file using the commented lines).

IterAttr = Dict(
    "mu_init" => 1e-9,
    "warm_start_init_point" => "yes",
    "warm_start_bound_push" => 1e-12,
    "warm_start_bound_frac" => 1e-12,
    "warm_start_slack_bound_frac" => 1e-12,
    "warm_start_slack_bound_push" => 1e-12,
    "warm_start_mult_bound_push" => 1e-12,
    # "output_file" => "solver.log",
    # "file_print_level" => 8,
    "print_level" => 0,
)
# Note that these options may differ for other solvers.
# You can also specify further options here.
# For example, you could experiment with an adaptive mu strategy in the initial solve and a monotone mu strategy in subsequent iterations (see the Ipopt options docs); a commented sketch of this idea follows.
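# The following commented lines sketch that setup (`mu_strategy` is a standard Ipopt option; whether it helps is problem-dependent, and `model` is only created below):
# IterAttr["mu_strategy"] = "monotone" # monotone mu for the warmstarted re-solves
# set_optimizer_attribute(model, "mu_strategy", "adaptive") # adaptive mu for the initial solve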

# We can now specify our model with an initial regularization of 1e-0, followed by the iterative values and settings:
model = BilevelModel(Ipopt.Optimizer, mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))

# The following lines are copied from the example as presented before:
@variable(Upper(model), x)
@variable(UpperOnly(model), z)
@variable(Lower(model), y)
@variable(LowerOnly(model), w)
@objective(Upper(model), Min, -x + 4y + z)
@constraint(Upper(model), y + 2x + z <= 9)
@constraint(Upper(model), z == 1)
@objective(Lower(model), Min, -x - y + w)
@constraint(Lower(model), y >= 0)
@constraint(Lower(model), x + y + w <= 8)
@constraint(Lower(model), x >= 0)
@constraint(Lower(model), x <= 4)
@constraint(Lower(model), w == 1)

# Setting the right solver options for NLPs can make a huge difference, and it is recommended to experiment with these when solving challenging models.
# All options set with set_optimizer_attribute() carry over to subsequent iterations.
# If you want to return to default settings for the following solves, you have to specify the defaults explicitly in the IterAttr options.
# For the initial solution, we want to solve with a target mu of 1e-9 (see the Ipopt docs for an explanation), which equals the initial mu set in IterAttr.
# Other options can easily be set the same way.
set_optimizer_attribute(model, "mu_target", 1e-9)

# Warmstarting the first iteration is also possible: start values need to be provided via the set_start_value() and set_dual_start_value() functions, or in the variable declarations.
# Note that if you are using starting values, the solver options must be set accordingly for the first iteration (as with mu_target above, i.e., not via IterAttr).
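# For illustration, a warmstart of the first solve could look like the following commented lines (the values are hypothetical, and a constraint reference must be stored at declaration to set a dual start):
# con = @constraint(Lower(model), x + y + w <= 8) # keep the reference when declaring
# set_start_value(x, 1.0) # primal start for a variable
# set_dual_start_value(con, 0.0) # dual start for a lower-level constraint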

# Finally, we can optimize:
optimize!(model)

# Auto testing
@test value(x) ≈ 1 atol=1e-4
@test value(y) ≈ 6 atol=1e-4
@test value(z) ≈ 1 atol=1e-4
@test value(w) ≈ 1 atol=1e-4


# It is important to check that the solver actually accepts and uses our warmstart as desired; otherwise, the iterative solution procedure does not work at all!
# The following script illustrates how to verify the desired behavior; the mathematical model is unchanged from above.
# This time, we do not suppress any solver output, and IterEps contains only a single value (the same as the regularization specified for the initial solve).
# If all warmstart options are set correctly, you should see that the solver (in our case Ipopt) accepts the warmstart solution as optimal.
# For more complicated problems, it may be necessary to relax this statement to near-optimal. Still, the solver should need very few iterations to find the optimal solution again.
# Primal and dual infeasibility at the initial point (inf_pr/inf_du) should in general be very low (i.e., on the order of the solver tolerances for optimal termination).
# In our simple example, both should be less than 1e-9.
# Depending on the actual problem and modifications made by Ipopt (such as scaling), this may not always hold, and you may occasionally see values of roughly 1e-6.
# In general, you should always check that provided warmstarts actually work as desired before going ahead!
# It is left to the user to verify that an empty IterAttr does not yield the same behavior.

using JuMP, BilevelJuMP, Ipopt
IterEps = [1e-0] # a single step with the same regularization as the initial solve
IterAttr = Dict(
    "mu_init" => 1e-9,
    "warm_start_init_point" => "yes",
    "warm_start_bound_push" => 1e-12,
    "warm_start_bound_frac" => 1e-12,
    "warm_start_slack_bound_frac" => 1e-12,
    "warm_start_slack_bound_push" => 1e-12,
    "warm_start_mult_bound_push" => 1e-12,
)
TestModel = BilevelModel(Ipopt.Optimizer, mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))
@variable(Upper(TestModel), x)
@variable(UpperOnly(TestModel), z)
@variable(Lower(TestModel), y)
@variable(LowerOnly(TestModel), w)
@objective(Upper(TestModel), Min, -x + 4y + z)
@constraint(Upper(TestModel), y + 2x + z <= 9)
@constraint(Upper(TestModel), z == 1)
@objective(Lower(TestModel), Min, -x - y + w)
@constraint(Lower(TestModel), y >= 0)
@constraint(Lower(TestModel), x + y + w <= 8)
@constraint(Lower(TestModel), x >= 0)
@constraint(Lower(TestModel), x <= 4)
@constraint(Lower(TestModel), w == 1)
set_optimizer_attribute(TestModel, "mu_target", 1e-9)
optimize!(TestModel)
158 changes: 158 additions & 0 deletions docs/src/examples/Iterative_example2.jl
@@ -0,0 +1,158 @@
# # Example 2 on Iterative Solution for ProductMode
# This example is from the paper S. Siddiqui & S. A. Gabriel (2012): "An SOS1-Based Approach for Solving MPECs with a Natural Gas Market Application", published in Networks and Spatial Economics.
# It can be found in Section 3, "Numerical Examples".

# The leader solves the following first-level problem:
# ```math
# \max_{Q \geq 0} (a-b(q_1 + q_2 + Q))Q - CQ,\\
# ```
# Firm i = 1,2 solves the following lower-level problem, taking the upper-level quantity Q as fixed and maximizing its profit while in Nash-Cournot competition with the other Stackelberg follower firm j:
# ```math
# \max_{q_i \geq 0} (a-b(q_i + q_j + Q))q_i - c_i q_i,\\
# ```
# Subsequently, we will combine the two lower-level players' problems into a single one with equivalent KKT conditions.
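# As a sketch of why a single combined problem suffices, the code below minimizes the negative of
# ```math
# \pi_1 + \pi_2 + b q_1 q_2, \qquad \pi_i = (a - b(q_1 + q_2 + Q)) q_i - c_i q_i.
# ```
# Differentiating with respect to ``q_i``, the cross-term ``-b q_j`` from ``\pi_j`` cancels against ``+b q_j`` from ``b q_1 q_2``, leaving exactly firm ``i``'s own first-order condition. Hence the KKT conditions of the combined problem coincide with the Nash-Cournot equilibrium conditions of the two followers. (Note that the code below uses the constant ``C`` for the followers' marginal cost.)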

using JuMP, BilevelJuMP, Ipopt

F = [1,2]
c = Dict(1=>1, 2=>1)
C = 1
a = 13
b = 1

# First, we need to specify an iterable with our desired regularization values.
# The problem is solved once for each element of this series, and each step starts from the optimal values of the previous iteration.
# Note that this iterable specifies the regularization parameters used after a first solve with the initial value 1e-0.
IterEps = [1e-0 * 0.99^i for i in 1:1500]

# It is also possible to pass solver attributes that are not applied to the initial solve, but only to subsequent ones.
# Warmstarting interior point algorithms is difficult, but the following options can be recommended.
# Ipopt requires warmstarts to be enabled explicitly; this may vary with other solvers.
# Also, we suppress solver output for all but the first iteration (and could instead write a log to a file using the commented lines).

IterAttr = Dict(
    "mu_init" => 1e-9,
    "warm_start_init_point" => "yes",
    "warm_start_bound_push" => 1e-12,
    "warm_start_bound_frac" => 1e-12,
    "warm_start_slack_bound_frac" => 1e-12,
    "warm_start_slack_bound_push" => 1e-12,
    "warm_start_mult_bound_push" => 1e-12,
    # "output_file" => "solver.log",
    # "file_print_level" => 8,
    "print_level" => 0,
)
# Note that these options may differ for other solvers.
# You can also specify further options here.
# For example, you could experiment with an adaptive mu strategy in the initial solve and a monotone mu strategy in subsequent iterations (see the Ipopt options docs).

# We can now specify our model with an initial regularization of 1e-0, followed by the iterative values and settings:
model1 = BilevelModel(Ipopt.Optimizer, mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))

# The following lines merge the two lower-level problems into a single one. It is left to the reader to check that the KKT conditions are indeed equivalent (see the sketch above):
@variable(Lower(model1), q[F] >= 0)
@variable(Upper(model1), Q >= 0)

@objective(Upper(model1), Max, (a - b * (q[1] + q[2] + Q)) * Q - C * Q)

@objective(Lower(model1), Min, -((a - b * (q[1] + q[2] + Q)) * q[1] - C * q[1] + (a - b * (q[1] + q[2] + Q)) * q[2] - C * q[2] + b * q[1] * q[2]))

# Setting the right solver options for NLPs can make a huge difference, and it is recommended to experiment with these when solving challenging models.
# All options set with set_optimizer_attribute() carry over to subsequent iterations.
# If you want to return to default settings for the following solves, you have to specify the defaults explicitly in the IterAttr options.
# For the initial solution, we want to solve with a target mu of 1e-9 (see the Ipopt docs for an explanation), which equals the initial mu set in IterAttr.
# Other options can easily be set the same way.
set_optimizer_attribute(model1, "mu_target", 1e-9)

# Warmstarting the first iteration is also possible: start values need to be provided via the set_start_value() and set_dual_start_value() functions, or in the variable declarations.
# Note that if you are using starting values, the solver options must be set accordingly for the first iteration (as above with mu_target, i.e., not via IterAttr).

# Finally, we can optimize:
optimize!(model1)

# To verify that the KKT conditions are set up correctly, we could also write the model to an LP file:
# optimize!(model1, bilevel_prob = "blp.lp")

# Auto testing

@test isapprox(value(model1[:Q]), 6; atol=1e-6)
@test isapprox(value.(model1[:q]).data, [2,2]; atol=1e-6)

#######################
# The following lines show the other two datasets provided in the original paper.
# Dataset 2:
F = [1,2]
c = Dict(1=>1, 2=>1)
C = 1
a = 13
b = 0.1

IterEps = [1e-0 * 0.99^i for i in 1:1500]

IterAttr = Dict(
    "mu_init" => 1e-9,
    "warm_start_init_point" => "yes",
    "warm_start_bound_push" => 1e-12,
    "warm_start_bound_frac" => 1e-12,
    "warm_start_slack_bound_frac" => 1e-12,
    "warm_start_slack_bound_push" => 1e-12,
    "warm_start_mult_bound_push" => 1e-12,
    "print_level" => 0,
)

model2 = BilevelModel(Ipopt.Optimizer, mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))

@variable(Lower(model2), q[F] >= 0)
@variable(Upper(model2), Q >= 0)

@objective(Upper(model2), Max, (a - b * (q[1] + q[2] + Q)) * Q - C * Q)

@objective(Lower(model2), Min, -((a - b * (q[1] + q[2] + Q)) * q[1] - C * q[1] + (a - b * (q[1] + q[2] + Q)) * q[2] - C * q[2] + b * q[1] * q[2]))

set_optimizer_attribute(model2, "mu_target", 1e-9)

optimize!(model2)

# Auto testing

@test isapprox(value(model2[:Q]), 60; atol=1e-6)
@test isapprox(value.(model2[:q]).data, [20,20]; atol=1e-6)

#######################
# Dataset 3:
F = [1,2]
c = Dict(1=>1, 2=>1)
C = 2
a = 13
b = 0.1

IterEps = [1e-0 * 0.99^i for i in 1:1500]

IterAttr = Dict(
    "mu_init" => 1e-9,
    "warm_start_init_point" => "yes",
    "warm_start_bound_push" => 1e-12,
    "warm_start_bound_frac" => 1e-12,
    "warm_start_slack_bound_frac" => 1e-12,
    "warm_start_slack_bound_push" => 1e-12,
    "warm_start_mult_bound_push" => 1e-12,
    "print_level" => 0,
)

model3 = BilevelModel(Ipopt.Optimizer, mode = BilevelJuMP.ProductMode(1e-0; IterEps = IterEps, IterAttr = IterAttr))

@variable(Lower(model3), q[F] >= 0)
@variable(Upper(model3), Q >= 0)

@objective(Upper(model3), Max, (a - b * (q[1] + q[2] + Q)) * Q - C * Q)

@objective(Lower(model3), Min, -((a - b * (q[1] + q[2] + Q)) * q[1] - C * q[1] + (a - b * (q[1] + q[2] + Q)) * q[2] - C * q[2] + b * q[1] * q[2]))

set_optimizer_attribute(model3, "mu_target", 1e-9)

optimize!(model3)

# Auto testing

@test isapprox(value(model3[:Q]), 55; atol=1e-6)
@test isapprox(value.(model3[:q]).data, [18.333,18.333]; atol=1e-2)
1 change: 1 addition & 0 deletions src/BilevelJuMP.jl
@@ -6,6 +6,7 @@ const MOIU = MathOptInterface.Utilities
using JuMP
using Dualization
using LinearAlgebra
using Printf

export
BilevelModel,