IE426 F.A.Q.
These are questions asked by students during or at the end of a class. I will record them here if I think they are useful for everyone. If you have a question, feel free to email it to me.
- Lecture 2: convexity, relaxation, lower/upper bound;
- Lecture 4: Linear Programming, AMPL;
- Lecture 10: Pre-quiz;
- Lecture 14: Integer Programming;
- Lecture 15: Branch and Bound;
Lecture 2 – September 1, 2009
Q.: Why is the function
A.: We can prove it by applying the definition: a function
Now take the two points and
, and
. We have
Q.: How does the function
A.: It looks like a saddle:

Q.: What is a relaxation good for? When solving a relaxation to optimality, the solution might not be feasible for the original problem.
A.: That is true. The feasible set
Q.: Why is the constraint
A.: One clarification first: we are considering non-negative values of

Here we have a segment between two points and
, all of whose points are not in the set, that is, for none of the points in between do we have
.
The inequality constraint is something else. Here we have that a portion of the Cartesian plane that includes not only the line
, but all points above this line (the shaded area):

Therefore, for any two points in , all points in between also belong to
. The convexity of
can be shown by convexity of the function
, obtained by its Hessian,
, which is positive semidefinite for any
.
Q.: Why don’t we prove that the set
A.: Sure. We want to prove that, for any two points
Q.: If
A.: If
means that there might be a point between and
such that the above inequality holds strictly, that is, such that
and therefore the set of points satisfying is nonconvex.
Lecture 4 – September 9, 2009
Q.: There are five variables in the two-year
project selection problem, one for each opportunity. Why aren’t there ten of them (five for each year)?
A.: Because that’s what the input said: “We have 5 investment opportunities over a 2-year term,” i.e., we’ll invest in the same funds this and next year. If we could change our mind in the second year, we’d have different variables, say , for the second year. The model would then change: from
to
where we changed the NPV given that is the NPV over the two years, while
and
are computed each on a one-year period (question for the Analytical Finance people: is
?)
Q.: What is meaning of the default keyword in AMPL?
A.: It is the initial value of a variable we, the modelers, are suggesting. It is totally irrelevant in the modeling part, but it can make a difference in the actual solving part. The solver, which we will use to look for a solution, may or may not take that value into account, as there are solvers whose result depends on the initial value of the variables. If we take a solver for convex optimization, such as Minos, included in the AMPL student edition, and apply it to a non-convex nonlinear problem, we may find a local optimum. This local optimum, in general, depends on the initial value we give the solver.
Q.: How do we check for convexity of a general function
A.: By looking at its Hessian, i.e., the
If the Hessian is Positive Semidefinite (PSD) over a certain set of points , then
is convex over
.
For instance, the Hessian of a quadratic function is
, which is constant. Therefore, a quadratic function is either convex for all
or nowhere. For this two-variable example,
is PSD if and only if
and
.
Lecture 10 – October 1, 2009
Q.: Is a problem
A.: Yes. Remember the two conditions for to be a relaxation of
. There is a
on the objective function and a non-strict containment (
) for the feasible set, therefore a problem
with the same objective function as
and the same feasible set as
is a relaxation of
.
Q.: Is the global optimum of
A: Yes. It is a feasible solution of , so it is an upper bound. It is a global optimum of a relaxation of
, so it is a lower bound too.
Q.: In order to qualify for a convex problem, must each of the constraint be convex? Or must all constraints combine to define a convex feasible set?
A.: The second rule (convexity of the feasible set) is necessary for convexity of a problem, the other condition being convexity of the objective function. Convexity of the feasible set is implied by the first rule (all constraints are convex), which however is only a *sufficient* condition for convexity of the feasible region.
Q.: Must the objective function be convex in just the feasible set of variables? (Need not to be convex outside of the feasible set?)
A.: Yes.
Q.: What do you mean by the bounded and unbounded problem? Is it related to the feasible set?
A.: A bounded optimization problem is one that admits a global optimum whose value is finite. In an unbounded problem, there is no global optimum with finite value (although there might be many feasible solutions with finite value). It is related to both the feasible set and the objective function.
For example, a problem of the form has an unbounded feasible set, but is bounded: its global optimum is
and has value of 2.
A problem of the form has an unbounded feasible set, and is unbounded: there is no global optimum with finite value, as any
, therefore any large negative number
, is feasible and has a value of
.
Q.: I don’t understand the notations you used to define the neighborhood of x (the local optima). Does it mean the square of the absolute value? (lecture 2)
is the Euclidean distance between
and
, two
-vectors: it is defined as
.
Q.: As your lectures don’t cover all the details in the reading materials, will those uncovered details be in the quiz?
A.: Some references in the lectures are to simple examples in the books. What I extracted from the textbook was sufficient for the lecture.
Q.: I thought by looking at the given objective function and the form of the constraints, I could determine if the problem is convex. Then I came across more examples which required that the feasible solution(s) be found and then, based on these, and “re-examining” the objective function and constraints with the feasible solutions, the problem was described as convex or not. So, the question is, do you have to do the second step of considering the feasible solution set, or not?
A.: No. These were just very special cases. With real-world problem, most of the times if there is one nonconvex constraint (or the objective is nonconvex) then the problem is nonconvex.
Q.: When the objective function and all other constraints are linear, but one constraint is restricting a variable to be integer, does the restriction of a variable to be integer still make the overall problem linear?
A.: It makes the problem Integer Linear, or better, Mixed Integer Linear as some variables are continuous and other are integer. Integer Linear Programming (or Integer Programming, IP for short) or Mixed Integer Linear Programming (MILP) problems are not Linear Programming (LP) problems. They are far more difficult than LP.
Q.: This integer restriction generally makes the problem non-convex, correct? (although there is an example where, because there was only one feasible solution (0,0), the problem was still called convex).
A.: Correct. Apart from special cases (which don’t occur very often in practice), ILPs and MILPs are nonconvex..
Lecture 15 – October 22, 2009
Q.: If the solution to the linear relaxation of an IP problem might be far from the actual optimal solution of the IP, why is it used in Branch-and-bound?
A.: For several reasons. First, it gives a lower bound on the original problem, which is necessary in branch-and-bound. Second, when partitioning the problem into two new subproblems (the branching step) one should make sure that the solution to the current relaxation is eliminated by branching, so that none of the two new subproblems admit it as a solution to the relaxation.
Lecture 15 – October 22, 2009
Q.: In the branch-and-bound algorithm, when the solution
A.: This question does not admit a simple answer. Given that the number of branch-and-bound (B&B) nodes generated affects the CPU time taken to solve an IP, it is usually desirable to have a strategy that minimizes the number of BB nodes, including choosing the proper variable at each node.
There are is a large body of scientific work on the subject, but to the interested reader I’d suggest to look at methods such as Strong Branching and Pseudocost Branching (for a quick introduction, see Prof. Linderoth’s slides for course IE418), and a “combination” of the two, called Reliability Branching (this is a journal article published on Operations Research Letters).