We then compare the performance of our solver with other leading solvers on these continuous and mixed-integer benchmarks. Finally, in Chapter 6, we briefly summarize the research work we have presented in this thesis, and point out future directions to extend this research.

Chapter 2

Previous Work

In this chapter, we first review existing penalty methods for solving constrained optimization problems and discuss their assumptions and limitations. We then review existing partitioning methods for solving mathematical programming problems, and point out what is new in our proposed constraint partitioning approach. Finally, we survey existing automated planning algorithms and show that most of them solve a problem as a whole without partitioning.

2.1 Existing Penalty Methods

In this section, we survey existing penalty methods for constrained optimization. The concepts of saddle points and penalty formulations are important and form the basis of our theory presented in Chapter 3.

Penalty methods belong to a general approach that can solve continuous, discrete, and mixed constrained optimization problems, with no continuity, differentiability, and convexity requirements. Penalty methods are the primary methods for solving constrained problems.

A penalty function of Pm is a summation of its objective and its constraint functions weighted by penalties. Using penalty vectors α ∈ R^m and β ∈ R^r, the general penalty function for Pm is:

    Lp(z, α, β) = f(z) + α^T P(h(z)) + β^T Q(g(z)),    (2.1)

where P and Q are transformation functions. The goal of a penalty method is to find suitable α* and β* in such a way that the x* that minimizes (2.1) corresponds to either a constrained global minimum (CGM), which is feasible and has the best objective value in the entire search space, or a constrained local minimum (CLM), which is feasible and has the best objective value in a pre-defined neighborhood.

[Figure 2.1: A classification of existing penalty methods. The root, penalty functions for constraint programming, branches into global-optimality methods (static, dynamic, death, and discrete penalty methods, classified as exact or inexact) and local-optimality methods (Lagrangian and l1-penalty methods, both exact).]

We show a classification of existing penalty methods in Figure 2.1. Penalty methods can be classified into global optimal penalty methods that look for CGM solutions of Pm and local optimal penalty methods that look for CLM solutions of Pm. Along another dimension, penalty methods can be classified into exact penalty methods that can find exact CGM or CLM points under finite penalty values, and inexact penalty methods in which the minimization of a penalty function does not lead to exact CGM or CLM points [9, 10, 58, 70]. Instead, successive minimizations of an inexact penalty function with increasing penalty values lead to points infinitely close to a CGM or CLM solution; that is, inexact penalty methods converge to a CGM or CLM solution as the penalty values approach infinity.

2.1.1 Global optimal penalty methods

A static-penalty method [58, 70] formulates Pm as the minimization of (2.1) when the constraints of Pm are transformed by P and Q with the following properties: a) P(h(z)) ≥ 0 and Q(g(z)) ≥ 0; and b) P(h(z)) = 0 iff h(z) = 0, and Q(g(z)) = 0 iff g(z) ≤ 0. With penalty vectors α and β, an example static-penalty method solves the following problem with constant ρ ≥ 1:

    min_z Ls(z, α, β) = min_z { f(z) + Σ_{i=1}^{m} α_i |h_i(z)|^ρ + Σ_{j=1}^{r} β_j max(0, g_j(z))^ρ }.    (2.2)

A static penalty method can be an exact or an inexact penalty method, depending on the value of ρ. It is an exact penalty method when ρ = 1 and an inexact penalty method when ρ > 1 [10, 58, 70]. That is, when ρ = 1, there exist finite penalty values α and β such that the point minimizing the penalty function is exactly the CGM of Pm. However, when ρ > 1, the static penalty method is an inexact method and will converge to the CGM only as the penalty values approach infinity. We illustrate this property by an example.

Example 1. Consider the following simple optimization problem:

    min_x  f(x) = x    (2.3)
    subject to:  h(x) = x − 10 = 0.    (2.4)

Obviously, the CGM is at x* = 10. The static penalty function for this problem is:

    Lp(x, α) = f(x) + α|h(x)|^ρ = x + α|x − 10|^ρ.    (2.5)

We consider two cases:

a) When ρ = 1, the penalty function becomes:

    Lp(x, α) = f(x) + α|h(x)| = x + α|x − 10|.    (2.6)

We can show that there exists a finite penalty value α* = 1 such that, for any penalty value satisfying:

    α** ≥ α* = 1,    (2.7)

the point that minimizes Lp(x, α**) is exactly at the CGM x* = 10. To see this, we need to show that:

    Lp(x, α**) ≥ Lp(x*, α**) = Lp(10, α**) = 10,    ∀x ∈ R.    (2.8)

We show that (2.8) is true for both x > 10 and x < 10. When x > 10, we have:

    Lp(x, α**) = x + α**(x − 10) > x > 10,    ∀x > 10;    (2.9)

when x < 10, we have:

    Lp(x, α**) = x + α**(10 − x) = 10α** − (α** − 1)x = (α** − 1)(10 − x) + 10 ≥ 10,    ∀x < 10,    (2.10)

where the last inequality is true because α** ≥ α* = 1 and x < 10. Therefore, we have shown that Lp(x, α**) is minimized exactly at the CGM x* = 10 for all finite penalty values α** > 1.

b) When ρ > 1, Lp is strictly increasing for x > 10, so its minimum lies where x < 10. Differentiating Lp there and setting the result to zero:

    d/dx Lp(x, α) = 1 − ρα(10 − x)^(ρ−1) = 0,    (2.11)

we have that the minimum of Lp is at:

    x = 10 − (1/(ρα))^(1/(ρ−1)).    (2.12)

We can see that the static penalty method is an inexact penalty method in this case, because the x in (2.12) that minimizes Lp is not exactly at the CGM x* = 10. However, x converges to x* = 10 as the penalty value α approaches infinity.

A variation of the static-penalty method proposed in [40] uses discrete penalty values and assigns a penalty value α_i(h_i(z)) when h_i(z) exceeds a discrete level ℓ_i (resp., β_j(g_j(z)) when max(0, g_j(z)) exceeds a discrete level ℓ_j), where a higher level of constraint violation entails a larger penalty value. The penalty method then solves the following problem:

    min_z Ls(z, α, β) = min_z { f(z) + Σ_{i=1}^{m} α_i(h_i(z)) h_i(z)² + Σ_{j=1}^{r} β_j(g_j(z)) max(0, g_j(z))² }.    (2.13)

It is an inexact penalty method and requires the setting of many parameters. A limitation common to all static-penalty methods is that it is generally very difficult to set suitable penalty values statically. Also, these methods were developed for finding CGM solutions and cannot relate a CLM of Pm to a local minimum of the corresponding penalty function. Therefore, they are computationally expensive because they involve finding a global minimum of a nonlinear penalty function. Techniques like simulated annealing [49] can be used, although they only achieve global optimality with asymptotic convergence.

Instead of finding the penalty values by trial and error, a dynamic-penalty method [58, 70] increases the penalties in (2.2) gradually, finds the global minimum z* of (2.2) with respect to z for each combination of penalties, and stops when z* is a feasible solution to Pm. Like the static penalty methods, a dynamic penalty method can be an exact or an inexact method, depending on the value of ρ. Moreover, it has the same limitation as all static-penalty methods because it requires finding global minima of nonlinear functions.

There are many variations of dynamic penalty methods. A well-known one is the non-stationary method (NS) [42] that solves a sequence of problems, with the following problem in iteration t:

    min_z Lt(z, α, β) = min_z { f(z) + Σ_{i=1}^{m} α_i(t) |h_i(z)|^ρ + Σ_{j=1}^{r} β_j(t) max(0, g_j(z))^ρ },    (2.14)

where:

    α_i(t + 1) = α_i(t) + C · |h_i(z(t))|,    β_j(t + 1) = β_j(t) + C · max(0, g_j(z(t))).    (2.15)

Another variation increases, decreases, or keeps the penalty on each constraint depending on whether h_i(z) = 0 is, respectively, infeasible, feasible, or neither in the last ℓ iterations. That is:

    α_i(t + 1) = α_i(t)/λ1      if h_i(z(t)) = 0 is feasible in iterations t − ℓ + 1, ..., t,
               = λ2 · α_i(t)    if h_i(z(t)) = 0 is infeasible in iterations t − ℓ + 1, ..., t,
               = α_i(t)         otherwise,    (2.16)

where ℓ is a positive integer, λ1, λ2 > 1, and λ1 ≠ λ2 in order to avoid cycles in updates; β is updated in a similar fashion.

The threshold dynamic penalty method estimates and adjusts dynamically a near-feasible threshold q_i(t) (resp., q_j(t)) for each constraint in iteration t. Each threshold indicates a reasonable amount of violation allowed for promising but infeasible points during the solution of the following problem:

    min_z Lt(z, α, β) = min_z { f(z) + α(t) [ Σ_{i=1}^{m} (h_i(z)/q_i(t))² + Σ_{j=1}^{r} (max(0, g_j(z))/q_j(t))² ] }.    (2.17)

There are two other variations of global penalty methods that are exact methods.
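The exact versus inexact behavior shown in Example 1 is easy to check numerically. The following is a minimal sketch (the brute-force grid search and all names are our own illustration, not part of the cited methods) that minimizes the static penalty function (2.5) on a fine grid and compares the minimizer against the CGM x* = 10:

```python
# Numerical check of Example 1: minimize L_p(x, alpha) = x + alpha*|x - 10|**rho,
# the static penalty function (2.5), by brute-force grid search on [0, 20].

def penalty(x, alpha, rho):
    return x + alpha * abs(x - 10.0) ** rho

def argmin_on_grid(f, lo=0.0, hi=20.0, steps=200000):
    """Return the grid point minimizing f on [lo, hi] (fine enough for this toy)."""
    best_x, best_v = lo, f(lo)
    for i in range(1, steps + 1):
        x = lo + (hi - lo) * i / steps
        v = f(x)
        if v < best_v:
            best_x, best_v = x, v
    return best_x

# Exact case (rho = 1): any finite alpha > 1 puts the minimizer exactly at x* = 10.
x_exact = argmin_on_grid(lambda x: penalty(x, 2.0, 1))

# Inexact case (rho = 2): by (2.12) the minimizer is 10 - 1/(2*alpha),
# which approaches x* = 10 only as alpha grows.
x_inexact = argmin_on_grid(lambda x: penalty(x, 2.0, 2))        # 10 - 1/4  = 9.75
x_inexact_big = argmin_on_grid(lambda x: penalty(x, 100.0, 2))  # 10 - 1/200 = 9.995

print(x_exact, x_inexact, x_inexact_big)
```

With ρ = 1 the minimizer sits exactly at 10 for any penalty above the finite threshold α* = 1, while with ρ = 2 it tracks (2.12) and converges to 10 only as α approaches infinity.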
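The non-stationary update (2.15) can be sketched on the same toy problem with ρ = 2. Because the inner minimization of (2.14) is one-dimensional here, the sketch below substitutes the closed-form minimizer 10 − 1/(2α) from (2.12) for the global optimization step; the constant C, the starting penalty, and the stopping tolerance are arbitrary illustrative choices rather than part of the method's definition:

```python
# Sketch of the non-stationary (NS) dynamic-penalty iteration on Example 1
# with rho = 2: minimize (2.14), then raise the penalty by (2.15), stopping
# once the minimizer is near-feasible.

def ns_iterations(alpha0=1.0, C=10.0, tol=1e-2, max_iter=10000):
    alpha = alpha0
    for t in range(max_iter):
        x = 10.0 - 1.0 / (2.0 * alpha)   # closed-form inner minimizer, from (2.12)
        violation = abs(x - 10.0)        # |h(x)| for h(x) = x - 10
        if violation < tol:              # stop at a near-feasible solution
            return x, alpha, t
        alpha += C * violation           # penalty update (2.15)
    return x, alpha, max_iter

x, alpha, iters = ns_iterations()
print(x, alpha, iters)
```

The violation 1/(2α) shrinks as the penalty grows, so the iterates approach the CGM x* = 10, but only in the limit of unbounded α, which is exactly the inexact behavior described above.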
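The case analysis in the adaptive update (2.16) can be sketched as a small helper. Here `history` is a hypothetical list of per-iteration feasibility flags for one constraint, and the values of λ1, λ2, and ℓ are arbitrary examples subject only to the stated conditions λ1, λ2 > 1 and λ1 ≠ λ2:

```python
# Sketch of the adaptive penalty update (2.16) for one equality constraint:
# relax the penalty after ell consecutive feasible iterations, tighten it
# after ell consecutive infeasible ones, and keep it unchanged otherwise.

def update_penalty(alpha_i, history, ell=3, lam1=2.0, lam2=3.0):
    recent = history[-ell:]              # feasibility flags of the last ell iterations
    if len(recent) == ell and all(recent):
        return alpha_i / lam1            # feasible throughout: decrease penalty
    if len(recent) == ell and not any(recent):
        return alpha_i * lam2            # infeasible throughout: increase penalty
    return alpha_i                       # mixed or too-short history: unchanged

print(update_penalty(4.0, [True, True, True]),
      update_penalty(4.0, [False, False, False]),
      update_penalty(4.0, [True, False, True]))
```

Keeping λ1 ≠ λ2 means a decrease followed by an increase (or vice versa) never returns the penalty to a previous value, which is how the method avoids cycling through the same penalty settings.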