Lagrange multiplier without implicit function theorem


Here is a proof of the Lagrange multiplier method from Calculus: Early Transcendentals by James Stewart (8th ed). Unlike every other "rigorous" proof I have seen, it does not rely on the Implicit Function Theorem. What piece is missing from this proof (which I guess is where the Implicit Function Theorem comes in) that would make it rigorous?

Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $\vec r(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $\vec r(t_0)=(x(t_0), y(t_0), z(t_0))$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = \nabla f(x_0, y_0, z_0) \cdot \vec r'(t_0)$$

This shows that the gradient vector $\nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $\vec r'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $\nabla g(x_0, y_0, z_0)$, is also orthogonal to $\vec r'(t_0)$ for every such curve. This means that the gradient vectors $\nabla f(x_0, y_0, z_0)$ and $\nabla g(x_0, y_0, z_0)$ must be parallel.

Alternatively, an even simpler proof from MIT OCW goes as follows:

Consider any unit vector $\hat u$ at the critical point that is tangent to the constraint surface. The directional derivative along $\hat u$ vanishes at the critical point, $D_{\hat u} f = \nabla f \cdot \hat u = 0$, so $\nabla f$ is perpendicular to any such $\hat u$. We know $\nabla g$ is perpendicular to the level curves of $g$, so $\nabla g$ is also perpendicular to any such $\hat u$, implying $\nabla f$ and $\nabla g$ are parallel.

What does introducing $\vec r(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?

edited Apr 8 at 19:44

asked Apr 8 at 19:13

dkv

calculus proof-verification alternative-proof lagrange-multiplier

2 Answers
The two proofs are equivalent (with slight non-consequential differences I will clarify later).

At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).

Let's use just two coordinates instead of three to make things easier to visualize:

We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time $t$ is given by $\vec r(t)$ in Stewart, which basically gives us the entire history of the hiker's movement. OCW only concerns itself with the hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. OCW also specifies that the hiker travels at unit speed, which is inconsequential here; Stewart doesn't specify the speed. So these are the slight differences.

Now, if we write out the derivative in OCW (making the location explicit as in Stewart), it's (evaluated at $t = 0$):

$$ \frac{d}{dt} f(\vec r(t_0)+\hat u\, t) $$

For Stewart, it's (evaluated at $t = t_0$):

$$ \frac{d}{dt} f(\vec r(t)) $$

In the first case, applying the chain rule, we get:

$$ \nabla f(\vec r(t_0)) \cdot \hat u $$

In the second case:

$$ \nabla f(\vec r(t_0)) \cdot \vec r'(t_0) $$

So, same conclusion.

Personally, I think Stewart's approach presents it in a more intuitive way (and painstakingly names every detail), so it is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is no difference in terms of rigor.
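As a rough numerical check of the two chain-rule expressions, here is a minimal sketch on an assumed toy problem that is not from either textbook: $f(x,y,z) = x + 2y + 3z$ on the unit sphere $g(x,y,z) = x^2+y^2+z^2-1 = 0$. The names `p0`, `u1`, `u2` and `central_difference` are made up for this illustration. It differentiates $f$ along a curve that stays on the sphere (Stewart's setup) and along the straight line $\vec r(t_0) + \hat u\, t$ (the OCW expression as written out above), and compares both with $\nabla f \cdot \hat u$.

```python
import numpy as np

# Assumed toy problem (not from the post): maximise f on the unit sphere g = 0.
def f(p):
    return p[0] + 2.0 * p[1] + 3.0 * p[2]

def g(p):
    return float(np.dot(p, p)) - 1.0

grad_f = np.array([1.0, 2.0, 3.0])    # gradient of f (constant, since f is linear)
p0 = grad_f / np.linalg.norm(grad_f)  # constrained maximiser of f on the sphere
grad_g = 2.0 * p0                     # gradient of g at p0

# Two orthonormal tangent directions at p0 (both orthogonal to p0).
u1 = np.cross(p0, [1.0, 0.0, 0.0])
u1 /= np.linalg.norm(u1)
u2 = np.cross(p0, u1)

def central_difference(h, eps=1e-6):
    """Numerical derivative of h at t = 0."""
    return (h(eps) - h(-eps)) / (2.0 * eps)

for u in (u1, u2):
    # Stewart: a curve that stays on the surface (a great circle through p0).
    on_surface = lambda t: np.cos(t) * p0 + np.sin(t) * u
    # OCW expression as written out above: the straight line p0 + t*u.
    straight = lambda t: p0 + t * u

    d_curve = central_difference(lambda t: f(on_surface(t)))
    d_line = central_difference(lambda t: f(straight(t)))
    print(d_curve, d_line, np.dot(grad_f, u))    # all three are close to 0

    # The great circle satisfies g = 0; the line only touches the sphere at t = 0.
    print(g(on_surface(0.1)), g(straight(0.1)))  # close to 0 versus close to 0.01

# Parallel gradients: the cross product is (numerically) zero.
print(np.cross(grad_f, grad_g))
```

In this example both derivatives agree with $\nabla f \cdot \hat u$, but only the great circle actually stays on the constraint surface; the straight line is merely tangent to it at $t = 0$.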

answered yesterday

Thinking Torus

The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
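To see what can go wrong without that guarantee, here is a standard example (not taken from this answer), written with the constraint as $f_1$ and the objective as $g$ to match the notation used below. Take

$$ f_1(x,y) = y^2 - x^3, \qquad g(x,y) = x. $$

On the constraint set $f_1 = 0$ we have $x^3 = y^2 \ge 0$, so $x \ge 0$, and $g$ attains its constrained minimum at the origin. But

$$ f_1'(0,0) = (-3x^2,\ 2y)\big|_{(0,0)} = (0,0), \qquad g'(0,0) = (1,0), $$

so there is no $\lambda$ with $g'(0,0) = \lambda\, f_1'(0,0)$. The two gradients are still linearly dependent, but the constraint set has a cusp at the origin instead of a smooth curve with a tangent line, so "tangent to the constraint" has no clear meaning there.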

For the Lagrange multiplier theorem itself, a weaker part of the IFT is enough; it follows directly from local surjectivity. Here the $f_i$ are the constraints and $g$ is the function being optimised.
If $a$ is a point such that $f_1(a)=\ldots=f_n(a)=0$ and the gradients $f_1',\dots,f_n',g'$ are linearly independent at $a$, then the map $(f_1,\ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,\ldots,0,g(a))$, so in every ball around $a$ there exist points $b,c$ such that
$f_1(b)=\ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=\ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',\dots,f_n',g'$ must be linearly dependent.
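The argument can be illustrated numerically. The following is only a sketch on an assumed toy problem (one constraint $f_1 = x^2+y^2+z^2-1$ and objective $g = x+2y+3z$, neither from the answer); the helpers `gradient_rank` and `step_on_sphere` are hypothetical names introduced here. At a point where the gradients are independent it exhibits nearby constraint points $b, c$ with $g(c) < g(a) < g(b)$, and at the constrained maximiser it shows the rank dropping.

```python
import numpy as np

# Assumed toy problem (not from the answer): one constraint f1 = 0 (unit sphere)
# and objective g, following the answer's naming.
def f1(p):
    return float(np.dot(p, p)) - 1.0

def g(p):
    return p[0] + 2.0 * p[1] + 3.0 * p[2]

def grad_f1(p):
    return 2.0 * np.asarray(p, dtype=float)

grad_g = np.array([1.0, 2.0, 3.0])  # gradient of g (constant, since g is linear)

def gradient_rank(p):
    """Rank of the matrix whose rows are f1'(p) and g'(p)."""
    rows = np.vstack([grad_f1(p), grad_g])
    return int(np.linalg.matrix_rank(rows, tol=1e-9))

def step_on_sphere(p, s):
    """Move from p along the tangential part of g', then project back onto f1 = 0."""
    n = grad_f1(p) / np.linalg.norm(grad_f1(p))
    tangential = grad_g - np.dot(grad_g, n) * n
    q = p + s * tangential
    return q / np.linalg.norm(q)

a = np.array([1.0, 0.0, 0.0])       # on the sphere, gradients independent here
b = step_on_sphere(a, 0.01)
c = step_on_sphere(a, -0.01)
print(gradient_rank(a))             # 2
print(g(c) < g(a) < g(b))           # True, so a is not a constrained extremum
print(abs(f1(b)) < 1e-12, abs(f1(c)) < 1e-12)

a_star = grad_g / np.linalg.norm(grad_g)  # constrained maximiser of g
print(gradient_rank(a_star))        # 1, the gradients are dependent
```

The step along the tangential component is just one convenient way to produce $b$ and $c$ for this particular constraint; the answer's argument obtains them from local surjectivity without constructing them.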

answered yesterday

user141614
