Lagrange multiplier without implicit function theorem Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)Implicit Function Theorem [Understanding theorem in book]Normal vectors and tangent planesTrying to understand Lagrange multipliersShowing that the gradient $nabla f(x)$ is parallel to constraint surface gradient $nabla g(x)$ at an extreme point on the surfaceIn Lagrange Multiplier, why level curves of $f$ and $g$ are tangent to each other?Lagrange multipliers and critical pointsSolve by using Lagrange Multiplier MethodGeneralized Lagrange Multiplier Theorem.Lagrange Multiplier do not make senseOptimality of Lagrange Multiplier
What to do with post with dry rot?
Can smartphones with the same camera sensor have different image quality?
I'm having difficulty getting my players to do stuff in a sandbox campaign
Why is there no army of Iron-Mans in the MCU?
Does a C shift expression have unsigned type? Why would Splint warn about a right-shift?
Failing to enforce immigration laws?
What are the performance impacts of 'functional' Rust?
Using "nakedly" instead of "with nothing on"
Strange behaviour of Check
Estimate capacitor parameters
Passing functions in C++
Two different pronunciation of "понял"
Problem when applying foreach loop
How do you clear the ApexPages.getMessages() collection in a test?
Direct Experience of Meditation
What's the difference between (size_t)-1 and ~0?
How to rotate it perfectly?
What would be Julian Assange's expected punishment, on the current English criminal law?
Can the prologue be the backstory of your main character?
How many spell slots should a Fighter 11/Ranger 9 have?
How to politely respond to generic emails requesting a PhD/job in my lab? Without wasting too much time
What is the largest species of polychaete?
What kind of display is this?
What is the electric potential inside a point charge?
Lagrange multiplier without implicit function theorem
Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)Implicit Function Theorem [Understanding theorem in book]Normal vectors and tangent planesTrying to understand Lagrange multipliersShowing that the gradient $nabla f(x)$ is parallel to constraint surface gradient $nabla g(x)$ at an extreme point on the surfaceIn Lagrange Multiplier, why level curves of $f$ and $g$ are tangent to each other?Lagrange multipliers and critical pointsSolve by using Lagrange Multiplier MethodGeneralized Lagrange Multiplier Theorem.Lagrange Multiplier do not make senseOptimality of Lagrange Multiplier
$begingroup$
Here is a proof of the Lagrange multiplier method from Calculus Early Transcendentals by James Stewart (8th ed). It does not rely on the Implicit Function Theorem like all other "rigorous" proofs seem to. What is the missing piece from this proof (which I guess relies on the Implicit Function Theorem) that would make this rigorous?
Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $vecr(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $vecr(t_0)=(x(t_0), y(t_0), z(t_0))$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = nabla f(x_0, y_0, z_0) cdot vecr'(t_0)$$
This shows that the gradient vector $nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $vecr'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $nabla g(x_0, y_0, z_0)$, is also orthogonal to $vecr'(t_0)$ for every such curve. This means that the gradient vectors $nabla f(x_0, y_0, z_0)$ and $nabla g(x_0, y_0, z_0)$ must be parallel.
Alternatively, an even simpler proof from MIT OCW goes as follows:
Consider any unit vector $hatu$ at the critical point that is tangent to the constraint surface. Then, since the directional derivative along $hatu$, $D_hatu f = nabla f cdot hatu = 0$ at the critical point so $nabla f$ is perpendicular to any such $hatu$. We know $nabla g$ is perpendicular to the level curves of $g$, so $nabla g$ is also perpendicular to any such $hatu$, implying $nabla f$ and $nabla g$ are parallel.
What does introducing $vecr(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?
calculus proof-verification alternative-proof lagrange-multiplier
$endgroup$
This question has an open bounty worth +50
reputation from dkv ending ending at 2019-04-18 16:04:46Z">in 4 days.
This question has not received enough attention.
add a comment |
$begingroup$
Here is a proof of the Lagrange multiplier method from Calculus Early Transcendentals by James Stewart (8th ed). It does not rely on the Implicit Function Theorem like all other "rigorous" proofs seem to. What is the missing piece from this proof (which I guess relies on the Implicit Function Theorem) that would make this rigorous?
Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $vecr(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $vecr(t_0)=(x(t_0), y(t_0), z(t_0))$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = nabla f(x_0, y_0, z_0) cdot vecr'(t_0)$$
This shows that the gradient vector $nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $vecr'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $nabla g(x_0, y_0, z_0)$, is also orthogonal to $vecr'(t_0)$ for every such curve. This means that the gradient vectors $nabla f(x_0, y_0, z_0)$ and $nabla g(x_0, y_0, z_0)$ must be parallel.
Alternatively, an even simpler proof from MIT OCW goes as follows:
Consider any unit vector $hatu$ at the critical point that is tangent to the constraint surface. Then, since the directional derivative along $hatu$, $D_hatu f = nabla f cdot hatu = 0$ at the critical point so $nabla f$ is perpendicular to any such $hatu$. We know $nabla g$ is perpendicular to the level curves of $g$, so $nabla g$ is also perpendicular to any such $hatu$, implying $nabla f$ and $nabla g$ are parallel.
What does introducing $vecr(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?
calculus proof-verification alternative-proof lagrange-multiplier
$endgroup$
This question has an open bounty worth +50
reputation from dkv ending ending at 2019-04-18 16:04:46Z">in 4 days.
This question has not received enough attention.
add a comment |
$begingroup$
Here is a proof of the Lagrange multiplier method from Calculus Early Transcendentals by James Stewart (8th ed). It does not rely on the Implicit Function Theorem like all other "rigorous" proofs seem to. What is the missing piece from this proof (which I guess relies on the Implicit Function Theorem) that would make this rigorous?
Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $vecr(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $vecr(t_0)=(x(t_0), y(t_0), z(t_0))$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = nabla f(x_0, y_0, z_0) cdot vecr'(t_0)$$
This shows that the gradient vector $nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $vecr'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $nabla g(x_0, y_0, z_0)$, is also orthogonal to $vecr'(t_0)$ for every such curve. This means that the gradient vectors $nabla f(x_0, y_0, z_0)$ and $nabla g(x_0, y_0, z_0)$ must be parallel.
Alternatively, an even simpler proof from MIT OCW goes as follows:
Consider any unit vector $hatu$ at the critical point that is tangent to the constraint surface. Then, since the directional derivative along $hatu$, $D_hatu f = nabla f cdot hatu = 0$ at the critical point so $nabla f$ is perpendicular to any such $hatu$. We know $nabla g$ is perpendicular to the level curves of $g$, so $nabla g$ is also perpendicular to any such $hatu$, implying $nabla f$ and $nabla g$ are parallel.
What does introducing $vecr(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?
calculus proof-verification alternative-proof lagrange-multiplier
$endgroup$
Here is a proof of the Lagrange multiplier method from Calculus Early Transcendentals by James Stewart (8th ed). It does not rely on the Implicit Function Theorem like all other "rigorous" proofs seem to. What is the missing piece from this proof (which I guess relies on the Implicit Function Theorem) that would make this rigorous?
Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $vecr(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $vecr(t_0)=(x(t_0), y(t_0), z(t_0))$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = nabla f(x_0, y_0, z_0) cdot vecr'(t_0)$$
This shows that the gradient vector $nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $vecr'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $nabla g(x_0, y_0, z_0)$, is also orthogonal to $vecr'(t_0)$ for every such curve. This means that the gradient vectors $nabla f(x_0, y_0, z_0)$ and $nabla g(x_0, y_0, z_0)$ must be parallel.
Alternatively, an even simpler proof from MIT OCW goes as follows:
Consider any unit vector $hatu$ at the critical point that is tangent to the constraint surface. Then, since the directional derivative along $hatu$, $D_hatu f = nabla f cdot hatu = 0$ at the critical point so $nabla f$ is perpendicular to any such $hatu$. We know $nabla g$ is perpendicular to the level curves of $g$, so $nabla g$ is also perpendicular to any such $hatu$, implying $nabla f$ and $nabla g$ are parallel.
What does introducing $vecr(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?
calculus proof-verification alternative-proof lagrange-multiplier
calculus proof-verification alternative-proof lagrange-multiplier
edited Apr 8 at 19:44
dkv
asked Apr 8 at 19:13
dkvdkv
898
898
This question has an open bounty worth +50
reputation from dkv ending ending at 2019-04-18 16:04:46Z">in 4 days.
This question has not received enough attention.
This question has an open bounty worth +50
reputation from dkv ending ending at 2019-04-18 16:04:46Z">in 4 days.
This question has not received enough attention.
add a comment |
add a comment |
2 Answers
2
active
oldest
votes
$begingroup$
The two proofs are equivalent (with slight non-consequential differences I will clarify later).
At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).
Let's use just two coordinates instead of three to make things easier to visualize:
We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time t is given by $vecr(t)$ in Steward (which basically gives us the entire history of the hiker's movement). OCW only concerns us with hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. The latter also specifies that the hiker travels at unit speed, which is inconsequential here. Steward doesn't specify the speed. So these are the slight differences.
Now, if we write out the derivative in OCW (making the location explicit as in Steward), it's (evaluated at 0):
$$ fracddt f(vecr(t_0)+hat u t) $$
For Steward, it's (evaluated at $t_0$):
$$ fracddt f(vecr(t))$$
In the first case, apply chain rule we get:
$$ nabla f(vecr(t_0)) cdot hat u$$
In the second case:
$$ nabla f(vecr(t_0)) cdot vecr'(t_0)$$
So, same conclusion.
Personally, I think Steward's approach presents it in a more intuitive way (and painstakingly names every detail), so is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is not any difference in terms of rigor.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
add a comment |
$begingroup$
The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
For the Lagrange Multipliers itself, a weaker part if the IFT is enough; it follows directly from the local surjectivity.
If $a$ is a point such that $f_1(a)=ldots=f_n(a)=0$ and the gradients $f_1',dots,f_n',g'$ are linearly independent, then the map $(f_1,ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,ldots,0,g(a))$, so in every ball around $a$, there exist points $b,c$ such that
$f_1(b)=ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',dots,f_n',g'$ must be linearly dependent.
$endgroup$
add a comment |
Your Answer
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3180069%2flagrange-multiplier-without-implicit-function-theorem%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
The two proofs are equivalent (with slight non-consequential differences I will clarify later).
At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).
Let's use just two coordinates instead of three to make things easier to visualize:
We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time t is given by $vecr(t)$ in Steward (which basically gives us the entire history of the hiker's movement). OCW only concerns us with hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. The latter also specifies that the hiker travels at unit speed, which is inconsequential here. Steward doesn't specify the speed. So these are the slight differences.
Now, if we write out the derivative in OCW (making the location explicit as in Steward), it's (evaluated at 0):
$$ fracddt f(vecr(t_0)+hat u t) $$
For Steward, it's (evaluated at $t_0$):
$$ fracddt f(vecr(t))$$
In the first case, apply chain rule we get:
$$ nabla f(vecr(t_0)) cdot hat u$$
In the second case:
$$ nabla f(vecr(t_0)) cdot vecr'(t_0)$$
So, same conclusion.
Personally, I think Steward's approach presents it in a more intuitive way (and painstakingly names every detail), so is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is not any difference in terms of rigor.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
add a comment |
$begingroup$
The two proofs are equivalent (with slight non-consequential differences I will clarify later).
At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).
Let's use just two coordinates instead of three to make things easier to visualize:
We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time t is given by $vecr(t)$ in Steward (which basically gives us the entire history of the hiker's movement). OCW only concerns us with hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. The latter also specifies that the hiker travels at unit speed, which is inconsequential here. Steward doesn't specify the speed. So these are the slight differences.
Now, if we write out the derivative in OCW (making the location explicit as in Steward), it's (evaluated at 0):
$$ fracddt f(vecr(t_0)+hat u t) $$
For Steward, it's (evaluated at $t_0$):
$$ fracddt f(vecr(t))$$
In the first case, apply chain rule we get:
$$ nabla f(vecr(t_0)) cdot hat u$$
In the second case:
$$ nabla f(vecr(t_0)) cdot vecr'(t_0)$$
So, same conclusion.
Personally, I think Steward's approach presents it in a more intuitive way (and painstakingly names every detail), so is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is not any difference in terms of rigor.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
add a comment |
$begingroup$
The two proofs are equivalent (with slight non-consequential differences I will clarify later).
At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).
Let's use just two coordinates instead of three to make things easier to visualize:
We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time t is given by $vecr(t)$ in Steward (which basically gives us the entire history of the hiker's movement). OCW only concerns us with hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. The latter also specifies that the hiker travels at unit speed, which is inconsequential here. Steward doesn't specify the speed. So these are the slight differences.
Now, if we write out the derivative in OCW (making the location explicit as in Steward), it's (evaluated at 0):
$$ fracddt f(vecr(t_0)+hat u t) $$
For Steward, it's (evaluated at $t_0$):
$$ fracddt f(vecr(t))$$
In the first case, apply chain rule we get:
$$ nabla f(vecr(t_0)) cdot hat u$$
In the second case:
$$ nabla f(vecr(t_0)) cdot vecr'(t_0)$$
So, same conclusion.
Personally, I think Steward's approach presents it in a more intuitive way (and painstakingly names every detail), so is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is not any difference in terms of rigor.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
$endgroup$
The two proofs are equivalent (with slight non-consequential differences I will clarify later).
At this level, it's helpful to borrow some intuition from physics (after all that's where calculus came from).
Let's use just two coordinates instead of three to make things easier to visualize:
We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. A hiker's horizontal location (horizontal since we are not using $z$) at any time t is given by $vecr(t)$ in Steward (which basically gives us the entire history of the hiker's movement). OCW only concerns us with hiker's movement near the extremum (and doesn't bother making it explicit), since elsewhere it's irrelevant. The latter also specifies that the hiker travels at unit speed, which is inconsequential here. Steward doesn't specify the speed. So these are the slight differences.
Now, if we write out the derivative in OCW (making the location explicit as in Steward), it's (evaluated at 0):
$$ fracddt f(vecr(t_0)+hat u t) $$
For Steward, it's (evaluated at $t_0$):
$$ fracddt f(vecr(t))$$
In the first case, apply chain rule we get:
$$ nabla f(vecr(t_0)) cdot hat u$$
In the second case:
$$ nabla f(vecr(t_0)) cdot vecr'(t_0)$$
So, same conclusion.
Personally, I think Steward's approach presents it in a more intuitive way (and painstakingly names every detail), so is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is not any difference in terms of rigor.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
answered yesterday
Thinking TorusThinking Torus
1605
1605
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
Thinking Torus is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
add a comment |
add a comment |
$begingroup$
The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
For the Lagrange Multipliers itself, a weaker part if the IFT is enough; it follows directly from the local surjectivity.
If $a$ is a point such that $f_1(a)=ldots=f_n(a)=0$ and the gradients $f_1',dots,f_n',g'$ are linearly independent, then the map $(f_1,ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,ldots,0,g(a))$, so in every ball around $a$, there exist points $b,c$ such that
$f_1(b)=ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',dots,f_n',g'$ must be linearly dependent.
$endgroup$
add a comment |
$begingroup$
The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
For the Lagrange Multipliers itself, a weaker part if the IFT is enough; it follows directly from the local surjectivity.
If $a$ is a point such that $f_1(a)=ldots=f_n(a)=0$ and the gradients $f_1',dots,f_n',g'$ are linearly independent, then the map $(f_1,ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,ldots,0,g(a))$, so in every ball around $a$, there exist points $b,c$ such that
$f_1(b)=ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',dots,f_n',g'$ must be linearly dependent.
$endgroup$
add a comment |
$begingroup$
The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
For the Lagrange Multipliers itself, a weaker part if the IFT is enough; it follows directly from the local surjectivity.
If $a$ is a point such that $f_1(a)=ldots=f_n(a)=0$ and the gradients $f_1',dots,f_n',g'$ are linearly independent, then the map $(f_1,ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,ldots,0,g(a))$, so in every ball around $a$, there exist points $b,c$ such that
$f_1(b)=ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',dots,f_n',g'$ must be linearly dependent.
$endgroup$
The point where you really require the implicit function theorem is when you start talking about "constraint surface" and "tangents". How can you know that your constraints locally determine some smooth surface?
For the Lagrange Multipliers itself, a weaker part if the IFT is enough; it follows directly from the local surjectivity.
If $a$ is a point such that $f_1(a)=ldots=f_n(a)=0$ and the gradients $f_1',dots,f_n',g'$ are linearly independent, then the map $(f_1,ldots,f_n,g)$ maps every ball around $a$ to a neighbourhood of $(0,ldots,0,g(a))$, so in every ball around $a$, there exist points $b,c$ such that
$f_1(b)=ldots=f_n(b)=0$ and $g(b)>g(a)$, and
$f_1(c)=ldots=f_n(c)=0$ and $g(c)<g(a)$; this proves that
there cannot be any local constrained extremum at $a$.
Hence, at all constrained extremal points, the gradients $f_1',dots,f_n',g'$ must be linearly dependent.
answered yesterday
user141614user141614
12.3k1025
12.3k1025
add a comment |
add a comment |
Thanks for contributing an answer to Mathematics Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3180069%2flagrange-multiplier-without-implicit-function-theorem%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown