Lagrange multiplier without implicit function theorem



Here is a proof of the Lagrange multiplier method from Calculus: Early Transcendentals by James Stewart (8th ed). It does not rely on the Implicit Function Theorem, as all other "rigorous" proofs seem to. What is the missing piece (which I guess relies on the Implicit Function Theorem) that would make this proof rigorous?




Suppose that a function $f$ has an extreme value at a point $(x_0, y_0, z_0)$ on the surface $S$ and let $C$ be a curve with vector equation $\vec{r}(t)=(x(t), y(t), z(t))$ that lies on $S$ and passes through $(x_0, y_0, z_0)$. If $t_0$ is the parameter value corresponding to the point $(x_0, y_0, z_0)$, then $\vec{r}(t_0)=(x_0, y_0, z_0)$. The composite function $h(t)=f(x(t), y(t), z(t))$ represents the values that $f$ takes on the curve $C$. Since $f$ has an extreme value at $(x_0, y_0, z_0)$, it follows that $h$ has an extreme value at $t_0$, so $h'(t_0) = 0$. But if $f$ is differentiable, we can use the Chain Rule to write $$0 = h'(t_0) = \nabla f(x_0, y_0, z_0) \cdot \vec{r}'(t_0)$$



This shows that the gradient vector $\nabla f(x_0, y_0, z_0)$ is orthogonal to the tangent vector $\vec{r}'(t_0)$ to every such curve $C$. We know that the gradient of $g$, $\nabla g(x_0, y_0, z_0)$, is also orthogonal to $\vec{r}'(t_0)$ for every such curve. This means that the gradient vectors $\nabla f(x_0, y_0, z_0)$ and $\nabla g(x_0, y_0, z_0)$ must be parallel.
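
To make the orthogonality statement concrete, here is a small numerical sketch. Everything specific in it is an illustrative assumption rather than part of Stewart's text: the objective $f(x,y,z)=x+2y+3z$, the unit sphere as the surface $S$ (so $g=x^2+y^2+z^2$), the great-circle curves through the maximiser $(1,2,3)/\sqrt{14}$, and all helper names.

```python
import math

def f(p):
    # Illustrative objective (an assumption): f(x, y, z) = x + 2y + 3z
    x, y, z = p
    return x + 2.0 * y + 3.0 * z

def grad_f(p):
    return (1.0, 2.0, 3.0)

def grad_g(p):
    # Illustrative constraint (an assumption): g(x, y, z) = x^2 + y^2 + z^2 = 1
    x, y, z = p
    return (2.0 * x, 2.0 * y, 2.0 * z)

def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

norm = math.sqrt(14.0)
p0 = (1.0 / norm, 2.0 / norm, 3.0 / norm)  # maximiser of f on the unit sphere

def unit_tangent(w):
    # Project w onto the tangent plane of the sphere at p0 and normalise.
    c = dot(w, p0)
    v = tuple(wi - c * qi for wi, qi in zip(w, p0))
    length = math.sqrt(dot(v, v))
    return tuple(vi / length for vi in v)

def great_circle(v):
    # Curve r(t) = cos(t) p0 + sin(t) v lies on the sphere, r(0) = p0, r'(0) = v.
    def r(t):
        return tuple(math.cos(t) * a + math.sin(t) * b for a, b in zip(p0, v))
    return r

def h_prime(curve, t0=0.0, eps=1e-6):
    # Central-difference estimate of h'(t0), where h(t) = f(r(t)).
    return (f(curve(t0 + eps)) - f(curve(t0 - eps))) / (2.0 * eps)

directions = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0), (1.0, -1.0, 2.0)]
tangents = [unit_tangent(w) for w in directions]

h_primes = [h_prime(great_circle(v)) for v in tangents]   # h'(t0) along each curve
grad_dots = [dot(grad_f(p0), v) for v in tangents]        # grad f . r'(t0)
parallel_residual = max(abs(c) for c in cross(grad_f(p0), grad_g(p0)))

print(h_primes)
print(grad_dots)
print(parallel_residual)
```

In this example the curves exist because the sphere is visibly a smooth surface. The sketch checks the conclusion of the argument in one case; it does not show that enough such curves exist for a general constraint.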





      Alternatively, an even simpler proof from MIT OCW goes as follows:




Consider any unit vector $\hat{u}$ at the critical point that is tangent to the constraint surface. Then the directional derivative along $\hat{u}$ satisfies $D_{\hat{u}} f = \nabla f \cdot \hat{u} = 0$ at the critical point, so $\nabla f$ is perpendicular to any such $\hat{u}$. We know $\nabla g$ is perpendicular to the level curves of $g$, so $\nabla g$ is also perpendicular to any such $\hat{u}$, implying $\nabla f$ and $\nabla g$ are parallel.




What does introducing $\vec{r}(t)$ in the Stewart proof give us over this one? And, again, what is the piece here that needs to be shown more rigorously (presumably using the Implicit Function Theorem)?







      calculus proof-verification alternative-proof lagrange-multiplier






edited Apr 8 at 19:44

asked Apr 8 at 19:13 by dkv






This question has an open bounty worth +50 reputation from dkv, ending at 2019-04-18 16:04:46Z.


      This question has not received enough attention.






























          2 Answers
The two proofs are equivalent (with slight, inconsequential differences that I clarify below).

At this level, it's helpful to borrow some intuition from physics (after all, that's where calculus came from).

Let's use just two coordinates instead of three to make things easier to visualize:

We have a hill, and $f(x,y)$ is the height of the hill at $(x,y)$. In Stewart, the hiker's horizontal location (horizontal since we are not using $z$) at any time $t$ is given by $\vec{r}(t)$, which gives us the entire history of the hiker's movement. OCW only concerns itself with the hiker's movement near the extremum (and doesn't bother making the path explicit), since the path elsewhere is irrelevant. OCW also specifies that the hiker travels at unit speed, which is inconsequential here; Stewart doesn't specify the speed. These are the slight differences.



Now, if we write out the derivative in OCW (making the location explicit as in Stewart), it is (evaluated at $t = 0$):

$$ \frac{d}{dt} f(\vec{r}(t_0)+\hat{u} t) $$

For Stewart, it is (evaluated at $t = t_0$):

$$ \frac{d}{dt} f(\vec{r}(t))$$

In the first case, applying the chain rule, we get:

$$ \nabla f(\vec{r}(t_0)) \cdot \hat{u}$$

In the second case:

$$ \nabla f(\vec{r}(t_0)) \cdot \vec{r}'(t_0)$$



                  So, same conclusion.
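
The agreement of the two derivatives can be checked numerically. The sketch below is only an illustration under assumed choices: the height function `f`, the unit-circle path `r`, the parameter value `t0 = 0.7`, and the helper `central_diff` are all made up for the example, and $\hat{u}$ is taken to be $\vec{r}'(t_0)$, which has unit length for this path.

```python
import math

def f(x, y):
    # Arbitrary smooth "hill" (an assumption for illustration).
    return math.exp(-x) * math.sin(y) + x * y

def grad_f(x, y):
    return (-math.exp(-x) * math.sin(y) + y,
            math.exp(-x) * math.cos(y) + x)

def r(t):
    # Hiker's path: the unit circle, traversed at unit speed.
    return (math.cos(t), math.sin(t))

def r_prime(t):
    return (-math.sin(t), math.cos(t))

def central_diff(phi, s, eps=1e-6):
    # Central-difference estimate of phi'(s).
    return (phi(s + eps) - phi(s - eps)) / (2.0 * eps)

t0 = 0.7
x0, y0 = r(t0)
ux, uy = r_prime(t0)  # unit tangent, playing the role of u-hat

# OCW form: d/dt f(r(t0) + u t), evaluated at t = 0.
ocw = central_diff(lambda s: f(x0 + ux * s, y0 + uy * s), 0.0)

# Stewart form: d/dt f(r(t)), evaluated at t = t0.
stewart = central_diff(lambda t: f(*r(t)), t0)

# Chain-rule value: grad f(r(t0)) . r'(t0).
gx, gy = grad_f(x0, y0)
chain_rule = gx * ux + gy * uy

print(ocw, stewart, chain_rule)
```

The point `r(t0)` here is not an extremum, so the common value is generally nonzero. The identity between the two forms holds at every point of the path, and both vanish together at a constrained extremum.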



Personally, I think Stewart's approach presents it in a more intuitive way (and painstakingly names every detail), so it is easier for beginners to understand. OCW's approach is more pragmatic, and you will be using that kind of notation later on. There is no difference in terms of rigor.
















answered yesterday by Thinking Torus































                          answered yesterday by user141614


























