Characterize the critical points of $Vert Ax - b Vert^{2}$












0












$begingroup$



Let $A$ a $m times n$ matrix, $b$ a $m times 1$ matrix and $x$ a $n times 1$ matrix. Consider $f: mathbb{R}^n to mathbb{R}$ defined by
$$f(x) = Vert Ax - b Vert^2.$$
Determine a condition that characterizes the critical points of $f$.




My attempt. Note that
$$Vert Ax - b Vert^{2} = (Ax - b)^{T}(Ax - b) = (x^{T}A^{T} - b^{T})(Ax - b) = x^{T}A^{T}Ax - x^{T}A^{T}b - b^{T}Ax + b^{T}b.$$
Thus,
$$frac{partial}{partial x_{i}}f(x) = 2A^{T}Ax_{i} - A^{T}b - color{red}{b^{T}A}.$$
But I know that $nabla f(x) = 2A^{T}Ax - 2A^{T}b$, so "$b^{T}A$ should be $A^{T}b$". How do I correct this?



Assuming that $nabla f(x) = 2A^{T}Ax - 2A^{T}b = 2A^{T}(Ax - b)$, the critical points of $f$ are the solutions of $Ax - b$. I dont understand what it means to characterize the critical points, but I suppose it's about maximum and minimum.



Let $H: mathbb{R}^n to mathbb{R}$ a quadratic form given by $Hv^{2} = sum h_{ij}alpha_{i}alpha_{j}$. The quadric form of $H$ is positive when $Hv^{2} > 0$ for all $v neq 0$ in $mathbb{R}^n$ and is negative when $Hv^{2} < 0$ for all $v neq 0$ in $mathbb{R}^n$, otherwise we say that $Hv^{2}$ is undefined.




Theorem. Let $f: U to mathbb{R}$ a $C^{2}$ function, $p in U$ a critical point of $f$ and $H$ the Hessian quadratic form of $f$ in $p$. Then



(i) If $H$ is positive, $p$ is a local minimum point (non-degenerated).



(ii) If $H$ is negative, $p$ is a maximum local point (non-degenerated).



(iii) If $H$ is undefined, $p$ is not a local maximum or minimum point.




The Hessian of $f$ is $nabla^{2} f(x) = 2A^{T}A = 2Vert A Vert^2$. But, the Hessian is independent of the point and $2Vert A Vert^2$ is non-negative. So the quadratic form is always non-negative? This is the first one I try about Hessian and critical points so, I dont have much practice.



I appreciate any help!










share|cite|improve this question









$endgroup$

















    0












    $begingroup$



    Let $A$ a $m times n$ matrix, $b$ a $m times 1$ matrix and $x$ a $n times 1$ matrix. Consider $f: mathbb{R}^n to mathbb{R}$ defined by
    $$f(x) = Vert Ax - b Vert^2.$$
    Determine a condition that characterizes the critical points of $f$.




    My attempt. Note that
    $$Vert Ax - b Vert^{2} = (Ax - b)^{T}(Ax - b) = (x^{T}A^{T} - b^{T})(Ax - b) = x^{T}A^{T}Ax - x^{T}A^{T}b - b^{T}Ax + b^{T}b.$$
    Thus,
    $$frac{partial}{partial x_{i}}f(x) = 2A^{T}Ax_{i} - A^{T}b - color{red}{b^{T}A}.$$
    But I know that $nabla f(x) = 2A^{T}Ax - 2A^{T}b$, so "$b^{T}A$ should be $A^{T}b$". How do I correct this?



    Assuming that $nabla f(x) = 2A^{T}Ax - 2A^{T}b = 2A^{T}(Ax - b)$, the critical points of $f$ are the solutions of $Ax - b$. I dont understand what it means to characterize the critical points, but I suppose it's about maximum and minimum.



    Let $H: mathbb{R}^n to mathbb{R}$ a quadratic form given by $Hv^{2} = sum h_{ij}alpha_{i}alpha_{j}$. The quadric form of $H$ is positive when $Hv^{2} > 0$ for all $v neq 0$ in $mathbb{R}^n$ and is negative when $Hv^{2} < 0$ for all $v neq 0$ in $mathbb{R}^n$, otherwise we say that $Hv^{2}$ is undefined.




    Theorem. Let $f: U to mathbb{R}$ a $C^{2}$ function, $p in U$ a critical point of $f$ and $H$ the Hessian quadratic form of $f$ in $p$. Then



    (i) If $H$ is positive, $p$ is a local minimum point (non-degenerated).



    (ii) If $H$ is negative, $p$ is a maximum local point (non-degenerated).



    (iii) If $H$ is undefined, $p$ is not a local maximum or minimum point.




    The Hessian of $f$ is $nabla^{2} f(x) = 2A^{T}A = 2Vert A Vert^2$. But, the Hessian is independent of the point and $2Vert A Vert^2$ is non-negative. So the quadratic form is always non-negative? This is the first one I try about Hessian and critical points so, I dont have much practice.



    I appreciate any help!










    share|cite|improve this question









    $endgroup$















      0












      0








      0





      $begingroup$



      Let $A$ a $m times n$ matrix, $b$ a $m times 1$ matrix and $x$ a $n times 1$ matrix. Consider $f: mathbb{R}^n to mathbb{R}$ defined by
      $$f(x) = Vert Ax - b Vert^2.$$
      Determine a condition that characterizes the critical points of $f$.




      My attempt. Note that
      $$Vert Ax - b Vert^{2} = (Ax - b)^{T}(Ax - b) = (x^{T}A^{T} - b^{T})(Ax - b) = x^{T}A^{T}Ax - x^{T}A^{T}b - b^{T}Ax + b^{T}b.$$
      Thus,
      $$frac{partial}{partial x_{i}}f(x) = 2A^{T}Ax_{i} - A^{T}b - color{red}{b^{T}A}.$$
      But I know that $nabla f(x) = 2A^{T}Ax - 2A^{T}b$, so "$b^{T}A$ should be $A^{T}b$". How do I correct this?



      Assuming that $nabla f(x) = 2A^{T}Ax - 2A^{T}b = 2A^{T}(Ax - b)$, the critical points of $f$ are the solutions of $Ax - b$. I dont understand what it means to characterize the critical points, but I suppose it's about maximum and minimum.



      Let $H: mathbb{R}^n to mathbb{R}$ a quadratic form given by $Hv^{2} = sum h_{ij}alpha_{i}alpha_{j}$. The quadric form of $H$ is positive when $Hv^{2} > 0$ for all $v neq 0$ in $mathbb{R}^n$ and is negative when $Hv^{2} < 0$ for all $v neq 0$ in $mathbb{R}^n$, otherwise we say that $Hv^{2}$ is undefined.




      Theorem. Let $f: U to mathbb{R}$ a $C^{2}$ function, $p in U$ a critical point of $f$ and $H$ the Hessian quadratic form of $f$ in $p$. Then



      (i) If $H$ is positive, $p$ is a local minimum point (non-degenerated).



      (ii) If $H$ is negative, $p$ is a maximum local point (non-degenerated).



      (iii) If $H$ is undefined, $p$ is not a local maximum or minimum point.




      The Hessian of $f$ is $nabla^{2} f(x) = 2A^{T}A = 2Vert A Vert^2$. But, the Hessian is independent of the point and $2Vert A Vert^2$ is non-negative. So the quadratic form is always non-negative? This is the first one I try about Hessian and critical points so, I dont have much practice.



      I appreciate any help!










      share|cite|improve this question









      $endgroup$





      Let $A$ a $m times n$ matrix, $b$ a $m times 1$ matrix and $x$ a $n times 1$ matrix. Consider $f: mathbb{R}^n to mathbb{R}$ defined by
      $$f(x) = Vert Ax - b Vert^2.$$
      Determine a condition that characterizes the critical points of $f$.




      My attempt. Note that
      $$Vert Ax - b Vert^{2} = (Ax - b)^{T}(Ax - b) = (x^{T}A^{T} - b^{T})(Ax - b) = x^{T}A^{T}Ax - x^{T}A^{T}b - b^{T}Ax + b^{T}b.$$
      Thus,
      $$frac{partial}{partial x_{i}}f(x) = 2A^{T}Ax_{i} - A^{T}b - color{red}{b^{T}A}.$$
      But I know that $nabla f(x) = 2A^{T}Ax - 2A^{T}b$, so "$b^{T}A$ should be $A^{T}b$". How do I correct this?



      Assuming that $nabla f(x) = 2A^{T}Ax - 2A^{T}b = 2A^{T}(Ax - b)$, the critical points of $f$ are the solutions of $Ax - b$. I dont understand what it means to characterize the critical points, but I suppose it's about maximum and minimum.



      Let $H: mathbb{R}^n to mathbb{R}$ a quadratic form given by $Hv^{2} = sum h_{ij}alpha_{i}alpha_{j}$. The quadric form of $H$ is positive when $Hv^{2} > 0$ for all $v neq 0$ in $mathbb{R}^n$ and is negative when $Hv^{2} < 0$ for all $v neq 0$ in $mathbb{R}^n$, otherwise we say that $Hv^{2}$ is undefined.




      Theorem. Let $f: U to mathbb{R}$ a $C^{2}$ function, $p in U$ a critical point of $f$ and $H$ the Hessian quadratic form of $f$ in $p$. Then



      (i) If $H$ is positive, $p$ is a local minimum point (non-degenerated).



      (ii) If $H$ is negative, $p$ is a maximum local point (non-degenerated).



      (iii) If $H$ is undefined, $p$ is not a local maximum or minimum point.




      The Hessian of $f$ is $nabla^{2} f(x) = 2A^{T}A = 2Vert A Vert^2$. But, the Hessian is independent of the point and $2Vert A Vert^2$ is non-negative. So the quadratic form is always non-negative? This is the first one I try about Hessian and critical points so, I dont have much practice.



      I appreciate any help!







      real-analysis derivatives hessian-matrix






      share|cite|improve this question













      share|cite|improve this question











      share|cite|improve this question




      share|cite|improve this question










      asked Jan 20 at 16:49









      Lucas CorrêaLucas Corrêa

      1,6181321




      1,6181321






















          1 Answer
          1






          active

          oldest

          votes


















          1












          $begingroup$

          The partial derivative should be
          $$frac{partial}{partial x_i} f(x) = 2 A^top A x_i - (A^top b)_i - (A^top b)_i$$
          where $(A^top b)_i$ denotes the $i$th component of the vector $A^top b$.





          "Determine a condition that characterizes the critical points" is simply asking for the condition $A^top (Ax - b) = 0$. That's it! :) [That is, if someone asked you "how can you tell me if this $x$ is a critical point?" you can check whether $A^top (Ax - b) = 0$ holds or not.]



          More importantly, the above condition is not equivalent to $Ax-b=0$. If $Ax-b=0$ it is then true that $A^top (Ax-b) = 0$, but the converse may not hold, in particular if $A^top$ has a nontrivial nullspace.





          I would not write $A^top A = |A|^2$ since $A$ is a matrix, not a vector.
          But indeed the Hessian is $H = 2 A^top A$ which is a positive semi-definite matrix, i.e. it satisfies $v^top H v ge 0$ for any vector $v$. Thus any critical point is a local minimum.






          share|cite|improve this answer









          $endgroup$













          • $begingroup$
            Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 17:25






          • 1




            $begingroup$
            @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
            $endgroup$
            – angryavian
            Jan 20 at 17:51










          • $begingroup$
            I got it! Thank you!
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 18:36











          Your Answer





          StackExchange.ifUsing("editor", function () {
          return StackExchange.using("mathjaxEditing", function () {
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          });
          });
          }, "mathjax-editing");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "69"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3080810%2fcharacterize-the-critical-points-of-vert-ax-b-vert2%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          1












          $begingroup$

          The partial derivative should be
          $$frac{partial}{partial x_i} f(x) = 2 A^top A x_i - (A^top b)_i - (A^top b)_i$$
          where $(A^top b)_i$ denotes the $i$th component of the vector $A^top b$.





          "Determine a condition that characterizes the critical points" is simply asking for the condition $A^top (Ax - b) = 0$. That's it! :) [That is, if someone asked you "how can you tell me if this $x$ is a critical point?" you can check whether $A^top (Ax - b) = 0$ holds or not.]



          More importantly, the above condition is not equivalent to $Ax-b=0$. If $Ax-b=0$ it is then true that $A^top (Ax-b) = 0$, but the converse may not hold, in particular if $A^top$ has a nontrivial nullspace.





          I would not write $A^top A = |A|^2$ since $A$ is a matrix, not a vector.
          But indeed the Hessian is $H = 2 A^top A$ which is a positive semi-definite matrix, i.e. it satisfies $v^top H v ge 0$ for any vector $v$. Thus any critical point is a local minimum.






          share|cite|improve this answer









          $endgroup$













          • $begingroup$
            Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 17:25






          • 1




            $begingroup$
            @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
            $endgroup$
            – angryavian
            Jan 20 at 17:51










          • $begingroup$
            I got it! Thank you!
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 18:36
















          1












          $begingroup$

          The partial derivative should be
          $$frac{partial}{partial x_i} f(x) = 2 A^top A x_i - (A^top b)_i - (A^top b)_i$$
          where $(A^top b)_i$ denotes the $i$th component of the vector $A^top b$.





          "Determine a condition that characterizes the critical points" is simply asking for the condition $A^top (Ax - b) = 0$. That's it! :) [That is, if someone asked you "how can you tell me if this $x$ is a critical point?" you can check whether $A^top (Ax - b) = 0$ holds or not.]



          More importantly, the above condition is not equivalent to $Ax-b=0$. If $Ax-b=0$ it is then true that $A^top (Ax-b) = 0$, but the converse may not hold, in particular if $A^top$ has a nontrivial nullspace.





          I would not write $A^top A = |A|^2$ since $A$ is a matrix, not a vector.
          But indeed the Hessian is $H = 2 A^top A$ which is a positive semi-definite matrix, i.e. it satisfies $v^top H v ge 0$ for any vector $v$. Thus any critical point is a local minimum.






          share|cite|improve this answer









          $endgroup$













          • $begingroup$
            Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 17:25






          • 1




            $begingroup$
            @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
            $endgroup$
            – angryavian
            Jan 20 at 17:51










          • $begingroup$
            I got it! Thank you!
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 18:36














          1












          1








          1





          $begingroup$

          The partial derivative should be
          $$frac{partial}{partial x_i} f(x) = 2 A^top A x_i - (A^top b)_i - (A^top b)_i$$
          where $(A^top b)_i$ denotes the $i$th component of the vector $A^top b$.





          "Determine a condition that characterizes the critical points" is simply asking for the condition $A^top (Ax - b) = 0$. That's it! :) [That is, if someone asked you "how can you tell me if this $x$ is a critical point?" you can check whether $A^top (Ax - b) = 0$ holds or not.]



          More importantly, the above condition is not equivalent to $Ax-b=0$. If $Ax-b=0$ it is then true that $A^top (Ax-b) = 0$, but the converse may not hold, in particular if $A^top$ has a nontrivial nullspace.





          I would not write $A^top A = |A|^2$ since $A$ is a matrix, not a vector.
          But indeed the Hessian is $H = 2 A^top A$ which is a positive semi-definite matrix, i.e. it satisfies $v^top H v ge 0$ for any vector $v$. Thus any critical point is a local minimum.






          share|cite|improve this answer









          $endgroup$



          The partial derivative should be
          $$frac{partial}{partial x_i} f(x) = 2 A^top A x_i - (A^top b)_i - (A^top b)_i$$
          where $(A^top b)_i$ denotes the $i$th component of the vector $A^top b$.





          "Determine a condition that characterizes the critical points" is simply asking for the condition $A^top (Ax - b) = 0$. That's it! :) [That is, if someone asked you "how can you tell me if this $x$ is a critical point?" you can check whether $A^top (Ax - b) = 0$ holds or not.]



          More importantly, the above condition is not equivalent to $Ax-b=0$. If $Ax-b=0$ it is then true that $A^top (Ax-b) = 0$, but the converse may not hold, in particular if $A^top$ has a nontrivial nullspace.





          I would not write $A^top A = |A|^2$ since $A$ is a matrix, not a vector.
          But indeed the Hessian is $H = 2 A^top A$ which is a positive semi-definite matrix, i.e. it satisfies $v^top H v ge 0$ for any vector $v$. Thus any critical point is a local minimum.







          share|cite|improve this answer












          share|cite|improve this answer



          share|cite|improve this answer










          answered Jan 20 at 17:09









          angryavianangryavian

          41.9k23381




          41.9k23381












          • $begingroup$
            Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 17:25






          • 1




            $begingroup$
            @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
            $endgroup$
            – angryavian
            Jan 20 at 17:51










          • $begingroup$
            I got it! Thank you!
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 18:36


















          • $begingroup$
            Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 17:25






          • 1




            $begingroup$
            @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
            $endgroup$
            – angryavian
            Jan 20 at 17:51










          • $begingroup$
            I got it! Thank you!
            $endgroup$
            – Lucas Corrêa
            Jan 20 at 18:36
















          $begingroup$
          Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
          $endgroup$
          – Lucas Corrêa
          Jan 20 at 17:25




          $begingroup$
          Its a nice answer! Can you explain a little more the partial derivative, precisely the last term?
          $endgroup$
          – Lucas Corrêa
          Jan 20 at 17:25




          1




          1




          $begingroup$
          @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
          $endgroup$
          – angryavian
          Jan 20 at 17:51




          $begingroup$
          @LucasCorrêa It may be clearer for you to simply write $x^top A^top b = b^top A x = sum_j sum_k A_{jk} b_j x_k$ and take the partial derivative.
          $endgroup$
          – angryavian
          Jan 20 at 17:51












          $begingroup$
          I got it! Thank you!
          $endgroup$
          – Lucas Corrêa
          Jan 20 at 18:36




          $begingroup$
          I got it! Thank you!
          $endgroup$
          – Lucas Corrêa
          Jan 20 at 18:36


















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Mathematics Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3080810%2fcharacterize-the-critical-points-of-vert-ax-b-vert2%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          MongoDB - Not Authorized To Execute Command

          Setting WPF background Opacity programmatically

          in spring boot 2.1 many test slices are not allowed anymore due to multiple @BootstrapWith