What is the motivation of defining weak derivative as it is?












3












$begingroup$


I've been reading lately about reproducing kernel Hilbert spaces (RKHS) and Gaussian processes (GP) and during my studies I came across with the concept of weak derivative and Sobolev spaces. I have tried to get some sense from weak derivative by reading these two questions at the site:



What is the intuition behind a function being 'weakly differentiable'?



Intuition about weakly differentiable functions



So I have learned now that:




A function $u:Omegarightarrowmathbb{R}$ is weakly differentiable
with $alpha$th ($alphainmathbb{N}^n$) weak partial derivative
$v=D^{alpha}u$, if for all compactly supported smooth functions
$phi$ it holds that:



$$int_{Omega} u ,D^{alpha}phi,dx =
(-1)^{|alpha|}int_{Omega}vphi,dx.$$




Quoting earlier answers (by user Christopher A. Wong) from the links I provided above:




The basic intuition is that a weakly differentiable function looks
differentiable except for on sets of zero measure. This allows
functions that are not normally considered differentiable at "corners"
to have a weak derivative that is defined everywhere on the original
function's domain. The reason why weak derivatives ignore sets of zero
measure is precisely because weak derivatives are defined by
integrals, and integrals cannot see behavior on sets of zero measure.




Now I'm almost satisfied with this answer, but one unclarity remains:



Why is the weak derivative defined as it is? That is, when the idea of weak derivative was first considered, whoever it was, why did he select that particular equation above as the definition? Or was this definition a somewhat arbitrary choice, which simply suited to our purposes?



For example, one of the reasons of using squared error in many problems of statistics or optimization is (as I've understood) because the squared error has nice analytical properties, making the math easy, when compared with e.g. absolute error.



The reason why I'm asking this, is that sometimes it seems many techniques of mathematics rise like from a magicians hat. For a beginner like myself, it is difficult to picture the scenario and conditions that gave rise to that discovery or definition.










share|cite|improve this question











$endgroup$

















    3












    $begingroup$


    I've been reading lately about reproducing kernel Hilbert spaces (RKHS) and Gaussian processes (GP) and during my studies I came across with the concept of weak derivative and Sobolev spaces. I have tried to get some sense from weak derivative by reading these two questions at the site:



    What is the intuition behind a function being 'weakly differentiable'?



    Intuition about weakly differentiable functions



    So I have learned now that:




    A function $u:Omegarightarrowmathbb{R}$ is weakly differentiable
    with $alpha$th ($alphainmathbb{N}^n$) weak partial derivative
    $v=D^{alpha}u$, if for all compactly supported smooth functions
    $phi$ it holds that:



    $$int_{Omega} u ,D^{alpha}phi,dx =
    (-1)^{|alpha|}int_{Omega}vphi,dx.$$




    Quoting earlier answers (by user Christopher A. Wong) from the links I provided above:




    The basic intuition is that a weakly differentiable function looks
    differentiable except for on sets of zero measure. This allows
    functions that are not normally considered differentiable at "corners"
    to have a weak derivative that is defined everywhere on the original
    function's domain. The reason why weak derivatives ignore sets of zero
    measure is precisely because weak derivatives are defined by
    integrals, and integrals cannot see behavior on sets of zero measure.




    Now I'm almost satisfied with this answer, but one unclarity remains:



    Why is the weak derivative defined as it is? That is, when the idea of weak derivative was first considered, whoever it was, why did he select that particular equation above as the definition? Or was this definition a somewhat arbitrary choice, which simply suited to our purposes?



    For example, one of the reasons of using squared error in many problems of statistics or optimization is (as I've understood) because the squared error has nice analytical properties, making the math easy, when compared with e.g. absolute error.



    The reason why I'm asking this, is that sometimes it seems many techniques of mathematics rise like from a magicians hat. For a beginner like myself, it is difficult to picture the scenario and conditions that gave rise to that discovery or definition.










    share|cite|improve this question











    $endgroup$















      3












      3








      3





      $begingroup$


      I've been reading lately about reproducing kernel Hilbert spaces (RKHS) and Gaussian processes (GP) and during my studies I came across with the concept of weak derivative and Sobolev spaces. I have tried to get some sense from weak derivative by reading these two questions at the site:



      What is the intuition behind a function being 'weakly differentiable'?



      Intuition about weakly differentiable functions



      So I have learned now that:




      A function $u:Omegarightarrowmathbb{R}$ is weakly differentiable
      with $alpha$th ($alphainmathbb{N}^n$) weak partial derivative
      $v=D^{alpha}u$, if for all compactly supported smooth functions
      $phi$ it holds that:



      $$int_{Omega} u ,D^{alpha}phi,dx =
      (-1)^{|alpha|}int_{Omega}vphi,dx.$$




      Quoting earlier answers (by user Christopher A. Wong) from the links I provided above:




      The basic intuition is that a weakly differentiable function looks
      differentiable except for on sets of zero measure. This allows
      functions that are not normally considered differentiable at "corners"
      to have a weak derivative that is defined everywhere on the original
      function's domain. The reason why weak derivatives ignore sets of zero
      measure is precisely because weak derivatives are defined by
      integrals, and integrals cannot see behavior on sets of zero measure.




      Now I'm almost satisfied with this answer, but one unclarity remains:



      Why is the weak derivative defined as it is? That is, when the idea of weak derivative was first considered, whoever it was, why did he select that particular equation above as the definition? Or was this definition a somewhat arbitrary choice, which simply suited to our purposes?



      For example, one of the reasons of using squared error in many problems of statistics or optimization is (as I've understood) because the squared error has nice analytical properties, making the math easy, when compared with e.g. absolute error.



      The reason why I'm asking this, is that sometimes it seems many techniques of mathematics rise like from a magicians hat. For a beginner like myself, it is difficult to picture the scenario and conditions that gave rise to that discovery or definition.










      share|cite|improve this question











      $endgroup$




      I've been reading lately about reproducing kernel Hilbert spaces (RKHS) and Gaussian processes (GP) and during my studies I came across with the concept of weak derivative and Sobolev spaces. I have tried to get some sense from weak derivative by reading these two questions at the site:



      What is the intuition behind a function being 'weakly differentiable'?



      Intuition about weakly differentiable functions



      So I have learned now that:




      A function $u:Omegarightarrowmathbb{R}$ is weakly differentiable
      with $alpha$th ($alphainmathbb{N}^n$) weak partial derivative
      $v=D^{alpha}u$, if for all compactly supported smooth functions
      $phi$ it holds that:



      $$int_{Omega} u ,D^{alpha}phi,dx =
      (-1)^{|alpha|}int_{Omega}vphi,dx.$$




      Quoting earlier answers (by user Christopher A. Wong) from the links I provided above:




      The basic intuition is that a weakly differentiable function looks
      differentiable except for on sets of zero measure. This allows
      functions that are not normally considered differentiable at "corners"
      to have a weak derivative that is defined everywhere on the original
      function's domain. The reason why weak derivatives ignore sets of zero
      measure is precisely because weak derivatives are defined by
      integrals, and integrals cannot see behavior on sets of zero measure.




      Now I'm almost satisfied with this answer, but one unclarity remains:



      Why is the weak derivative defined as it is? That is, when the idea of weak derivative was first considered, whoever it was, why did he select that particular equation above as the definition? Or was this definition a somewhat arbitrary choice, which simply suited to our purposes?



      For example, one of the reasons of using squared error in many problems of statistics or optimization is (as I've understood) because the squared error has nice analytical properties, making the math easy, when compared with e.g. absolute error.



      The reason why I'm asking this, is that sometimes it seems many techniques of mathematics rise like from a magicians hat. For a beginner like myself, it is difficult to picture the scenario and conditions that gave rise to that discovery or definition.







      measure-theory sobolev-spaces weak-derivatives






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Jan 21 at 12:48







      jjepsuomi

















      asked Jan 21 at 12:36









      jjepsuomijjepsuomi

      3,64393361




      3,64393361






















          1 Answer
          1






          active

          oldest

          votes


















          1












          $begingroup$

          Laurent Schwartz had the idea. Distributions are linear functionals defined on test functions. They generalize functions in this sense. If $u$ is an actual function, then the corresponding distribution is the linear functional
          $$
          phi mapsto int u ;phi;dx
          $$

          A linear functional not of this form may still be considered a generalized function.



          Now if $u$ is a function and the derivative $v = D^alpha u$ actually exists, then integration by parts shows
          $$
          int_{Omega}v;phi,dx = (-1)^{|alpha|}int_{Omega} u ,D^{alpha}phi;dx
          $$

          [no boundary terms because $phi$ vanishes outside a compact subset of $Omega$]



          Next, if $u$ s a function but $D^alpha u$ does not exist in the classical sense, it is still true that the functional
          $$
          phi mapsto (-1)^{|alpha|}int_{Omega} u; ,D^{alpha}phi;dx
          $$

          makes sense and defines a generalized function (a.k.a.Schwartz distribution). So things work out if we go ahead and call this functional $D^alpha u$. [It is not a function, but a distribution.]






          share|cite|improve this answer











          $endgroup$













          • $begingroup$
            Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
            $endgroup$
            – jjepsuomi
            Jan 21 at 14:27













          Your Answer





          StackExchange.ifUsing("editor", function () {
          return StackExchange.using("mathjaxEditing", function () {
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          });
          });
          }, "mathjax-editing");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "69"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3081828%2fwhat-is-the-motivation-of-defining-weak-derivative-as-it-is%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          1












          $begingroup$

          Laurent Schwartz had the idea. Distributions are linear functionals defined on test functions. They generalize functions in this sense. If $u$ is an actual function, then the corresponding distribution is the linear functional
          $$
          phi mapsto int u ;phi;dx
          $$

          A linear functional not of this form may still be considered a generalized function.



          Now if $u$ is a function and the derivative $v = D^alpha u$ actually exists, then integration by parts shows
          $$
          int_{Omega}v;phi,dx = (-1)^{|alpha|}int_{Omega} u ,D^{alpha}phi;dx
          $$

          [no boundary terms because $phi$ vanishes outside a compact subset of $Omega$]



          Next, if $u$ s a function but $D^alpha u$ does not exist in the classical sense, it is still true that the functional
          $$
          phi mapsto (-1)^{|alpha|}int_{Omega} u; ,D^{alpha}phi;dx
          $$

          makes sense and defines a generalized function (a.k.a.Schwartz distribution). So things work out if we go ahead and call this functional $D^alpha u$. [It is not a function, but a distribution.]






          share|cite|improve this answer











          $endgroup$













          • $begingroup$
            Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
            $endgroup$
            – jjepsuomi
            Jan 21 at 14:27


















          1












          $begingroup$

          Laurent Schwartz had the idea. Distributions are linear functionals defined on test functions. They generalize functions in this sense. If $u$ is an actual function, then the corresponding distribution is the linear functional
          $$
          phi mapsto int u ;phi;dx
          $$

          A linear functional not of this form may still be considered a generalized function.



          Now if $u$ is a function and the derivative $v = D^alpha u$ actually exists, then integration by parts shows
          $$
          int_{Omega}v;phi,dx = (-1)^{|alpha|}int_{Omega} u ,D^{alpha}phi;dx
          $$

          [no boundary terms because $phi$ vanishes outside a compact subset of $Omega$]



          Next, if $u$ s a function but $D^alpha u$ does not exist in the classical sense, it is still true that the functional
          $$
          phi mapsto (-1)^{|alpha|}int_{Omega} u; ,D^{alpha}phi;dx
          $$

          makes sense and defines a generalized function (a.k.a.Schwartz distribution). So things work out if we go ahead and call this functional $D^alpha u$. [It is not a function, but a distribution.]






          share|cite|improve this answer











          $endgroup$













          • $begingroup$
            Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
            $endgroup$
            – jjepsuomi
            Jan 21 at 14:27
















          1












          1








          1





          $begingroup$

          Laurent Schwartz had the idea. Distributions are linear functionals defined on test functions. They generalize functions in this sense. If $u$ is an actual function, then the corresponding distribution is the linear functional
          $$
          phi mapsto int u ;phi;dx
          $$

          A linear functional not of this form may still be considered a generalized function.



          Now if $u$ is a function and the derivative $v = D^alpha u$ actually exists, then integration by parts shows
          $$
          int_{Omega}v;phi,dx = (-1)^{|alpha|}int_{Omega} u ,D^{alpha}phi;dx
          $$

          [no boundary terms because $phi$ vanishes outside a compact subset of $Omega$]



          Next, if $u$ s a function but $D^alpha u$ does not exist in the classical sense, it is still true that the functional
          $$
          phi mapsto (-1)^{|alpha|}int_{Omega} u; ,D^{alpha}phi;dx
          $$

          makes sense and defines a generalized function (a.k.a.Schwartz distribution). So things work out if we go ahead and call this functional $D^alpha u$. [It is not a function, but a distribution.]






          share|cite|improve this answer











          $endgroup$



          Laurent Schwartz had the idea. Distributions are linear functionals defined on test functions. They generalize functions in this sense. If $u$ is an actual function, then the corresponding distribution is the linear functional
          $$
          phi mapsto int u ;phi;dx
          $$

          A linear functional not of this form may still be considered a generalized function.



          Now if $u$ is a function and the derivative $v = D^alpha u$ actually exists, then integration by parts shows
          $$
          int_{Omega}v;phi,dx = (-1)^{|alpha|}int_{Omega} u ,D^{alpha}phi;dx
          $$

          [no boundary terms because $phi$ vanishes outside a compact subset of $Omega$]



          Next, if $u$ s a function but $D^alpha u$ does not exist in the classical sense, it is still true that the functional
          $$
          phi mapsto (-1)^{|alpha|}int_{Omega} u; ,D^{alpha}phi;dx
          $$

          makes sense and defines a generalized function (a.k.a.Schwartz distribution). So things work out if we go ahead and call this functional $D^alpha u$. [It is not a function, but a distribution.]







          share|cite|improve this answer














          share|cite|improve this answer



          share|cite|improve this answer








          edited Jan 21 at 13:18

























          answered Jan 21 at 13:02









          GEdgarGEdgar

          62.8k267171




          62.8k267171












          • $begingroup$
            Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
            $endgroup$
            – jjepsuomi
            Jan 21 at 14:27




















          • $begingroup$
            Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
            $endgroup$
            – jjepsuomi
            Jan 21 at 14:27


















          $begingroup$
          Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
          $endgroup$
          – jjepsuomi
          Jan 21 at 14:27






          $begingroup$
          Thank you for your help! Needed to check on a book about generalized functions tailored for theoretical physics, but I think the idea almost sunk into my head (helped to think about density and Dirac functions). One more question: the product rule is used in the definition of weak derivative....could we have used e.g. quotient rule instead? I mean, is there something special about the product rule? Bear with me if this is a funny question, still learning :-)
          $endgroup$
          – jjepsuomi
          Jan 21 at 14:27




















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Mathematics Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3081828%2fwhat-is-the-motivation-of-defining-weak-derivative-as-it-is%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          MongoDB - Not Authorized To Execute Command

          How to fix TextFormField cause rebuild widget in Flutter

          in spring boot 2.1 many test slices are not allowed anymore due to multiple @BootstrapWith