How does a gradient allow the calculation of the directional derivative












3












$begingroup$


If the gradient only results in a vector telling you the steepest direction to travel, how can the "slope" in any direction be calculated?



If the gradient is:



$nabla f(x,y,z) = left[begin{array}{c}frac{partial f}{partial x}\frac{partial f}{partial y}\frac{partial f}{partial z}
end{array}right]$



How can the directional derivative simply be a dot product between the gradient and the vector?



$nabla_{vec{v}},f(x,y,z) = nabla f(x,y,z) cdot vec{v} = vec{v}_xfrac{partial f}{partial x}+vec{v}_yfrac{partial f}{partial y}+vec{v}_zfrac{partial f}{partial z}$










share|cite|improve this question











$endgroup$








  • 1




    $begingroup$
    It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
    $endgroup$
    – Ian
    Jan 13 at 20:08
















3












$begingroup$


If the gradient only results in a vector telling you the steepest direction to travel, how can the "slope" in any direction be calculated?



If the gradient is:



$nabla f(x,y,z) = left[begin{array}{c}frac{partial f}{partial x}\frac{partial f}{partial y}\frac{partial f}{partial z}
end{array}right]$



How can the directional derivative simply be a dot product between the gradient and the vector?



$nabla_{vec{v}},f(x,y,z) = nabla f(x,y,z) cdot vec{v} = vec{v}_xfrac{partial f}{partial x}+vec{v}_yfrac{partial f}{partial y}+vec{v}_zfrac{partial f}{partial z}$










share|cite|improve this question











$endgroup$








  • 1




    $begingroup$
    It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
    $endgroup$
    – Ian
    Jan 13 at 20:08














3












3








3


1



$begingroup$


If the gradient only results in a vector telling you the steepest direction to travel, how can the "slope" in any direction be calculated?



If the gradient is:



$nabla f(x,y,z) = left[begin{array}{c}frac{partial f}{partial x}\frac{partial f}{partial y}\frac{partial f}{partial z}
end{array}right]$



How can the directional derivative simply be a dot product between the gradient and the vector?



$nabla_{vec{v}},f(x,y,z) = nabla f(x,y,z) cdot vec{v} = vec{v}_xfrac{partial f}{partial x}+vec{v}_yfrac{partial f}{partial y}+vec{v}_zfrac{partial f}{partial z}$










share|cite|improve this question











$endgroup$




If the gradient only results in a vector telling you the steepest direction to travel, how can the "slope" in any direction be calculated?



If the gradient is:



$nabla f(x,y,z) = left[begin{array}{c}frac{partial f}{partial x}\frac{partial f}{partial y}\frac{partial f}{partial z}
end{array}right]$



How can the directional derivative simply be a dot product between the gradient and the vector?



$nabla_{vec{v}},f(x,y,z) = nabla f(x,y,z) cdot vec{v} = vec{v}_xfrac{partial f}{partial x}+vec{v}_yfrac{partial f}{partial y}+vec{v}_zfrac{partial f}{partial z}$







multivariable-calculus vectors partial-derivative vector-analysis






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Jan 13 at 22:03







Arvin Singh

















asked Jan 13 at 20:03









Arvin SinghArvin Singh

14117




14117








  • 1




    $begingroup$
    It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
    $endgroup$
    – Ian
    Jan 13 at 20:08














  • 1




    $begingroup$
    It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
    $endgroup$
    – Ian
    Jan 13 at 20:08








1




1




$begingroup$
It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
$endgroup$
– Ian
Jan 13 at 20:08




$begingroup$
It isn't just the direction of steepest ascent. The logic goes the other way: the directional derivative is the dot product of the gradient with a vector, and as a consequence the normalized gradient is the direction of steepest ascent. This follows from the Cauchy-Schwarz inequality.
$endgroup$
– Ian
Jan 13 at 20:08










2 Answers
2






active

oldest

votes


















2












$begingroup$

I don’t know how well you already know stuff so I’m going to just justify everything I can. There’s a paragraph about the gradient at the bottom. Also the logic extends so I’ll just talk about functions with 2 inputs (surfaces) instead of three. It might be more helpful. You also might need some visuals and I think there are several things online or I believe khan academy has some helpful visuals even though I’m not always a fan of their regular videos.



Consider the multivariable chain rule. Recall that it works intuitively because the change in z arises from the combined changes in x and in y. Also you can try some practical thought experiments help make it clear, all with the basic idea that for $z(x(t),y(t))$, the change of the height z as you go through time (or more generally some other parameter) is the amount it changes due to x plus the amount it’s changing due to y since they are independent. You could readily substitute amount and rate in that previous sentence, which yields the multivariable chain rule, as was desired.



Now, this $f(x(t),y(t))$ can be interpreted as a path above a graph on the xy plane, which goes along the surface $f$. When the graph (x,y) is a straight line and you move along it with constant speed, you have the derivative as you move in that direction (not just x or y, which are the partial derivatives). If you move at different speeds, like twice the speed, the height changes twice as much in the same amount of time so the directional derivative also doubles. Directional derivatives, however, we take when moving at a constant speed (so they’re like a regular partial derivative but moving in a different direction), thus the vector is normalized. That’s the interpretation of not normalizing the vector (which is just moving at a different speed) cause I don’t feel like normalizing this: suppose $frac{dy}{dt}=2$, and $frac{dx}{dt}=3$ (I’ll try to look up and make the partial derivative symbol in there later). As you move along the vector 2,3 in a unit of time, the rate of change of z is these substituted into the multivariable chain rule, which is algebraically equivelant to the gradient dot the vector you move along in a period.



Additionally, the dot product is saying how much length of one vector there is per length of the other along it’s direction, because this is equivalent to projecting one vector onto another and multiplying their lengths. Note: it doesn’t matter which is projected onto which because the dot product is symmetric. Anyways, locally the gradient is the direction of steepest ascent, and dotting a vector with it is just saying how much is this unit motion lined up along this direction of main (positive) ascent, the quantity you’re measuring. This comparison to the gradient is logically the same as saying how much of the increase (gradient) is in a particular direction, hence, directional derivative.



*The gradient is how a function ascends, ie a certain amount in one direction plus a certain amount in the other(s), or: $ifrac{df}{dx}+jfrac{df}{dy}$ (again, sorry about the latex, and that’s meant to be i hat and j hat). Viewed in this light, it only makes sense that “how it’s ascending” ascends in the direction of steepest ascent. It’s just as a gradient of a single variable function is a vector telling you loosely “how it’s changing”, and if you can connect this to the idea that the vector (which is just a scalar in this case) points in the direction of increase according to how it’s increasing you can see the analogy for higher dimensional gradients. Of course you could also prove this analytically in several ways, one of which is working backwards and optimizing the angle of the directional derivative (which weirdly isn’t circular reasoning and doesn’t require you to already know about the gradient).



I haven’t thought through this too much but since every analytic function is has a tangent plane you could figure it out by doing it on a plane.



And again, choosing some real world or more abstract variables with this and doing a thought experiment is fun. And like Mohammad said, like normal slope, it’s a scalar.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
    $endgroup$
    – Arvin Singh
    Jan 13 at 23:45










  • $begingroup$
    @ArvinSingh I think so but I have to learn about interpolation.
    $endgroup$
    – Benjamin Thoburn
    Jan 14 at 17:52










  • $begingroup$
    Linear interpolation as in scaling between 2 values
    $endgroup$
    – Arvin Singh
    Jan 14 at 20:37



















1












$begingroup$

The result of dot product of gradient vector and the unit direction vector is a scalar not a vector. The scalar is the directional derivative in that direction. I noticed that you have a vector for the result of the dot product instead of a scalar. You need to add the components to get the result.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
    $endgroup$
    – Arvin Singh
    Jan 13 at 22:04










  • $begingroup$
    Derivatives are slopes which are the instantaneous rate of change in the given direction.
    $endgroup$
    – Mohammad Riazi-Kermani
    Jan 13 at 22:30











Your Answer





StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});


}
});














draft saved

draft discarded


















StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3072458%2fhow-does-a-gradient-allow-the-calculation-of-the-directional-derivative%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown

























2 Answers
2






active

oldest

votes








2 Answers
2






active

oldest

votes









active

oldest

votes






active

oldest

votes









2












$begingroup$

I don’t know how well you already know stuff so I’m going to just justify everything I can. There’s a paragraph about the gradient at the bottom. Also the logic extends so I’ll just talk about functions with 2 inputs (surfaces) instead of three. It might be more helpful. You also might need some visuals and I think there are several things online or I believe khan academy has some helpful visuals even though I’m not always a fan of their regular videos.



Consider the multivariable chain rule. Recall that it works intuitively because the change in z arises from the combined changes in x and in y. Also you can try some practical thought experiments help make it clear, all with the basic idea that for $z(x(t),y(t))$, the change of the height z as you go through time (or more generally some other parameter) is the amount it changes due to x plus the amount it’s changing due to y since they are independent. You could readily substitute amount and rate in that previous sentence, which yields the multivariable chain rule, as was desired.



Now, this $f(x(t),y(t))$ can be interpreted as a path above a graph on the xy plane, which goes along the surface $f$. When the graph (x,y) is a straight line and you move along it with constant speed, you have the derivative as you move in that direction (not just x or y, which are the partial derivatives). If you move at different speeds, like twice the speed, the height changes twice as much in the same amount of time so the directional derivative also doubles. Directional derivatives, however, we take when moving at a constant speed (so they’re like a regular partial derivative but moving in a different direction), thus the vector is normalized. That’s the interpretation of not normalizing the vector (which is just moving at a different speed) cause I don’t feel like normalizing this: suppose $frac{dy}{dt}=2$, and $frac{dx}{dt}=3$ (I’ll try to look up and make the partial derivative symbol in there later). As you move along the vector 2,3 in a unit of time, the rate of change of z is these substituted into the multivariable chain rule, which is algebraically equivelant to the gradient dot the vector you move along in a period.



Additionally, the dot product is saying how much length of one vector there is per length of the other along it’s direction, because this is equivalent to projecting one vector onto another and multiplying their lengths. Note: it doesn’t matter which is projected onto which because the dot product is symmetric. Anyways, locally the gradient is the direction of steepest ascent, and dotting a vector with it is just saying how much is this unit motion lined up along this direction of main (positive) ascent, the quantity you’re measuring. This comparison to the gradient is logically the same as saying how much of the increase (gradient) is in a particular direction, hence, directional derivative.



*The gradient is how a function ascends, ie a certain amount in one direction plus a certain amount in the other(s), or: $ifrac{df}{dx}+jfrac{df}{dy}$ (again, sorry about the latex, and that’s meant to be i hat and j hat). Viewed in this light, it only makes sense that “how it’s ascending” ascends in the direction of steepest ascent. It’s just as a gradient of a single variable function is a vector telling you loosely “how it’s changing”, and if you can connect this to the idea that the vector (which is just a scalar in this case) points in the direction of increase according to how it’s increasing you can see the analogy for higher dimensional gradients. Of course you could also prove this analytically in several ways, one of which is working backwards and optimizing the angle of the directional derivative (which weirdly isn’t circular reasoning and doesn’t require you to already know about the gradient).



I haven’t thought through this too much but since every analytic function is has a tangent plane you could figure it out by doing it on a plane.



And again, choosing some real world or more abstract variables with this and doing a thought experiment is fun. And like Mohammad said, like normal slope, it’s a scalar.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
    $endgroup$
    – Arvin Singh
    Jan 13 at 23:45










  • $begingroup$
    @ArvinSingh I think so but I have to learn about interpolation.
    $endgroup$
    – Benjamin Thoburn
    Jan 14 at 17:52










  • $begingroup$
    Linear interpolation as in scaling between 2 values
    $endgroup$
    – Arvin Singh
    Jan 14 at 20:37
















2












$begingroup$

I don’t know how well you already know stuff so I’m going to just justify everything I can. There’s a paragraph about the gradient at the bottom. Also the logic extends so I’ll just talk about functions with 2 inputs (surfaces) instead of three. It might be more helpful. You also might need some visuals and I think there are several things online or I believe khan academy has some helpful visuals even though I’m not always a fan of their regular videos.



Consider the multivariable chain rule. Recall that it works intuitively because the change in z arises from the combined changes in x and in y. Also you can try some practical thought experiments help make it clear, all with the basic idea that for $z(x(t),y(t))$, the change of the height z as you go through time (or more generally some other parameter) is the amount it changes due to x plus the amount it’s changing due to y since they are independent. You could readily substitute amount and rate in that previous sentence, which yields the multivariable chain rule, as was desired.



Now, this $f(x(t),y(t))$ can be interpreted as a path above a graph on the xy plane, which goes along the surface $f$. When the graph (x,y) is a straight line and you move along it with constant speed, you have the derivative as you move in that direction (not just x or y, which are the partial derivatives). If you move at different speeds, like twice the speed, the height changes twice as much in the same amount of time so the directional derivative also doubles. Directional derivatives, however, we take when moving at a constant speed (so they’re like a regular partial derivative but moving in a different direction), thus the vector is normalized. That’s the interpretation of not normalizing the vector (which is just moving at a different speed) cause I don’t feel like normalizing this: suppose $frac{dy}{dt}=2$, and $frac{dx}{dt}=3$ (I’ll try to look up and make the partial derivative symbol in there later). As you move along the vector 2,3 in a unit of time, the rate of change of z is these substituted into the multivariable chain rule, which is algebraically equivelant to the gradient dot the vector you move along in a period.



Additionally, the dot product is saying how much length of one vector there is per length of the other along it’s direction, because this is equivalent to projecting one vector onto another and multiplying their lengths. Note: it doesn’t matter which is projected onto which because the dot product is symmetric. Anyways, locally the gradient is the direction of steepest ascent, and dotting a vector with it is just saying how much is this unit motion lined up along this direction of main (positive) ascent, the quantity you’re measuring. This comparison to the gradient is logically the same as saying how much of the increase (gradient) is in a particular direction, hence, directional derivative.



*The gradient is how a function ascends, ie a certain amount in one direction plus a certain amount in the other(s), or: $ifrac{df}{dx}+jfrac{df}{dy}$ (again, sorry about the latex, and that’s meant to be i hat and j hat). Viewed in this light, it only makes sense that “how it’s ascending” ascends in the direction of steepest ascent. It’s just as a gradient of a single variable function is a vector telling you loosely “how it’s changing”, and if you can connect this to the idea that the vector (which is just a scalar in this case) points in the direction of increase according to how it’s increasing you can see the analogy for higher dimensional gradients. Of course you could also prove this analytically in several ways, one of which is working backwards and optimizing the angle of the directional derivative (which weirdly isn’t circular reasoning and doesn’t require you to already know about the gradient).



I haven’t thought through this too much but since every analytic function is has a tangent plane you could figure it out by doing it on a plane.



And again, choosing some real world or more abstract variables with this and doing a thought experiment is fun. And like Mohammad said, like normal slope, it’s a scalar.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
    $endgroup$
    – Arvin Singh
    Jan 13 at 23:45










  • $begingroup$
    @ArvinSingh I think so but I have to learn about interpolation.
    $endgroup$
    – Benjamin Thoburn
    Jan 14 at 17:52










  • $begingroup$
    Linear interpolation as in scaling between 2 values
    $endgroup$
    – Arvin Singh
    Jan 14 at 20:37














2












2








2





$begingroup$

I don’t know how well you already know stuff so I’m going to just justify everything I can. There’s a paragraph about the gradient at the bottom. Also the logic extends so I’ll just talk about functions with 2 inputs (surfaces) instead of three. It might be more helpful. You also might need some visuals and I think there are several things online or I believe khan academy has some helpful visuals even though I’m not always a fan of their regular videos.



Consider the multivariable chain rule. Recall that it works intuitively because the change in z arises from the combined changes in x and in y. Also you can try some practical thought experiments help make it clear, all with the basic idea that for $z(x(t),y(t))$, the change of the height z as you go through time (or more generally some other parameter) is the amount it changes due to x plus the amount it’s changing due to y since they are independent. You could readily substitute amount and rate in that previous sentence, which yields the multivariable chain rule, as was desired.



Now, this $f(x(t),y(t))$ can be interpreted as a path above a graph on the xy plane, which goes along the surface $f$. When the graph (x,y) is a straight line and you move along it with constant speed, you have the derivative as you move in that direction (not just x or y, which are the partial derivatives). If you move at different speeds, like twice the speed, the height changes twice as much in the same amount of time so the directional derivative also doubles. Directional derivatives, however, we take when moving at a constant speed (so they’re like a regular partial derivative but moving in a different direction), thus the vector is normalized. That’s the interpretation of not normalizing the vector (which is just moving at a different speed) cause I don’t feel like normalizing this: suppose $frac{dy}{dt}=2$, and $frac{dx}{dt}=3$ (I’ll try to look up and make the partial derivative symbol in there later). As you move along the vector 2,3 in a unit of time, the rate of change of z is these substituted into the multivariable chain rule, which is algebraically equivelant to the gradient dot the vector you move along in a period.



Additionally, the dot product is saying how much length of one vector there is per length of the other along it’s direction, because this is equivalent to projecting one vector onto another and multiplying their lengths. Note: it doesn’t matter which is projected onto which because the dot product is symmetric. Anyways, locally the gradient is the direction of steepest ascent, and dotting a vector with it is just saying how much is this unit motion lined up along this direction of main (positive) ascent, the quantity you’re measuring. This comparison to the gradient is logically the same as saying how much of the increase (gradient) is in a particular direction, hence, directional derivative.



*The gradient is how a function ascends, ie a certain amount in one direction plus a certain amount in the other(s), or: $ifrac{df}{dx}+jfrac{df}{dy}$ (again, sorry about the latex, and that’s meant to be i hat and j hat). Viewed in this light, it only makes sense that “how it’s ascending” ascends in the direction of steepest ascent. It’s just as a gradient of a single variable function is a vector telling you loosely “how it’s changing”, and if you can connect this to the idea that the vector (which is just a scalar in this case) points in the direction of increase according to how it’s increasing you can see the analogy for higher dimensional gradients. Of course you could also prove this analytically in several ways, one of which is working backwards and optimizing the angle of the directional derivative (which weirdly isn’t circular reasoning and doesn’t require you to already know about the gradient).



I haven’t thought through this too much but since every analytic function is has a tangent plane you could figure it out by doing it on a plane.



And again, choosing some real world or more abstract variables with this and doing a thought experiment is fun. And like Mohammad said, like normal slope, it’s a scalar.






share|cite|improve this answer









$endgroup$



I don’t know how well you already know stuff so I’m going to just justify everything I can. There’s a paragraph about the gradient at the bottom. Also the logic extends so I’ll just talk about functions with 2 inputs (surfaces) instead of three. It might be more helpful. You also might need some visuals and I think there are several things online or I believe khan academy has some helpful visuals even though I’m not always a fan of their regular videos.



Consider the multivariable chain rule. Recall that it works intuitively because the change in z arises from the combined changes in x and in y. Also you can try some practical thought experiments help make it clear, all with the basic idea that for $z(x(t),y(t))$, the change of the height z as you go through time (or more generally some other parameter) is the amount it changes due to x plus the amount it’s changing due to y since they are independent. You could readily substitute amount and rate in that previous sentence, which yields the multivariable chain rule, as was desired.



Now, this $f(x(t),y(t))$ can be interpreted as a path above a graph on the xy plane, which goes along the surface $f$. When the graph (x,y) is a straight line and you move along it with constant speed, you have the derivative as you move in that direction (not just x or y, which are the partial derivatives). If you move at different speeds, like twice the speed, the height changes twice as much in the same amount of time so the directional derivative also doubles. Directional derivatives, however, we take when moving at a constant speed (so they’re like a regular partial derivative but moving in a different direction), thus the vector is normalized. That’s the interpretation of not normalizing the vector (which is just moving at a different speed) cause I don’t feel like normalizing this: suppose $frac{dy}{dt}=2$, and $frac{dx}{dt}=3$ (I’ll try to look up and make the partial derivative symbol in there later). As you move along the vector 2,3 in a unit of time, the rate of change of z is these substituted into the multivariable chain rule, which is algebraically equivelant to the gradient dot the vector you move along in a period.



Additionally, the dot product is saying how much length of one vector there is per length of the other along it’s direction, because this is equivalent to projecting one vector onto another and multiplying their lengths. Note: it doesn’t matter which is projected onto which because the dot product is symmetric. Anyways, locally the gradient is the direction of steepest ascent, and dotting a vector with it is just saying how much is this unit motion lined up along this direction of main (positive) ascent, the quantity you’re measuring. This comparison to the gradient is logically the same as saying how much of the increase (gradient) is in a particular direction, hence, directional derivative.



*The gradient is how a function ascends, ie a certain amount in one direction plus a certain amount in the other(s), or: $ifrac{df}{dx}+jfrac{df}{dy}$ (again, sorry about the latex, and that’s meant to be i hat and j hat). Viewed in this light, it only makes sense that “how it’s ascending” ascends in the direction of steepest ascent. It’s just as a gradient of a single variable function is a vector telling you loosely “how it’s changing”, and if you can connect this to the idea that the vector (which is just a scalar in this case) points in the direction of increase according to how it’s increasing you can see the analogy for higher dimensional gradients. Of course you could also prove this analytically in several ways, one of which is working backwards and optimizing the angle of the directional derivative (which weirdly isn’t circular reasoning and doesn’t require you to already know about the gradient).



I haven’t thought through this too much but since every analytic function is has a tangent plane you could figure it out by doing it on a plane.



And again, choosing some real world or more abstract variables with this and doing a thought experiment is fun. And like Mohammad said, like normal slope, it’s a scalar.







share|cite|improve this answer












share|cite|improve this answer



share|cite|improve this answer










answered Jan 13 at 23:15









Benjamin ThoburnBenjamin Thoburn

350213




350213












  • $begingroup$
    Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
    $endgroup$
    – Arvin Singh
    Jan 13 at 23:45










  • $begingroup$
    @ArvinSingh I think so but I have to learn about interpolation.
    $endgroup$
    – Benjamin Thoburn
    Jan 14 at 17:52










  • $begingroup$
    Linear interpolation as in scaling between 2 values
    $endgroup$
    – Arvin Singh
    Jan 14 at 20:37


















  • $begingroup$
    Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
    $endgroup$
    – Arvin Singh
    Jan 13 at 23:45










  • $begingroup$
    @ArvinSingh I think so but I have to learn about interpolation.
    $endgroup$
    – Benjamin Thoburn
    Jan 14 at 17:52










  • $begingroup$
    Linear interpolation as in scaling between 2 values
    $endgroup$
    – Arvin Singh
    Jan 14 at 20:37
















$begingroup$
Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
$endgroup$
– Arvin Singh
Jan 13 at 23:45




$begingroup$
Oh I get it. Of course it is akin to interpolation. The tangents in all the directions are on a flat plane and that is why you can use a dot product to describe the directional derivative. Am I right? If so, I can’t believe I did not get this earlier. Thank you. Your second to last paragraph if the one that made it super obvious.
$endgroup$
– Arvin Singh
Jan 13 at 23:45












$begingroup$
@ArvinSingh I think so but I have to learn about interpolation.
$endgroup$
– Benjamin Thoburn
Jan 14 at 17:52




$begingroup$
@ArvinSingh I think so but I have to learn about interpolation.
$endgroup$
– Benjamin Thoburn
Jan 14 at 17:52












$begingroup$
Linear interpolation as in scaling between 2 values
$endgroup$
– Arvin Singh
Jan 14 at 20:37




$begingroup$
Linear interpolation as in scaling between 2 values
$endgroup$
– Arvin Singh
Jan 14 at 20:37











1












$begingroup$

The result of dot product of gradient vector and the unit direction vector is a scalar not a vector. The scalar is the directional derivative in that direction. I noticed that you have a vector for the result of the dot product instead of a scalar. You need to add the components to get the result.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
    $endgroup$
    – Arvin Singh
    Jan 13 at 22:04










  • $begingroup$
    Derivatives are slopes which are the instantaneous rate of change in the given direction.
    $endgroup$
    – Mohammad Riazi-Kermani
    Jan 13 at 22:30
















1












$begingroup$

The result of dot product of gradient vector and the unit direction vector is a scalar not a vector. The scalar is the directional derivative in that direction. I noticed that you have a vector for the result of the dot product instead of a scalar. You need to add the components to get the result.






share|cite|improve this answer









$endgroup$













  • $begingroup$
    Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
    $endgroup$
    – Arvin Singh
    Jan 13 at 22:04










  • $begingroup$
    Derivatives are slopes which are the instantaneous rate of change in the given direction.
    $endgroup$
    – Mohammad Riazi-Kermani
    Jan 13 at 22:30














1












1








1





$begingroup$

The result of dot product of gradient vector and the unit direction vector is a scalar not a vector. The scalar is the directional derivative in that direction. I noticed that you have a vector for the result of the dot product instead of a scalar. You need to add the components to get the result.






share|cite|improve this answer









$endgroup$



The result of dot product of gradient vector and the unit direction vector is a scalar not a vector. The scalar is the directional derivative in that direction. I noticed that you have a vector for the result of the dot product instead of a scalar. You need to add the components to get the result.







share|cite|improve this answer












share|cite|improve this answer



share|cite|improve this answer










answered Jan 13 at 20:14









Mohammad Riazi-KermaniMohammad Riazi-Kermani

41.6k42061




41.6k42061












  • $begingroup$
    Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
    $endgroup$
    – Arvin Singh
    Jan 13 at 22:04










  • $begingroup$
    Derivatives are slopes which are the instantaneous rate of change in the given direction.
    $endgroup$
    – Mohammad Riazi-Kermani
    Jan 13 at 22:30


















  • $begingroup$
    Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
    $endgroup$
    – Arvin Singh
    Jan 13 at 22:04










  • $begingroup$
    Derivatives are slopes which are the instantaneous rate of change in the given direction.
    $endgroup$
    – Mohammad Riazi-Kermani
    Jan 13 at 22:30
















$begingroup$
Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
$endgroup$
– Arvin Singh
Jan 13 at 22:04




$begingroup$
Sorry, I didn't realize my mistake, but I just don't see how combinations of 3 derivatives can give you derivatives in every direction. Derivatives can't just be linear interpolations, can they?
$endgroup$
– Arvin Singh
Jan 13 at 22:04












$begingroup$
Derivatives are slopes which are the instantaneous rate of change in the given direction.
$endgroup$
– Mohammad Riazi-Kermani
Jan 13 at 22:30




$begingroup$
Derivatives are slopes which are the instantaneous rate of change in the given direction.
$endgroup$
– Mohammad Riazi-Kermani
Jan 13 at 22:30


















draft saved

draft discarded




















































Thanks for contributing an answer to Mathematics Stack Exchange!


  • Please be sure to answer the question. Provide details and share your research!

But avoid



  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.


Use MathJax to format equations. MathJax reference.


To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3072458%2fhow-does-a-gradient-allow-the-calculation-of-the-directional-derivative%23new-answer', 'question_page');
}
);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Can a sorcerer learn a 5th-level spell early by creating spell slots using the Font of Magic feature?

Does disintegrating a polymorphed enemy still kill it after the 2018 errata?

A Topological Invariant for $pi_3(U(n))$