A mutual information equality

$x$ and $y$ are two random variables and $I(u;v)$ is the mutual information between random variables $u$ and $v$. Does the following equality hold?
$$text{argmax}_a I(y;ax)=text{argmin}_a I(y;y-ax).$$

asked Jan 8 at 15:57

Hans

4,99021032

add a comment |

asked Jan 8 at 15:57

Hans

4,99021032

add a comment |

asked Jan 8 at 15:57

Hans

4,99021032

information-theory

asked Jan 8 at 15:57

Hans

4,99021032

asked Jan 8 at 15:57

Hans

4,99021032

asked Jan 8 at 15:57

Hans

4,99021032

asked Jan 8 at 15:57

Hans

4,99021032

asked Jan 8 at 15:57

Hans

4,99021032

add a comment |

1 Answer
1

active

oldest

votes

It can be seen that for any $aneq 0$, $I(y;ax)=I(y;x)$ because $yrightarrow axrightarrow x$ and $yrightarrow xrightarrow ax$ are both Markov chain hence by the data processing inequality both $I(y;x)leq I(y;ax)$ and $I(y;x)geq I(y;ax)$ respectively.

Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48

$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07

$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00

$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28

$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45

add a comment |

Your Answer

StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\$","\$"]]);
});
});
}, "mathjax-editing");

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "69"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3066344%2fa-mutual-information-equality%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

1 Answer
1

active

oldest

votes

1 Answer
1

active

oldest

votes

Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48

$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07

$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00

$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28

$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45

add a comment |

Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48

$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07

$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00

$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28

$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45

add a comment |

Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

Since any $aneq 0$ is a maximizer of $I(y;ax)$, your question is ill defined.

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

edited Jan 8 at 17:09

answered Jan 8 at 16:25

P. Quinton

1,7911213

answered Jan 8 at 16:25

P. Quinton

1,7911213

answered Jan 8 at 16:25

P. Quinton

1,7911213

$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48

$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07

$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00

$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28

$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45

add a comment |

$begingroup$
Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.
$endgroup$
– Hans
Jan 8 at 16:48

$begingroup$
Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition
$endgroup$
– P. Quinton
Jan 8 at 17:07

$begingroup$
You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.
$endgroup$
– Hans
Jan 8 at 23:00

$begingroup$
well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.
$endgroup$
– P. Quinton
Jan 9 at 6:28

$begingroup$
I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.
$endgroup$
– Hans
Jan 9 at 17:45

Could there be a misunderstanding, say, as shown in the expression $I(y;x)>0=mathrm{argmin}_a I(y;y-ax)$? I am asking about argmin which the argument $a$ for which a minimization is achieved, not min.

– Hans
Jan 8 at 16:48

Yes, Sorry about that, but then you have to define $mathrm{argmax}$ more carefully since any $aneq 0$ is a maximizer of $I(y;ax)$ hence yes, it could be true depending on how you do this definition

– P. Quinton
Jan 8 at 17:07

You are right. Indeed, from the definition $I(x;y)$ is invariant under any injective one variable transformation, i.e. $I(x;y)=I(f(x),g(y))$ where $f$ and $g$ measurable injective functions. More importantly, you may be interested in taking a look at the question of my main concern stats.stackexchange.com/q/386101/44368.

– Hans
Jan 8 at 23:00

well, what you say is true if and only if $f$ and $g$ are almost surely one to one. About your main question, I don't understand what "z distributed normally conditioned on x" means, I would say you may want $x$ and $z$ independ and $zsim mathcal N(0,sigma^2)$ or something of the sort. I would say without proof that if $x$ is also gaussian then your minimization is exactly the same as the usual regression so it may be of interest in other cases. I'm pretty sure this has been done before but I do not have any references.

– P. Quinton
Jan 9 at 6:28

I said "injective... transformation". That means one-to-one. "Almost surely", sure it makes it more complete, but is more like icing on the cake. So we are saying the same thing. Regarding my main question, there may be some misunderstanding there. The normality is to introduce the subject by giving the background. The new concept using mutual information comes after the phrase "now dropping the normality". Would you please reread the question more carefully? Thank you.

– Hans
Jan 9 at 17:45

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Mathematics Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

Use MathJax to format equations. MathJax reference.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

Search This Blog

Ufyukyu