How does quantile regression compare to logistic regression with the variable split at the quantile?


I googled a bit but didn't find anything on this.

Suppose you do a quantile regression on the qth quantile of the dependent variable (DV).

Then you split the DV at its qth quantile, label the two groups 0 and 1, and do a logistic regression on the categorized DV.

I'm looking for any Monte Carlo studies of this, or reasons to prefer one approach over the other.







logistic quantile-regression
















asked 3 hours ago









Peter Flom












  • Could you show us any reasonable way even to compare the results of the two regressions? After all, unless you have something a little less general in mind, the coefficients of the regressors in these two models have entirely different meanings and interpretations, so in what sense are we to understand what you mean by "prefer"?
    – whuber
    1 hour ago


























1 Answer
For simplicity, assume you have a continuous dependent variable Y and a continuous predictor variable X.



Logistic Regression



If I understand your post correctly, your logistic regression will categorize Y into 0 and 1 based on the quantile of the (unconditional) distribution of Y. Specifically, the q-th quantile of the distribution of observed Y values will be computed and Ycat will be defined as 0 if Y is strictly less than this quantile and 1 if Y is greater than or equal to this quantile.



If the above captures your intent, then the logistic regression will model the odds of Y exceeding or being equal to the (observed) q-th quantile of the (unconditional) Y distribution as a function of X.
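
A minimal sketch of this step in Python, assuming simulated data and a hand-rolled Newton-Raphson fit so that the block needs no third-party packages. The function names (`sample_quantile`, `dichotomize`, `fit_logistic`), the data-generating line Y = 1 + 2X + noise, and the choice q = 0.75 are all illustrative assumptions, not something taken from the question.

```python
import math
import random


def sample_quantile(values, q):
    """Empirical q-th quantile (0 <= q <= 1), interpolating between order statistics."""
    s = sorted(values)
    pos = q * (len(s) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(s) - 1)
    return s[lo] + (pos - lo) * (s[hi] - s[lo])


def dichotomize(y, q):
    """Split y at its unconditional sample q-th quantile: 1 if y >= cutoff, else 0."""
    cutoff = sample_quantile(y, q)
    return cutoff, [1 if v >= cutoff else 0 for v in y]


def fit_logistic(x, ycat, iters=25):
    """Fit logit P(ycat = 1 | x) = a + b*x by Newton-Raphson (illustrative only)."""
    a = b = 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi in zip(x, ycat):
            eta = max(-30.0, min(30.0, a + b * xi))  # clamp to avoid overflow in exp
            p = 1.0 / (1.0 + math.exp(-eta))
            w = p * (1.0 - p)
            g0 += yi - p            # gradient w.r.t. intercept
            g1 += (yi - p) * xi     # gradient w.r.t. slope
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b


random.seed(0)
x = [random.gauss(0, 1) for _ in range(2000)]
y = [1 + 2 * xi + random.gauss(0, 1) for xi in x]  # hypothetical data

q = 0.75
cutoff, ycat = dichotomize(y, q)
a, b = fit_logistic(x, ycat)
share_of_ones = sum(ycat) / len(ycat)
print(f"cutoff = {cutoff:.2f}, share labelled 1 = {share_of_ones:.3f}")
print(f"fitted log-odds of Y >= cutoff: {a:.2f} + {b:.2f} * x")
```

Note that the cutoff is computed once from all Y values, before X enters the picture; X only affects the fitted odds of landing above it.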



Quantile Regression



On the other hand, if you are performing a quantile regression of Y on X, you are focusing on modelling how the q-th quantile of the conditional distribution of Y given X changes as a function of X.
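
For comparison, here is a rough sketch of a linear quantile regression fit in plain Python. It relies on the fact that, for a straight-line model, some minimiser of the total check (pinball) loss passes through two of the data points, so trying every pair of points is exact, though far too slow for real use. The helper names, the simulated data, and q = 0.75 are assumptions made for the illustration; in practice you would use a dedicated quantile regression routine.

```python
import random


def pinball_loss(residual, q):
    """Check (pinball) loss: q*r for r >= 0 and (q - 1)*r for r < 0."""
    return q * residual if residual >= 0 else (q - 1) * residual


def fit_quantile_line(x, y, q):
    """Brute-force fit of the conditional q-th quantile line a + b*x.

    Tries every line through two data points and keeps the one with the
    smallest total check loss. Exact for a straight-line model, but O(n^3).
    """
    best = None
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):
            if x[i] == x[j]:
                continue  # a vertical line is not a valid candidate
            b = (y[j] - y[i]) / (x[j] - x[i])
            a = y[i] - b * x[i]
            loss = sum(pinball_loss(yk - a - b * xk, q) for xk, yk in zip(x, y))
            if best is None or loss < best[0]:
                best = (loss, a, b)
    return best[1], best[2]


random.seed(1)
n = 120
x = [random.gauss(0, 1) for _ in range(n)]
y = [1 + 2 * xi + random.gauss(0, 1) for xi in x]  # hypothetical data

q = 0.75
a, b = fit_quantile_line(x, y, q)
frac_below = sum(1 for xk, yk in zip(x, y) if yk < a + b * xk) / n
print(f"fitted {q}-quantile line: {a:.2f} + {b:.2f} * x")
print(f"fraction of points below the line: {frac_below:.3f}")
```

Here nothing is dichotomized: the fitted line itself is the moving threshold, placed so that roughly a fraction q of the points fall below it at each value of x.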



Logistic Regression versus Quantile Regression



It seems to me that these two procedures have totally different aims, since the first procedure (i.e., logistic regression) focuses on the q-th quantile of the unconditional distribution of Y, whereas the second procedure (i.e., quantile regression) focuses on the q-th quantile of the conditional distribution of Y given X.
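
In symbols, and assuming for the sake of illustration that both models are linear in X (the symbols $y_q$, $\alpha$, $\beta$, $a_q$ and $b_q$ are notation introduced here, with $y_q$ denoting the q-th quantile of the unconditional distribution of Y):

$$
\text{logistic regression:}\quad \log\frac{P(Y \ge y_q \mid X = x)}{1 - P(Y \ge y_q \mid X = x)} = \alpha + \beta x
$$

$$
\text{quantile regression:}\quad Q_{Y \mid X = x}(q) = a_q + b_q x
$$

In the first model the threshold $y_q$ is fixed and the probability of exceeding it varies with x; in the second the probability level q is fixed and the threshold varies with x.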



The unconditional distribution of Y is the distribution of all Y values (hence it ignores any information about the X values).

The conditional distribution of Y given X is the distribution of the Y values among subjects who share the same value of X.
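
To make the distinction concrete, here is a small sketch that computes both kinds of quantile on made-up data with three X groups. The group centres (0, 3 and 6), the sample sizes, and the use of the upper quartile (q = 0.75) are assumptions chosen only for illustration.

```python
import random
from statistics import quantiles


def upper_quartile(values):
    """Sample 0.75 quantile: the third of the three quartile cut points."""
    return quantiles(values, n=4)[2]


random.seed(2)
groups = (0, 1, 2)
# Hypothetical data: Y is centred on 3 * X within each X group.
data = [(xg, random.gauss(3 * xg, 1)) for xg in groups for _ in range(500)]

# Unconditional: pool every Y value and ignore X entirely.
unconditional = upper_quartile([yv for _, yv in data])

# Conditional: one quantile per distinct X value.
conditional = {
    xg: upper_quartile([yv for xv, yv in data if xv == xg]) for xg in groups
}

print(f"unconditional 0.75 quantile of Y: {unconditional:.2f}")
for xg in groups:
    print(f"0.75 quantile of Y given X = {xg}: {conditional[xg]:.2f}")
```

The pooled data yield a single number, while conditioning on X yields one number per X value.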


Illustrative Example



For illustration purposes, let's say Y = cholesterol and X = body weight.



Then logistic regression is modelling the odds of having a 'high' cholesterol value (i.e., greater than or equal to the q-th quantile of the observed cholesterol values) as a function of body weight, where the definition of 'high' has no relation to body weight. In other words, the marker for what constitutes a 'high' cholesterol value is independent of body weight. What changes with body weight in this model is the odds that a cholesterol value would exceed this marker.



On the other hand, quantile regression looks at how the 'marker' cholesterol values vary as a function of body weight, where each marker is the value at or below which a proportion q of the subjects with that body weight in the underlying population fall (so a proportion 1 − q of them have a higher value). You can think of these cholesterol values as markers for identifying what cholesterol values are 'high', but in this case each marker depends on the corresponding body weight; furthermore, the markers are assumed to change in a predictable fashion as the value of X changes (e.g., the markers tend to increase as X increases).
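
The two targets can be computed exactly in a toy population. The sketch below assumes, purely for illustration, that cholesterol given weight is normal with mean 150 + 0.5 × weight and standard deviation 20, and that weight is normal with mean 80 and standard deviation 15; every number is made up. Under these assumptions the true exceedance probability follows a normal (probit-type) curve, which a logistic regression would only approximate.

```python
from statistics import NormalDist

# Hypothetical population parameters (all invented for this illustration).
INTERCEPT, SLOPE, SD = 150.0, 0.5, 20.0
WEIGHT_MEAN, WEIGHT_SD = 80.0, 15.0
q = 0.75

# Unconditional cholesterol is also normal here, so its q-th quantile is
# a single fixed cutoff that does not depend on weight.
marginal = NormalDist(
    INTERCEPT + SLOPE * WEIGHT_MEAN,
    (SD ** 2 + (SLOPE * WEIGHT_SD) ** 2) ** 0.5,
)
fixed_cutoff = marginal.inv_cdf(q)


def prob_high(weight):
    """Logistic-style target: P(cholesterol >= fixed_cutoff | weight)."""
    return 1 - NormalDist(INTERCEPT + SLOPE * weight, SD).cdf(fixed_cutoff)


def marker(weight):
    """Quantile-regression target: q-th quantile of cholesterol given weight."""
    return NormalDist(INTERCEPT + SLOPE * weight, SD).inv_cdf(q)


print(f"fixed cutoff for 'high' cholesterol: {fixed_cutoff:.1f}")
for w in (60, 80, 100):
    print(f"weight {w}: P(high) = {prob_high(w):.3f}, marker = {marker(w):.1f}")
```

With the fixed cutoff, the probability of being 'high' changes with weight; with the weight-specific marker, the probability of exceeding it is 1 − q at every weight and it is the marker itself that moves.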















edited 34 mins ago

























answered 2 hours ago









Isabella Ghement








  • I agree with all that. Yet there does seem to be a similarity: both look at the qth quantile as a function of the same independent variables.
    – Peter Flom
    35 mins ago






  • Yes, but the difference is that one method (logistic regression) looks at the unconditional quantile, while the other (quantile regression) looks at the conditional quantile. Those two quantiles keep track of different things.
    – Isabella Ghement
    32 mins ago




























