How do I find percentiles of data sets (Even vs odd)?


























Given the following data set with an even number of values:

$100, 100, 105, 113, 129, 132, 146, 152, 176, 200$



The value representing the 30th percentile, using the formula n(p/100) where n = sample size and p = percentile, is at position 10(0.30) = 3. So the 30th percentile of this data is 105.



Given the following data set with an odd number of values:

$100, 100, 105, 113, 129, 132, 146, 152, 176, 200, 300$



The value representing the 30th percentile, using the formula n(p/100) where n = sample size and p = percentile, is at position 11(0.30) = 3.3. So now what does one do?



I realize that this formula can yield a decimal even if the data set has an even number of values: say n = 36 and you want the 10th percentile, then 36(0.10) = 3.6.



In this situation, do you average the 3rd and 4th values? Or is it the 3rd value? Or the 4th value? How do you decide? What if the position were 3.2 or 3.7? Does that matter in choosing which value represents the given percentile?



Thanks for any help ahead of time.






























  • According to the Wikipedia article on percentiles, you use the ceiling. But this conflicts with the article on the median, which it says is the same as the 50th percentile and computes by interpolation. So it is either an interpolation, 105 + 0.3(113 - 105) = 107.4, or it is the "next biggest" value, 113.
    – David Peterson
    Dec 17 '14 at 5:52






  • Wikipedia's Quantile article gives nine different calculation methods. Personally I like R-2 for a population and R-7 for a sample.
    – Henry
    Feb 19 '17 at 20:31












  • I would expect that this very much depends on what you need to do with percentiles; although I can't think of an example, I expect if it isn't entirely clear how to resolve this based on what you're doing with the percentiles, then it probably doesn't make a whole lot of difference.
    – Vedvart1
    Jun 8 '17 at 5:40










  • From a script on probability and statistics (by L. Meier, ETHZ): For an empirical quantile $q_\alpha$ ($0<\alpha<1$) of an ordered data set: $q_\alpha = \frac{1}{2}\left(x_{n\alpha}+x_{n\alpha+1}\right)$ if $\alpha\cdot n$ is even, else $q_\alpha = x_{\lceil \alpha\cdot n \rceil}$. Here $n$ is the number of entries in the data set.
    – Nox
    Dec 21 '17 at 14:49
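
To make the disagreement in the comments above concrete, here is a rough Python sketch that evaluates three of the conventions mentioned on the question's two data sets. The function names `nearest_rank`, `interpolate_at_np` and `r7` are made up for illustration, `p` is assumed to be a whole-number percent, and `statistics.quantiles(..., method="inclusive")` is used on the understanding that it matches the R-7 definition.

```python
import math
import statistics

data10 = [100, 100, 105, 113, 129, 132, 146, 152, 176, 200]
data11 = data10 + [300]

def nearest_rank(sorted_data, p):
    """Ceiling rule: the value at 1-based position ceil(n * p / 100)."""
    n = len(sorted_data)
    k = -(-(n * p) // 100)          # integer ceiling of n * p / 100
    return sorted_data[k - 1]

def interpolate_at_np(sorted_data, p):
    """Linear interpolation at 1-based position n * p / 100."""
    n = len(sorted_data)
    pos = n * p / 100
    k = math.floor(pos)
    if k < 1:                       # position falls before the first value
        return sorted_data[0]
    if k >= n:                      # position falls at or past the last value
        return sorted_data[-1]
    frac = pos - k
    return sorted_data[k - 1] + frac * (sorted_data[k] - sorted_data[k - 1])

def r7(data, p):
    """p-th percentile from the standard library's 'inclusive' method."""
    return statistics.quantiles(data, n=100, method="inclusive")[p - 1]

for name, data in (("10 values", data10), ("11 values", data11)):
    print(name, nearest_rank(data, 30), interpolate_at_np(data, 30), r7(data, 30))
```

On the 11-value set the ceiling rule and the inclusive method both land on 113, while interpolating at position 3.3 gives roughly 107.4. On the 10-value set all three disagree, which is the point: the answer depends on which convention you pick, so state the convention along with the number.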









































statistics median percentile














edited Dec 17 '14 at 5:42







Riptyde4

















asked Dec 17 '14 at 5:23









Riptyde4





















1 Answer

































  1. Order all the numbers in the data set from smallest to largest.

  2. Multiply k percent (that is, k/100) by the total number of values, n.
    3a. If the result from Step 2 is a whole number, go to Step 4. If it is not a whole number, round it up to the next whole number and go to Step 3b.
    3b. Count the values in the data set from left to right (from smallest to largest) until you reach the position from Step 3a. The value at that position is the kth percentile.

    4. Count the values in the data set from left to right until you reach that whole number. The kth percentile is the average of the value at that position and the next value in the data set.




For the data set with an even number of values,

100, 100, 105, 113, 129, 132, 146, 152, 176, 200

you correctly calculated 3, which is a whole number. Step 4 says to count from left to right until you reach the third value, which is 105, and then to average it with the next value in the set, which is 113: (105 + 113)/2 = 109. Therefore, by this method, the 30th percentile of this data is 109.



Now for the set 100, 100, 105, 113, 129, 132, 146, 152, 176, 200, 300, you correctly computed 3.3, so according to Step 3a you round 3.3 up to 4 and go to Step 3b. Step 3b says to count from left to right until you get to the 4th value, which is 113. So 113 is the 30th percentile in this case.
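
If it helps to check the arithmetic, the steps above can be sketched in a few lines of Python. This is only an illustration of this particular rule, not the only accepted definition; the name `percentile_by_steps` is made up here, and `k` is assumed to be a whole-number percent so that the whole-number test in Step 3a is exact.

```python
def percentile_by_steps(values, k):
    """k-th percentile by the steps above; k is a whole-number percent, 0 < k < 100."""
    data = sorted(values)                      # Step 1: order the values
    n = len(data)
    if n == 0 or not 0 < k < 100:
        raise ValueError("need a non-empty data set and 0 < k < 100")
    # Step 2: n * k / 100, split into its whole part and remainder.
    whole, remainder = divmod(n * k, 100)
    if remainder == 0:
        # Step 4: average the values at 1-based positions `whole` and `whole + 1`.
        return (data[whole - 1] + data[whole]) / 2
    # Steps 3a/3b: round up to position `whole + 1` and take that value.
    return data[whole]

even_data = [100, 100, 105, 113, 129, 132, 146, 152, 176, 200]
odd_data = even_data + [300]

print(percentile_by_steps(even_data, 30))   # 109.0
print(percentile_by_steps(odd_data, 30))    # 113
```

For the n = 36 case from the question, 36(0.10) = 3.6 is not a whole number, so the same function returns the 4th ordered value.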
















answered Jun 8 '15 at 23:35

Hope It Helps





























