# How to answer box plot questions

Questions that may be asked about box plots include: 1) you are given a list of numbers and asked to draw a box plot; 3) you are asked to draw a box plot from a cumulative frequency diagram.

Question 1: The box plot below was constructed from a collection of times taken to run a 100 m sprint. Use this to answer the following questions.

So, now that we have addressed that little technical detail, let's look at an example to see what kinds of questions we can answer using a boxplot. The reason I am showing you this image is that looking at a statistical distribution is more commonplace than looking at a box plot.

These last two questions show you that some plots, like boxplots and histograms, are designed to give you a big-picture idea of a data set. Through this, though, you lose some information about individual values. We would need to see a dotplot or a stemplot (or the data set itself) to be able to answer this question.

FOCUS QUESTION: How can I compare the distributions for data sets that have outliers? A typical exam version of this asks you to compare the distributions of the marks in two tests.

Copyright 2010-2017 MathBootCamps.

Box plots can be created from a list of numbers by ordering the numbers and finding the median and the lower and upper quartiles. Start by gathering your data. The numbers you need are the median, the upper and lower quartiles, and the minimum and maximum data values (the extremes). 75% of the data set values are above the lower quartile. If a data set has no outliers (unusual values in the data set), the boxplot is made up of just these five values, and you simply won't see any separate outlier points.
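The ordering-then-quartiles procedure can be sketched in Python. This is a minimal illustration, not a method taken from the lesson: it assumes the "median of each half" rule for quartiles (one of several conventions in use), and the function name `five_number_summary` and the sprint times are made up for the example.

```python
from statistics import median

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum).

    Assumption: quartiles are the medians of the lower and upper halves,
    leaving out the middle value when the count is odd. Other quartile
    conventions can give slightly different answers.
    """
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    lower = ordered[:half]           # values below the median position
    upper = ordered[half + n % 2:]   # values above the median position
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])

# Hypothetical sprint times in seconds, invented for illustration.
times = [11.2, 12.5, 10.9, 13.1, 12.0, 11.7, 12.8]
print(five_number_summary(times))  # (10.9, 11.2, 12.0, 12.8, 13.1)
```

The five values returned are exactly the ones marked on the number line when drawing the box plot by hand.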
Reading box plots. The central line marks the data set median. Before we answer these, notice that this particular boxplot is vertical instead of horizontal. Since every other line is labelled and the scale counts by 5, the in-between lines must represent 2.5°. In other words, working out the scale first might help you understand a boxplot.

There are no stars or other points past the main line in the boxplot, so no, there are no outliers in this data set.

When making a plot of your own data set, you must consider whether this is important or not and select your plot accordingly. Another question where it would be interesting to know the answer: what does it mean for the data to be skewed left or right?

Two exam questions where students first have to plot the cumulative frequency graph, then interpret it by finding the median and interquartile range, and then construct a box plot from this. Use this information and the cumulative frequency to draw a box plot on the grid below. Tracing paper may be used. Answers attached.

Note: the actual definition of the 95% confidence interval is that …

For the data set you gave, 29 and 34 are the two middle numbers since it's an even data set. The median is their average: 29 + 34 = 63, and 63 / 2 = 31.5.
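The even-count median rule can be checked with a few lines of Python. This is only a sketch: the function name `median_of` is hypothetical, and the data set is invented apart from the two middle values 29 and 34 quoted in the text.

```python
def median_of(values):
    """Middle value for an odd count, average of the two middle values for an even count."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

# Hypothetical data set: only the middle pair (29, 34) comes from the text.
data = [21, 25, 29, 34, 40, 47]
print(median_of(data))  # 31.5
```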
Box plots are a graphical view of a set of data and show where the minimum, maximum, median, and first and third quartiles are. When the data set is placed in order from smallest to largest, the quartiles divide the data set into quarters. The approximate definition of the 25th percentile is the value that 25% of the data set values fall below. The distribution does not have to be symmetric, so the median line could fall anywhere in the box.

In the following lesson, we will look at how to use this information and the basic form of a boxplot to answer questions, therefore helping you understand how to read a boxplot. "I don't know how to read my box plot!" is a common complaint, and deciding whether a distribution is skewed right or left is a frequent difficulty.

(a) Are there any outliers in this data set?

(c) Complete the sentence: "About 25% of days in May had high temperatures warmer than about ______ °F."

We won't always be able to give an exact answer from the graph, depending on the scale. This question illustrates one weakness of a boxplot; a weakness that is shared with histograms.

Problem 1: The box plots show the distribution of times spent shopping by two different groups.

The Daphne and Santa Cruz data sets are of unequal size. For unequal size data sets, another option is to provide a single column of data along with a grouping vector. In a notched box plot, if the notches of two boxes do not overlap, you can conclude with 95% confidence (0.05 significance level) that the medians are different.
To construct a box plot, use a horizontal or vertical number line and a rectangular box. Depending on the software used, you may see either configuration. Plot the points of the five values above a number line, then draw vertical lines through the lower quartile, median and upper quartile. The median is the middle number for an odd data set, and is the average of the two middle numbers in an even data set. If there are no outliers, you simply won't see those points.

A fourth type of question: 4) compare distributions by comparing 2 box plots.

Making a box plot itself is one thing; understanding the do's and (especially) the don'ts of interpreting box plots is a whole other story. The following box plot represents data on the GPA of 500 students at a high school. What does it mean to have the first and second quartiles so close together, while the second to third quartiles are far apart? Note that the distribution of the values on either side of the median doesn't have to be symmetric.

Without the actual data set, we will often have to estimate. The minimum looks just about 47.5°, so we will estimate it at 48°, and as a final answer we can say "The lowest observed temperature in May was about 48°F." Similarly, I would estimate the median at 64°F.

The MATLAB lesson works through the following examples:

- EXAMPLE 1: Load the Fisher iris data (comes with MATLAB)
- EXAMPLE 2: Compare the distributions of sepal and petal lengths using box plots
- EXAMPLE 3: Draw a box plot of the sepal lengths by species
- EXAMPLE 4: Draw a notched box plot of the sepal widths
- EXAMPLE 5: Load the Daphne and Santa Cruz beak size data
- EXAMPLE 6: Create a labeled vector of beak sizes for plotting
- EXAMPLE 7: Create a box plot of unequal length data sets using labeled data

The figure titles are 'Comparison of sepal and petal lengths for Fisher iris data', 'Comparison of three species in the Fisher iris data', and 'Geospiza fortis from nearby islands in the Galápagos'. The plus signs mark individual outliers. With labeled data, each value carries a label, and the data are then grouped into boxes by label.
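The labeled-data idea (one column of values plus a matching label for each value) can be sketched in Python. This is a minimal illustration rather than the MATLAB code from the lesson: the function name `group_by_label` is hypothetical and the beak sizes are invented, chosen only to show two groups of unequal size.

```python
from collections import defaultdict

def group_by_label(values, labels):
    """Split one column of values into per-label lists, one list per box."""
    if len(values) != len(labels):
        raise ValueError("values and labels must have the same length")
    groups = defaultdict(list)
    for value, label in zip(values, labels):
        groups[label].append(value)
    return dict(groups)

# Hypothetical beak sizes (mm); the two groups deliberately differ in size.
sizes = [9.1, 10.4, 9.8, 11.0, 10.1]
islands = ["Daphne", "Santa Cruz", "Daphne", "Santa Cruz", "Santa Cruz"]
print(group_by_label(sizes, islands))
# {'Daphne': [9.1, 9.8], 'Santa Cruz': [10.4, 11.0, 10.1]}
```

Because each group is its own list, the groups do not need to have the same number of values, which is the point of using labels for unequal size data sets.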
Think of an example (in words) where the data might fit into the above box plot. What information is missing on this graph and on the box plots?

The boxplot below shows the high temperatures in Anchorage, Alaska in May 2014*. Now to actually answer the question! The median is shown by the line inside the box of the boxplot. For the range, we need to subtract the smallest value from the largest. A boxplot does not show the order of the observations; to see that, we would need to use a timeplot or simply a table.

(2 Marks) The box plot below shows the amount spent by people over the Christmas period in 2015.

(a) Calculate the interquartile range for the amount of time groups spend in this open-air restaurant. Explain why the interquartile range may be a better measure of spread than the range.

The image above is a comparison of a boxplot of a nearly normal distribution and the probability density function (pdf) for a normal distribution. Each diagram is a box plot, and the smallest and largest data values label the endpoints of the axis. There is no universally accepted formula for computing percentiles.

A cell array is a MATLAB data structure designed to hold a collection of elements of different sizes. This lesson was written by Kay A. Robbins of the University of Texas at San Antonio.

Box plots are particularly good for distributions that have outliers. If there ARE outliers, a boxplot marks them separately from the main plot. A value is an outlier if it is greater than $Q_3 + 1.5\,(Q_3 - Q_1)$, that is, more than 1.5 times the interquartile range above the third quartile.
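The outlier rule can be turned into a short check in Python. This is a sketch under stated assumptions: the text gives only the upper cutoff, so the lower cutoff $Q_1 - 1.5\,(Q_3 - Q_1)$ used here is the usual companion rule rather than something quoted from the lesson, and the function names and sample values are hypothetical.

```python
def outlier_fences(q1, q3):
    """Return (lower, upper) cutoffs placed 1.5 * IQR beyond the quartiles."""
    iqr = q3 - q1
    # The lower fence is an assumption: the source text states only the upper one.
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

def find_outliers(values, q1, q3):
    """List the values lying strictly outside the fences."""
    lower, upper = outlier_fences(q1, q3)
    return [v for v in values if v < lower or v > upper]

# Hypothetical quartiles and data, invented for illustration.
print(outlier_fences(10, 20))                      # (-5.0, 35.0)
print(find_outliers([1, 12, 15, 18, 40], 10, 20))  # [40]
```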
The results you get from different software will vary.

In the temperature example, the median for the data set is between 62.5°F and 65°F, and a bit closer to 65°F than not, so we estimate it at about 64°F. The top of the box marks the 75th percentile, so about 25% of days in May had high temperatures warmer than about 66°F. As noted above, outliers (if there are any) will be shown by stars or points off the main plot.

A 95% confidence interval is a statistical measure of how reliably the sample data represents an underlying population: roughly, you can be 95% certain that the actual median for the underlying population is within the marked interval.

This lesson was written by Kay A. Robbins of the University of Texas at San Antonio and last modified on 08-Nov-2012.
A box plot is a graph that visually represents data from a five-number summary. The bottom of the box marks the 25th percentile, and the length of the box is the interquartile range (IQR). The median line is not always in the middle of the box; it depends on the shape of the distribution.

In the temperature example, the third quartile is about 66°. From the boxplot alone there is no way to tell which temperatures are from which dates.

The two columns in this case hold sepal lengths and petal lengths, respectively. The image was taken by Danielle Langlois in July 2005 and is available under public license at http://commons.wikimedia.org/wiki/File:Iris_versicolor_3.jpg.

Question 1: Sameena recorded the times, in minutes, some girls took to do a jigsaw puzzle.
Work out the range and interquartile range of the distribution of amount spent.
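Both measures of spread come straight from values read off the box plot, which a short Python sketch makes concrete. The function name `spread` and the amounts are hypothetical; no figures are given in the text, so the numbers are invented purely to show the arithmetic.

```python
def spread(minimum, q1, q3, maximum):
    """Return (range, interquartile range) from box plot readings."""
    data_range = maximum - minimum   # largest value minus smallest value
    iqr = q3 - q1                    # width of the box
    return data_range, iqr

# Hypothetical readings in pounds, invented for illustration.
data_range, iqr = spread(minimum=50, q1=120, q3=300, maximum=520)
print(data_range, iqr)  # 470 180
```

The range uses the two extreme values, so a single unusual value changes it, while the interquartile range depends only on the middle half of the data.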