X
Create
Sign in

  • Movies
  • TV Shows
  • Music
  • Speeches
  • Gaming
  • Education
  • Beauty
  • Sports
  • Technology
  • Science
  • Health
  • Travel
  • Transportation
  • Career & Work
  • Hobbies
  • Animals
  • Home & Garden
  • Holidays
  • Relationships
  • Parenting
  • Food
  • Culture
  • Finance
  • Business
  • Legal
  • Arts

Describing Data - Box plot

This lesson explains how to create a box plot from a set of data. Two examples are provided. One example has outliers. Site: ******
#Pie Chart Examples #Pie Chart Example #Education
Edit
140 views
1 editor
edited 1+ month ago
Home
Share on facebook Share on twitter Share on Google+
Tip: Highlight text to annotate itX
- WELCOME TO A LESSON ON BOX PLOTS, ALSO CALLED BOX AND WHISKER GRAPHS. FOR VISUALIZING DATA THERE IS A GRAPHICAL REPRESENTATION OF A FIVE-NUMBER SUMMARY CALLED A BOX PLOT OR BOX AND WHISKER PLOT. TO CREATE A BOX PLOT A NUMBER LINE IS DRAWN FIRST, WHICH WE SEE HERE BELOW THE BOX PLOT. A BOX IS DRAWN FROM THE FIRST QUARTILE TO THE THIRD QUARTILE WHICH WOULD BE THIS BOX HERE. QUARTILE 1 = 16, AND QUARTILE 3 = 28. A LINE IS DRAWN THROUGH THE BOX AT THE MEDIAN, WHICH WOULD BE THIS LINE HERE. AND NOTICE HOW OUR MEDIAN IS 20, AND THEN WHISKERS ARE EXTENDED OUT TO THE MINIMUM AND MAXIMUM VALUES. SO ON THE LEFT, NOTICE HOW THIS WHISKER IS DRAWN OUT TO THE MINIMUM OF 4. WE HAVE ANOTHER WHISKER ON THE RIGHT DRAWN OUT TO THE MAXIMUM OF 32. WE SHOULD RECALL FROM THE FIVE-NUMBER SUMMARY THIS DIVIDES THE DATA INTO FOUR QUARTERS WHERE EACH QUARTER REPRESENTS 25% OF THE TOTAL DATA. SO 25% OF THE DATA FALLS IN THIS INTERVAL HERE BETWEEN THE MINIMUM AND Q1. THEN THERE'S 25% OF THE DATA BETWEEN Q1 AND THE MEDIAN. 25% BETWEEN THE MEDIAN AND Q3, AND 25% BETWEEN Q3 AND THE MAXIMUM. THERE IS ALSO SOMETHING CALLED OUTLIERS WHEN CREATING BOX PLOTS, WHICH WE'LL TALK ABOUT AFTER OUR FIRST EXAMPLE. SO THE KEY TO CREATING A BOX PLOT IS TO FIND THE FIVE-NUMBER SUMMARY OR THE MINIMUM Q1, MEDIAN Q3 AND MAXIMUM. IN OUR LESSON ON THE FIVE-NUMBER SUMMARY WE DID DISCUSS HOW TO FIND THESE VALUES, BUT REMEMBER THERE ARE SEVERAL WAYS TO FIND THE QUARTILES OR Q1 AND Q3, BUT IN THIS CLASS WE'LL BE USING THE LOCATOR OR PERCENTILE METHOD TO FIND Q1 AND Q3. IF YOU USE THE TI-83 OR 84 THE METHOD IS DIFFERENT AND CAN YIELD DIFFERENT RESULTS. SO JUST FOR A QUICK REVIEW, USING THE LOCATOR METHOD TO FIND Q1, WE FIND L = 0.25 x N, WHERE N IS THE NUMBER OF DATA VALUES. IF L IS A DECIMAL WE ROUND UP TO THE NEXT WHOLE NUMBER AND USE A DATA VALUE IN THE ROUNDED UP POSITION AS Q1. IF L IS A WHOLE NUMBER THEN WE FIND THE MEAN OR THE AVERAGE OF THE DATA VALUES IN THE L AND L + 1th POSITIONS. WE USE A SIMILAR METHOD TO FIND Q3 EXCEPT WE USE L = 0.75 x N INSTEAD OF 0.25 x N. SO IN OUR LESSON ON THE FIVE-NUMBER SUMMARY WE TOOK THIS DATA HERE AND FOUND THESE VALUES FOR OUR FIVE-NUMBER SUMMARY, WHICH WE CAN NOW USE TO CREATE A BOX PLOT. BUT LET'S QUICKLY REVIEW HOW WE FOUND THESE VALUES. FIRST WE DECIDED TO WRITE THE DATA VALUES HORIZONTALLY IN ORDER FROM LEAST TO GREATEST, AS WE SEE HERE, WHERE THE MINIMUM WOULD BE SIX. THE MAXIMUM WOULD BE 97, AND THERE ARE 15 DATA VALUES OR AN ODD NUMBER OF DATA VALUES, SO THE MEDIAN IS IN THE LIST. IT WOULD BE THE VALUE IN THE EIGHTH POSITION, WHICH IS 49. THERE ARE SEVEN VALUES BELOW THIS, SEVEN VALUES ABOVE THIS. THEREFORE, THIS IS THE MEDIAN. AND THEN FINALLY TO FIND THE QUARTILES WE'LL BEGIN WITH Q1. SO WE FIND L = 0.25 x 15. AGAIN, THIS IS 15 BECAUSE WE HAVE 15 DATA VALUES. THIS GIVES US A DECIMAL OF 3.75, WHICH WE ROUND UP TO THE NEXT WHOLE NUMBER OF 4. THE DATA VALUE IN THE 4th POSITION WILL BE Q1, SO 18 IS QUARTILE 1. AND THEN FOR Q3 WE FIND L, WHICH IS 0.75 x 15, WHICH IS 11.25, WHICH ROUNDS TO 12. THE VALUE IN THE 12th POSITION WILL BE Q3. SO 8, 9, 10, 11, 12, WHICH IS 82. THESE ARE THE FIVE VALUES WE'LL USE TO CREATE OUR BOX PLOT. SO THE FIRST STEP IS TO CREATE A BOX USING Q1 AND Q3, WHICH IS HERE. NEXT, WE DRAW A LINE AT THE MEDIAN, WHICH IS HERE AT 49. AND THEN WE DRAW A WHISKER DOWN TO THE MINIMUM, AND ANOTHER WHISKER UP TO THE MAXIMUM, WHICH WOULD BE HERE TO 6 AND HERE TO 97. THIS IS OUR BOX AND WHISKER PLOT FOR THE GIVEN DATA. BUT WE DO HAVE TO MODIFY OUR BOX PLOT IF WE DO HAVE OUTLIERS IN THE DATA. SO NOW LET'S TALK ABOUT OUTLIERS. SOMETIMES THE MOST INTERESTING POINTS OF A DATA SET ARE THE POINTS THAT DO NOT SEEM TO BELONG, MEANING THEY SEEM TO DIFFER BY A SUBSTANTIAL AMOUNT FROM THE REST OF THE DATA. WE CALL THESE POINTS OUTLIERS. OUTLIERS ARE VALUES THAT ARE BEYOND 1.5 x THE INNER QUARTILE RANGE, WHICH WOULD BE Q3 - Q1. WE INDICATE OUTLIERS BY PLOTTING THE POINTS RATHER THAN EXTENDING THE WHISKER OUT TO THE OUTLIERS. NOTICE HERE Q3 - Q1 WOULD BE 82 - 70 OR 12, AND 1.5 x 12 = 18. SO THE VALUES THAT ARE 18 UNITS ABOVE 82 OR BELOW 70 WOULD BE CONSIDERED OUTLIERS. NOTICE HOW THIS VALUE HERE LOOKS LIKE IT'S 35, WHICH IS MORE THAN 18 UNITS BELOW Q1, THEREFORE, IT'S AN OUTLIER. SO INSTEAD OF EXTENDING THE WHISKER ALL THE WAY OUT TO 35, WE STOP THE WHISKER AT THE NEXT LOWEST VALUE AND ONLY PUT A SINGLE POINT AT THE VALUE OF 35. THIS INDICATES THAT WHILE MOST OF THE DATA IS BETWEEN 62 AND 99, THERE IS ONE OUTLIER WAY DOWN HERE AT 35. AND IT IS POSSIBLE TO HAVE MORE THAN ONE OUTLIER. WE COULD HAVE ANOTHER POINT HERE AT 40, SINCE THIS VALUE WOULD STILL BE MORE THAN 18 UNITS BEYOND 70. SO LET'S TAKE A LOOK AT ONE MORE EXAMPLE. WE WANT TO FIND THE FIVE-NUMBER SUMMARY OF THE DATA AND THEN CREATE A BOX PLOT. FIRST NOTICE HOW I ORDERED THE DATA FROM LEAST TO GREATEST HORIZONTALLY. THE MINIMUM IS 34, AND THE MAXIMUM IS 93. WE HAVE A TOTAL OF 12 DATA VALUES OR AN EVEN NUMBER OF DATA VALUES, SO THE MEDIAN WILL BE THE AVERAGE OR MEAN OF THE 6th AND 7th DATA VALUES, WHICH WOULD BE THESE TWO VALUES HERE. WELL, 80 + 83 DIVIDED BY 2 = 81.5, SO THE MEDIAN IS 81.5. NEXT, USING THE LOCATOR METHOD, WE'LL FIND Q1 AND Q3 AND BEGIN BY DETERMINING L = 0.25 x N FOR Q1. N IS 12. NOTICE HERE THIS GIVES US 3. BECAUSE L IS A WHOLE NUMBER WE USE THE AVERAGE OF THE VALUES IN THE 3rd AND 4th POSITIONS FOR Q1. SO Q1 IS GOING TO BE EQUAL TO, LET'S SEE, THE VALUES IN THE 3rd AND 4th POSITIONS WOULD BE 68 AND 75. SO 68 + 75 DIVIDED BY 2 GIVES US 71.5 FOR Q1. NOW, FOR Q3 WE'LL HAVE L = 0.75 x 12 = 9. SO TO FIND Q3, BECAUSE L IS A WHOLE NUMBER, WE'LL TAKE THE AVERAGE OF THE VALUES IN THE 9th AND 10th POSITIONS, WHICH WOULD BE 88 AND 90. SO Q3 = 88 + 90 DIVIDED BY 2 = 89. NOW, WE'RE ALMOST READY TO MAKE THE BOX PLOT, BUT LET'S CHECK FOR OUTLIERS FIRST. WE WANT TO DETERMINE 1.5 x THE INNER QUARTILE RANGE, SO 1.5 x Q3 - Q1. SO THIS COMES OUT TO 1.5 x 17.5 = 26.25. SO WE'RE LOOKING FOR VALUES THAT ARE MORE THAN 26.25 UNITS BELOW Q1 OR ABOVE Q3. AND NOTICE HOW 34 IS MORE THAN 26.25 UNITS BELOW 71.5, THEREFORE, 34 IS AN OUTLIER. SO I'LL GO AHEAD AND HIGHLIGHT THIS. WE'LL ACTUALLY REPRESENT 34 AS A POINT. NOTICE HOW THE NEXT VALUE WOULD BE 65, SO WE'LL ACTUALLY EXTEND OUR WHISKER OUT TO 65. NOTICE HOW 65 IS NOT AN OUTLIER, SO WE'LL EXTEND THE WHISKER OUT TO 65 AND PUT A POINT AT 34. AND NOW WE CAN GO AHEAD AND MAKE OUR BOX PLOT. WE CREATE THE BOX USING Q1 AND Q3, WHICH IS HERE, 71.5 TO 89. DRAW A LINE AT THE MEDIAN, WHICH IS 81.5 HERE. TO CREATE THE LOWER WHISKER WE ONLY EXTEND THE WHISKER DOWN TO 65, AS WE SEE HERE. AND THEN PUT THIS POINT HERE AT 34, BECAUSE THIS IS AN OUTLIER. AND THE UPPER WHISKER WILL GO OUT TO 93, WHICH WE SEE HERE. AND THAT'S HOW WE CREATE BOX PLOTS WITH AND WITHOUT OUTLIERS. I HOPE YOU HAVE FOUND THIS HELPFUL.  
Activity
  • Activity
  • Annotations
  • Notes
  • Edits
Sort
  • Newest
  • Best
deicy annotated1+ month ago

This lesson explains how to create a box plot from a set of data. Two examples are provided. One example has outliers. Site: ****** ...

#Pie Chart Examples #Pie Chart Example #Education
Permalink Edit Editors
Share

Share this annotation:

deicy edited1+ month ago

Describing Data - Box plot

English Worldwide About Copyright Privacy Terms
© 2023 Readable
Photos Media Bookmark
X Annotate