OPRE 6301-SYSM 6303 Chapter 04 Slides PDF

Title OPRE 6301-SYSM 6303 Chapter 04 Slides
Course Statistics and data analysis
Institution The University of Texas at Dallas
Pages 23
File Size 2 MB
File Type PDF
Total Downloads 63
Total Views 138

Summary

Download OPRE 6301-SYSM 6303 Chapter 04 Slides PDF


Description

OPRE 6301/SYSM 6303 Statistics and Data Analysis

4-1

Chapter Four Numerical Descriptive Techniques

4-2

1

Measures of Central Location Arithmetic Mean Mean or Average N

x

i

Population Mean:



i 1

N n

x

i

Sample Mean:

x

i 1

n

xi = the data values labeled 1 through N or n 4-3

Measures of Central Location Mean Time Spent on the Internet 0

7

12

5

33

14

8

0

9

22

4-4

2

Measures of Central Location Mean Long-Distance Telephone Bill Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls

4-5

Measures of Central Location Median The middle value Median Time Spent on the Internet Must first sort the date into ascending order 0

0

5

7

8

9

12

14

22

33

4-6

3

Measures of Central Location Median Long-Distance Telephone Bill Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls Interpretation: half the telephone bills are below 26.905 and half are above 26.905 4-7

Measures of Central Location Mode Observation that occurs most frequently Mode Time Spent on the Internet Best to sort the date into ascending order 0

0

5

7

8

9

12

14

22

33

4-8

4

Measures of Central Location Mode Long-Distance Telephone Bill Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls Two issues with the mode: may not be “central” may not be unique 4-9

Histogram Shapes Symmetrical

Skewed

Positively or Right Skewed

Negatively or Left Skewed

3-10

5

Measures o Variability Range Largest observation – Smallest observation Advantage: simple Disadvantage: simple

4-11

Measures o Variability Range Set 1 4

4

4

4

4

50

39

50

Set 2 4

8

15

24

4-12

6

Measures of Variability Variance N

Population Variance:

 x   

2

i

 2

i1

N n

Sample Variance:

 x  x 

2

i

s2 

i 1

n 1

4-13

Measures of Variability Calculating Sample Variance 8 xi

4

9

xi  x 

11

3

xi  x 2

4-14

7

Measures of Variability Understanding the Sample Variance 8

4

9

11

3

4-15

Measures o Variability Variance of Long-Distance Telephone Bill Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls

4-16

8

Measures of Variability Standard Deviation Population St. Deviation:

  2

Sample St. Deviation:

s  s2

Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls 4-17

Measures of Variability Comparing Two Data Sets Let’s explore the interpretation of variance using Excel and Data File Xm04-08.xls

4-18

9

Empirical Rule

4-19

Empirical Rule example A histogram of returns on an investment is bellshaped and has a mean of 10% and a s = 8%. How is the distribution of the returns?

4-20

10

Chebysheff’ Theorem The proportion of observations in any sample or population that lie within k standard deviations of the mean is at least 1

1 for k  1 k2

4-21

Cheby

Theorem – example

Salaries of a computer store are positively skewed. Mean =$28,000, std.dev=$3,000. What can you say about these salaries?

4-22

11

Measures of Variability Coefficient of Variation ratio of standard deviation to mean Population Coeff. Of Variation:

CV 

 

Sample Coeff. Of Variation:

cv 

s x

4-23

Measures of Relative Standing Percentile the Pth percentile is the value for which P% are less than that value and (100-P)% are greater than that value

4-24

12

Measures of Relative Standing Quartiles Values that divide our data set into fourths Q1 = 25th percentile Q2 = 50th percentile Q3 = 75th percentile

4-25

Measures of Relative Standing The location of a percentile

LP  n  1

P 100

4-26

13

Measures of Relative Standing Calculate the 25th, 50th and 75th Percentiles for Time Spent on the Internet (The Quartiles) 0

0

5

7

8

9

12

14

22

33

4-27

Measures of Relative Standing Calculate the 25th, 50th and 75th Percentiles for Time Spent on the Internet(Quartiles) 0

0

5

7

8

9

12

14

22

33

4-28

14

Measures of Relative Standing Calculate the 25th, 50th and 75th Percentiles for Time Spent on the Internet(Quartiles) 0

0

5

7

8

9

12

14

22

33

4-29

Measures of Relative Standing Calculate the 25th, 50th and 75th Percentiles for Time Spent on the Internet(Quartiles) 0

0

5

7

8

9

12

14

22

33

4-30

15

Measures of Relative Standing Interquartile Range another measure of variability

IQR  Q3  Q1

4-31

Measures o Relative Standing IQR of Long-Distance Telephone Bills Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls

4-32

16

Box Plot Graph of five statistics Minimum and maximum data values 1st, 2nd and 3rd Quartiles Determine outliers using IQR 1.5 times IQR less than Q1 1.5 times IQR larger than Q3 4-33

Box Plot Recall our Long-Distance Telephone Bill Data from Data File Xm03-01.xls

4-34

17

Box Plot Using Box Plots for the Comparison of Multiple Data Sets Let’s explore this technique using Excel and Data File Xm04-15.xls

4-35

Measures of Linear Relationship Covariance N

Population Covariance:

  x  yi   y 

x

i

 xy 

i 1

N n

Sample Covariance:

 x  xy  y  i

s xy 

i

i 1

n 1

4-36

18

Measures of Linear Relationship Calculating Covariance

mean

x

y

4

24

9

27

2

18

5

23

xi  x  yi  y  xi  x yi  y 

4-37

Measures of Linear Relationship Coefficient of Correlation Population Correlation:

 xy 

Sample Correlation:

rxy 

 1    1

 xy  x y sxy s x sy

 1  r  1

4-38

19

Measures of Linear Relationship Covariance Coefficient of Correlation Let’s explore these calculations using Excel and Data File Xm04-08

4-39

Measures of Linear Relationship Least Squares Method Simple Linear Regression (simple => two variables only) Two variables Independent Variable Dependent Variable 4-40

20

Measures of Linear Relationship Which variable is which? Example: Salary vs. Grocery Bill

4-41

Measures of Linear Relationship Let’s explore the Least Squares Method using Data File Xm04-17

4-42

21

Measures of Linear Relationship Regression Line +e

-e

4-43

Measures of Linear Relationship Least Squares Method

yˆ  b0  b1x y  mx  b

4-44

22

Measures of Linear Relationship Least Squares Method

b1 

sxy s2x

b0  y  b1x 4-45

Measures of Linear Relationship Least Squares Method Let’s explore the Least Squares Method using Excel and Data File Xm04-17

4-46

23...


Similar Free PDFs