Title | Data analysis notes - idk what it is |
---|---|
Course | math unit 3 |
Institution | St Joseph's College, Echuca |
Pages | 87 |
File Size | 1 MB |
File Type | |
Total Downloads | 99 |
Total Views | 143 |
idk what it is...
1
Displaying and 1A classifying data
Data and Variables In a data set, we call the qualities or quantities about which we record information variables Categorical variables Represent characteristic or qualities of people or things (eye colour fitness levels) - Nominal variables Have data values that can be used to group individuals according to particular characteristic (sex) - Ordinal variables Have data values that can be used both groups and other individuals according to a particular characteristic (fitness level) Numerical variables Are used to represent quantities, things that we count or measure - Discrete variables Represent quantities that are counted (number of car) - Continuous variables Represent quantities that are measured rather than counted (length, weight) Comparing numerical and categorical variables
1
1B Displaying and describing the distribution of categorical variables The frequency tables A frequency table is a listing of values a variable taken in a dataset, along with how often (frequently) each value occurs. Frequency can be recorded as a: - Number; the of times a value occurs, or - Percentage; the percentage of time a value occurs (percentage frequency) Percent = count/total count x 100%
Solution: Percentage = group/category x 100 = % total
6 ×100 =54.5 11
5 ×100 =45.5 11
CAS = ctrl + enter to get 0.0 instead of
0 How has forming a frequency table helped? The process of forming a frequency table for a categorical variable: - displays the data in a compact form - tells us something about the way the data values are distributed (the pattern of the data). Solution:
2
1B Displaying and describing the distribution of categorical variables Stacked or segmented bar charts In a segmented bar chart, the bars are stacked one on top of another to give a single bar with several parts or segments. The lengths of the segments are determined by the frequencies. The height of the bar gives the total frequency Solution:
The mode The mode is the most frequently occurring value or category. Some guidelines for writing a report describing the distribution of a categorical variable Briefly summarise the context in which the data were collected including the number of individuals involved in the study. If there is a clear modal category, ensure that it is mentioned. Include frequencies or percentages in the report. Percentages are preferred. If there are a lot of categories, it is not necessary to mention every category, but the modal category should always be mentioned.
solution: The climate types of 23 countries were classified as being, ‘cold’, ‘mild’ or ‘hot’. The majority of the countries, 60.9%, were found to have a mild climate. Of the remaining countries, 26.1% were found to have a hot climate, while 13.0% were found to have a cold climate. 3
1C Displaying and describing the distributions of numerical variables The grouped frequency distribution
Solution:
The histogram and its construction The histogram is a graphical display of the information in the grouped frequency table. Solution:
4
5
Range = largest value- smallest value Median’s the middle number What to look for in a histogram - Shape and outliers Symmetric distribution
-
skewed distributions
Outliers Is any data values that stand out from the main body of data Centre Centre of a distribution or the median Spread
Solution: A)
B)
C) Using a histogram to describe the distribution of a numerical variable in the context of its data
6
1D Using a log scale to display data Properties of logs to the base 10 1 If a number is greater than one, its log to the base 10 is greater than zero. 2 If a number is greater than zero but less than one, its log to the base 10 is negative. 3 If the number is zero, then its log is undefined.
Solution:
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87...