Assignment 2 Inferential Statistic PDF

Title Assignment 2 Inferential Statistic
Author Mai Linh
Course Business Statistics ECON1030
Institution Royal Melbourne Institute of Technology University Vietnam
Pages 21
File Size 681.2 KB
File Type PDF
Total Downloads 230
Total Views 528

Summary

ECON1193 – BUSINESS STATISTICAssignment 2 – Individual Case Study – Inferential StatisticTopic 2: Age dependency ratio, old (%working-age population)Student’s name Mai Minh NgocStudent ID SLecturer: Pham Thi Minh ThuyCampus: RMIT Hanoi Summarize descriptive statistic. In the previous assignment, I u...


Description

ECON1193 – BUSINESS STATISTIC Assignment 2 – Individual Case Study – Inferential Statistic Topic 2: Age dependency ratio, old (%working-age population)

Student’s name Student ID Lecturer: Campus:

Mai Minh Ngoc S3730089 Pham Thi Minh Thuy RMIT Hanoi

1. Summarize descriptive statistic. In the previous assignment, I used Descriptive Statistic to conclude about the ratio of old-age dependency (% working-age population) of 2 nations that Brazil and Kenya. This shows an evidence in the tables: Central Tendency

Mean

Median

Mode

Brazil

8.103964641

7.552690108

#N/A

Kenya

5.275461043

5.251903511

#N/A

Table 1: Central Tendency of Brazil and Kenya Coefficient of Country

Range

Interquartile Range

Variance

Standard Deviation

Variation

Brazil

5.376663755

2.605952993

2.650233071

1.627953645

20.088%

Kenya

1.92380989

0.97702102

0.35234497

0.593586531

11.25%

Table 2: The measures of Variation of Brazil and Kenya From two tables above about the age dependency ratio, old in Brazil and Kenya from 1980 to 2016 in Assignment 1. This assignment answered the topic question is “Is the age dependency ratio and income are related?”, which the answer is Yes. We know that, Brazil have higher income than Kenya, and furthermore, all the number of Brazil much hinger than Kenya about the Mean, Median, Variance, Standard Deviation and CV. With the table 3, all the percentage of age dependency and income always depend to each other. The higher income, the higher ratio of old age dependency and vice versa.

P(HI)

0.4

P(HI|LO)

0

P(HI|HO)

1

P(MI)

0.457142857

P(MI|LO)

0.761904762

P(MI|HO)

0

P(LI)

0.142857143

P(LI|LO)

0.238095238

P(LI|HO)

0

Table 3. The probability of the percentage age dependency and GNI. Word count: 134 2. Confidence intervals. a. Calculate confidence intervals. In this section, I’m going to calculate about Confidence Interval of age dependency ratio, old (% working-age population), gross national income (GNI) per capita (current US$) and life expectancy at birth, total (years) of 35 countries. The Sample size of Dataset 2 given is 35 countries, n = 35. In this part 2a, all calculation will choose Significant level at 5%, � = 0.05 → confidence level is 95% (1 – � = 0.95). In this part, we know about the information that: -

n = 35

-

population standard deviation � is unknown →

So, I’m going to use t-table. o Substitute the sample standard deviation, S o n = 35, degree of freedom, d.F = n - 1 = 35 - 1 = 34. o Lower, Upper tail =



α 2

=

0.05 2

= 0.025

tn-1, (α)⁄2 = 2.0322

Because of σ is unknown, so confidence interval estimated by: �=

´X ± tn-1

S √n

t is the critical value of the t distribution, o d.F = n-1

o Each tail = 

α 2

Age dependency ratio, old (% working-age population) o n = 35 o d.F = n – 1 = 35 – 1 = 34 o tn-1, (α)⁄2 = 2.0322 →

�=

´X + tn-1

�=

´X - tn-1

S √n

= 16.121 + (2.0322).

10.280 √35

=

19.652 S √n

= 16.121 - (2.0322).

10.280 √35

= 12.590

12.590 ≤ � ≤ 19.6522 As a result, we are 95% confident that the true mean age dependency ratio, old is between 12.590 and 19.652. This suggest that we are 95% sure that the ratio of elder (% working-age population) in the world in 2015 is between 12.590% and 19.652%. 

Gross National Income (GNI) per capita (Current US$) o n = 35 o d.F = n – 1 = 35 – 1 = 34 o tn-1, (α)⁄2 = 2.0322 →

�=

´X + tn-1

�=

´X - tn-1

S √n

= 19594 + (2.0322).

23429.862 √ 35

=

27642.262 S √n

= 19594 - (2.0322).

11545.737 ≤ � ≤ 27642.262

23429.862 √ 35

= 11545.737

We are 95% confident that the true mean of GNI per capita (current US$) is between 11545.737 and 27642.262. This is suggesting that in 2015, we are sure that 95% average person in the world earn US$ 11545.737 and US$ 27642.262. 

Life expectancy at birth, total (years) o n = 35 o d.F = n – 1 = 35 – 1 = 34 o tn-1, (α)⁄2 = 2.0322 →

�=

´X + tn-1

�=

´X - tn-1

S √n S √n

= 73.328 + (2.0322).

= 73.328 - (2.0322).

9.098 √ 35

9.098 √ 35

= 76.453

= 70.202

70.202 ≤ � ≤ 76.453 As a result, we are 95% confident that the mean of life expectancy at birth of the world in 2015 is between 70.202 and 76.453. This suggest we are 95% sure that an average newborn expectancy in the world in 2015 is between 70.202 and 76.453 years.

b. Discuss, what assumption are required to calculates these confidence intervals. The sample size of dataset 2 given is 35, n = 35 is large enough (n ≥ 30), we can apply Central Limit Theorem and sampling distribution for the mean can be approximated by normal distribution. Hence, we can use the normal distribution and the t-table without any assumptions of the population.

c. Discuss the possible impact on the confidence intervals, if the sample size increase double According to the formula of confidence interval is �1 =

´X ± tn-1

S . (with n1 = n) √n

With confidence interval �1 =

´X

The new confidence interval is �2 = Width confidence interval �2 =

´X

- tn-1

S √n

´X ± tn-1 + tn-1

´X + tn-1

-

S √2 n

S √n

= 2.(tn-1

S ) √n

S . (with n2 = 2n) √2 n -

´X + tn-1

S √2 n

= 2.(tn-1

S √2 n

) If the sample size (n increases, the standard error

S √n

will decrease because (n) is

a denominator. The data is more concentrated around the mean. The width confidence interval (± tn-1

S ) will decrease √n

√ 2 times. Therefore, the confidence

intervals will be more accurate. Interpretation: Increasing the sample size mean we are studying 70 nations instead of 35. Hence, the result of the world ratio of old-age dependency is represent better of all 195 countries in the world. A 95% CI, n=70 will be narrower than a 95%, n=35. Therefore, we can easily predict more accurately about the percentage retired people (% working-age population) in the world. Word count: 174.

3. Hypothesis testing. a) According to the report from United National the world average old age dependency ratio is 12.9% in 2013. The data from the report of United National shows that the average percentage of elder dependency in 2013 is 12.9% (%working-age population). However, the calculation in part 2 shows that the average of age dependency ratio, old is about 16.121% (%working-age population). It’s much higher than the percentage in 2013, (16.121% ˃ 12.9%). Therefore, the ratio of the world’s elder dependence expected in the future will increase. Age dependency ratio, old (%working-age population)

Mean

16.12118821

Table 4: The mean of age dependency ratio, old (%working-age population). Word count: 67. b) Hypothesis testing. 

Choosing Confidence level is 95% → level of significant (�) = 0.05.



No assumption required because sample size given is 35, n = 35 is large enough (n ≥ 30), so we can apply Central Limit Theorem. We can be using Ztable or t-table.

-

We called the Null Hypothesis is H0, and the Alternative Hypothesis is H1. The age dependency ratio, old of the world in 2013 is 12.9%. 

H0: ≤ 12.9 (old age dependency ratio is less than or equal to 12.9%).



H1: ˃ 12.9 (old age dependency ratio is more than 12.9%). → This is one tailed test – upper tail.



Since the population standard deviation, � is unknown → we are using t-table: The test statistic we use is: t =

o t= 

´ X−μ S √n

=

16.121 −12.9 10.280 √ 35

´ X−μ S √n = 1.853

(1)

� = 0.05 because we are using one tailed test, so the � will present for the upper rejection region. o d.F = n – 1 = 35 -1 = 34. o � = 0.05. → t n-1,

α

= 1.6909 (the critical value of t is 1.6909). (2)

(1) > (2), as the test statistic falls into in the rejection region. (t = 1.853 > 1.6909) → Hence the Null Hypothesis (H0) is rejected.

Interpretation: As reject H0, hence with 95% level of confidence it can be concluded that the age dependency ratio of old people of the world will not decrease in the future. 

As H0 is rejected, hence type I error might have been committed. →

� = P(Type I) = 5%

o Despite the test result, there are still have 5% of chance in the future, the ratio of elder in the world might decrease more than 12.9%. 

Minimizing type I error, we need to increase the confidence level = (1 – �) to decrease level of significant (�). Critical value will be more extreme and lager than the test statistic (CV > 1.853), hence we do not reject H 0.

4. Regression analysis. a) Determine the dependence variable (DV) and the independent variables (IV) of the dataset 2. 

Dependence variable (DV) o Age dependency ratio, old (%working-age population).



Independent variable (IV) o Gross Nation Income (GNI) per capita (current US$). o Domestic general government health expenditure per capita, PPP (current international $). o Life expectancy at birth, total (years). o Smoking prevalence, total (age 15+).

b) For each independent variables (IV), the relationship between DV and IV. b.1) Age dependency ratio, old (%working-age population) & Gross National Income, per capita (current US$). o Expected relationship: I expected to see a positive linear between age dependency ratio, old (DV) with the GNI per capita (IV).

Age dependency ratio, old (%working-age population)

Age dependency ratio, old & GNI per capita 40 35

f(x) = 0 x + 9.62 R² = 0.57

30 25 20 15 10 5 0

0

10000

20000

30000

40000

50000

60000

70000

80000

90000

100000

GNI per capita (US$)

Figure […]. The scatter plot of age dependency ratio, old & GNI per capita (US$). o Comment: According to the scatter plot, there is a little week linear relationship between the ratio of elder dependency and GNI per person; (0...


Similar Free PDFs