Title | Combinedquizzes - all quizzes |
---|---|
Course | Data Science |
Institution | University of Sydney |
Pages | 157 |
File Size | 7.2 MB |
Evaluate Quiz 1 Due Aug 15 at 23:59
Points 5
Questions 5
Available Aug 9 at 9:00 - Nov 24 at 23:59 4 months
Time Limit 120 Minutes
Allowed Attempts 2
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 16 minutes | 5 out of 5 |
Score for this attempt: 5 out of 5 Submitted Aug 15 at 15:32 This attempt took 16 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to A/Prof Elizabeth Scott, what is true of the teenage brain?
Correct!
A time of much change, and disorders can emerge
Question 2
1 / 1 pts
Is this statement true or false? "A contemporaneous control group occurs at the same time as the placebo group."
Correct!
False
Good work! A contemporaneous control group occurs at the same time as the treatment group.
Question 3
1 / 1 pts
Is this statement true or false? "In a double blind experiment, all the subjects cannot see the investigators."
Correct!
False
Good work!
Question 4
1 / 1 pts
Day | Male | Female | Total |
---|---|---|---|
Day 1 | 760 / 900 = 84.4% | 40 / 100 = 40% | 1000 |
Day 2 | 600 / 700 = 85.7% | 150 / 300 = 50% | 1000 |
On day 1, 1000 people were asked 'Do you like Coke more than Pepsi?'. On day 2 another 1000 people were asked the same question. The number of people who said yes is recorded in the above table. (E.g. 760 out of 900 males answered 'yes' on day 1.) Did more people like Coke on day 2 compared to day 1? [Note: the question is different to the exercise in class, so think carefully.]
Correct!
No
Good job. :)
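The answer follows from pooling the counts rather than comparing the per-group percentages. A minimal sketch in R, using only the counts given in the question (the variable names are illustrative, not from the course material):

```r
# Counts of "yes" answers and group sizes, taken from the question
yes   <- matrix(c(760, 40, 600, 150), nrow = 2, byrow = TRUE,
                dimnames = list(c("Day 1", "Day 2"), c("Male", "Female")))
asked <- matrix(c(900, 100, 700, 300), nrow = 2, byrow = TRUE,
                dimnames = dimnames(yes))

# Within each group the proportion of "yes" rises from day 1 to day 2
group_rate <- yes / asked

# Pooled over both groups the number (and proportion) of "yes" falls
overall_yes  <- rowSums(yes)                  # Day 1: 800, Day 2: 750
overall_rate <- overall_yes / rowSums(asked)  # Day 1: 0.80, Day 2: 0.75

print(round(group_rate, 3))
print(overall_rate)
```

Both group rates go up, but the mix of males and females changes between the days, so the pooled count of "yes" answers drops from 800 to 750. This is an instance of Simpson's paradox.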
Question 5
1 / 1 pts
Using the `data()` command, which of these is NOT a real dataset preloaded into R?
Correct!
tigers
Quiz Score: 5 out of 5
Evaluate Quiz 2 Due Aug 22 at 23:59
Points 5
Available until Nov 24 at 23:59
Questions 5
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 11 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Aug 21 at 19:19 This attempt took 11 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Mike Bambach, where does his road fatality data come from?
Correct!
Linking of police data and hospital data
Question 2
1 / 1 pts
What variables could be represented by a double bar plot? Given data: Age = quant; Income = quant; Country of birth = qual; Phone provider = qual
Correct!
Country of birth by phone provider
Good work! Double bar plot needs 2 qual variables.
Question 3
1 / 1 pts
Is this true or false? "A histogram is basically the same as a bar chart".
Correct!
False
Well done! Histogram = quant data; bar plot = qual data.
Question 4
1 / 1 pts
A large study is conducted by the Australian government through a third-party company. The aim is to investigate the community's attitude to the budget. The following information is collected:
Age [0-18, 18-25, 25-40, 40-60, 60+]
Place of birth [Australia or Other]
Annual Income [in $]
Nature of Employment [e.g. student, full-time, part-time, casual]
Attitude to Budget: "I believe the Australian government is managing the budget well." (Measured on a Likert scale: 1. Strongly disagree; 2. Disagree; 3. Neither agree nor disagree; 4. Agree; 5. Strongly agree)
Which of the following graphical summaries best represents the relationship between Nature of Employment and Annual Income?
Correct!
Side by side box plot
Question 5
1 / 1 pts
Using the `iris` data, create a histogram of the `Sepal.Width` variable using the `ggplot` and `geom_histogram` functions. Set the number of bins (columns) in the plot to 15 using the `bins=15` argument inside the `geom_histogram` function. Between which values does the height of the largest column in the resulting plot lie?
Correct!
35-40
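One way to read off the tallest column without eyeballing the plot is to inspect the binned counts that ggplot2 computes. This is a sketch that assumes the ggplot2 package is installed; `layer_data()` returns the data computed for a plot layer, and the object names are illustrative.

```r
library(ggplot2)

# Histogram of sepal width with 15 bins, as the question specifies
p <- ggplot(iris, aes(x = Sepal.Width)) +
  geom_histogram(bins = 15)

# Counts per bin as computed by ggplot2 for the first (only) layer
bin_counts <- layer_data(p)$count

# Height of the tallest column; printing `p` would draw the plot itself
max(bin_counts)
```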
Quiz Score: 5 out of 5
Evaluate Quiz 3 Due Aug 29 at 23:59
Points 5
Questions 5
Available Aug 23 at 9:00 - Oct 9 at 23:59 about 2 months
Time Limit 120 Minutes
Allowed Attempts Unlimited
This quiz was locked Oct 9 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 24 minutes | 5 out of 5 |
Score for this attempt: 5 out of 5 Submitted Aug 29 at 21:37 This attempt took 24 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Danika Wright, what did she discover about "outliers" in real estate data?
Correct!
The outliers are usually real properties
Question 2
1 / 1 pts
In data wrangling, what is an API?
Correct!
Application Programming Interface
Good work!
Question 3
1 / 1 pts
How does the population SD relate to the RMS?
Correct!
SDpop=RMS of (gaps from the mean)
Good work!
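The relationship can be checked numerically. The sketch below uses a small made-up vector (not quiz data) and a helper `rms()` defined here for illustration; note that R's built-in `sd()` returns the sample SD, so it has to be rescaled before comparing.

```r
# Root mean square of a numeric vector
rms <- function(v) sqrt(mean(v^2))

x <- c(2, 4, 4, 4, 5, 5, 7, 9)   # illustrative data, not from the quiz

gaps   <- x - mean(x)            # deviations ("gaps") from the mean
sd_pop <- rms(gaps)              # population SD = RMS of the gaps

# sd() divides by n - 1, so rescale it to the population version
n <- length(x)
sd_from_sample <- sd(x) * sqrt((n - 1) / n)

sd_pop
all.equal(sd_pop, sd_from_sample)
```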
Question 4
1 / 1 pts
Data is collected on the number of hours undergraduate students slept across an entire semester. The data was then stored in `unisleep` as the average number of hours slept per night for a given student. Using `quantile(unisleep)`, it is found that the 25th percentile is 7 and the 75th percentile is 9. The minimum is 0 and the maximum is 15.
Which statement is TRUE?
Correct!
There are outliers
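A short check of why outliers must exist, assuming the usual boxplot convention that points more than 1.5 IQR beyond the quartiles count as outliers (the convention is an assumption here; the transcript does not state it):

```r
q1 <- 7; q3 <- 9     # quartiles reported by quantile(unisleep)
lo <- 0; hi <- 15    # reported minimum and maximum

iqr <- q3 - q1                      # 2
lower_threshold <- q1 - 1.5 * iqr   # 4
upper_threshold <- q3 + 1.5 * iqr   # 12

# The minimum and maximum both fall outside the thresholds
has_outliers <- lo < lower_threshold || hi > upper_threshold
has_outliers
```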
Question 5
1 / 1 pts
Using the road fatalities data set from the lab and ggplot, create a boxplot of the Age variable grouped by the State variable. For which state is the size of the box (i.e. the IQR) the smallest?
Correct!
NT
Quiz Score: 5 out of 5
Evaluate Quiz 4 Due Sep 5 at 23:59
Points 5
Available after Aug 30 at 9:00
Questions 5
Time Limit 120 Minutes
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 11 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Sep 4 at 20:32 This attempt took 11 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Kylie Moulds, in what area do more young swimmers drop out of elite swimming?
Correct!
city areas
Question 2
1 / 1 pts
Which of the following is true regarding the General Normal Curve?
Correct!
It has any mean and any standard deviation
Well done!
Question 3
1 / 1 pts
For a Normal curve with mean 10 and standard deviation 4, what percentage of the data lie between 10 and 14?
Correct!
34%
Wonderful job!
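The interval from 10 to 14 runs from the mean to one standard deviation above it, which is where the 34% comes from. A quick check with `pnorm()`:

```r
mu <- 10
sigma <- 4

# Area under the Normal curve between the mean and one SD above it
p_between <- pnorm(14, mean = mu, sd = sigma) - pnorm(10, mean = mu, sd = sigma)
round(100 * p_between, 1)   # about 34.1
```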
Question 4
1 / 1 pts
Which command will calculate P(X>1.2) when X∼N(3,4)?
Correct!
pnorm(1.2,3,2,lower.tail=F)
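The recorded answer treats the 4 in N(3, 4) as the variance, so the value passed to `pnorm()` is the standard deviation 2, because `pnorm()` takes `sd` rather than the variance. A short sketch with illustrative variable names:

```r
mu <- 3
variance <- 4
sigma <- sqrt(variance)   # pnorm() expects the standard deviation

# Upper tail P(X > 1.2), written two equivalent ways
p_upper <- pnorm(1.2, mean = mu, sd = sigma, lower.tail = FALSE)
p_check <- 1 - pnorm(1.2, mean = mu, sd = sigma)

p_upper
all.equal(p_upper, p_check)
```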
Question 5
1 / 1 pts
Create a normal model for the `Petal.Width` variable from the `iris` data set and use this model to answer the following question. What percentage of the sample (to the nearest percent) would you expect to have a petal width between 0.5 and 1?
Correct!
21.7%
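A sketch of one way to build the normal model, assuming it uses the sample mean and the sample SD from `sd()` (the lab may have used the population SD instead, which shifts the result slightly):

```r
m <- mean(iris$Petal.Width)
s <- sd(iris$Petal.Width)   # sample SD; an assumption about the intended model

# Area under the fitted Normal curve between 0.5 and 1
p <- pnorm(1, mean = m, sd = s) - pnorm(0.5, mean = m, sd = s)
round(100 * p, 1)
```

Under these assumptions the result should land close to the recorded answer.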
Quiz Score: 5 out of 5
Evaluate Quiz 5 Due Sep 12 at 23:59
Points 5
Questions 5
Available Sep 6 at 9:00 - Nov 24 at 23:59 3 months
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 12 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Sep 11 at 13:53 This attempt took 12 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Mitch Gibbs, what is linear modelling used for in his oyster research?
Correct!
calibration curves for lipids
Question 2
1 / 1 pts
If we are using the independent variable x to predict the dependent variable y, then which of the following statements is true?
Correct!
A residual plot graphs residuals against x
Well done!
Question 3
1 / 1 pts
Which of the following is NOT an indication that a linear model might be appropriate?
Correct!
The residual plot looks linear
Well done! This is the only answer that is false.
Question 4
1 / 1 pts
It is hypothesised that the second-hand market sale price of a certain vehicle decreases linearly with the distance (kms) travelled. Data is stored in `price` (y) and `distance` (x), where both variables are in 1000s of units (dollars and kms). The following linear model is produced: y = 40 - 5x, with a correlation coefficient of -0.8.
What plots would support this hypothesis?
Correct!
A scatter plot of price and distance is mostly linear, and the residual plot shows no trend.
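To see what those two plots look like in practice, here is a sketch on simulated data. The data are hypothetical: they are generated around the line y = 40 - 5x with an arbitrary noise level and seed, since the quiz's actual data are not in the transcript.

```r
set.seed(123)  # arbitrary seed so the sketch is repeatable

# Hypothetical data in 1000s of units, scattered around y = 40 - 5x
distance <- runif(50, min = 0, max = 6)
price    <- 40 - 5 * distance + rnorm(50, sd = 2)

fit <- lm(price ~ distance)

# Scatter plot with the fitted line
plot(distance, price)
abline(fit)

# Residual plot: residuals against x, ideally a patternless band around 0
plot(distance, resid(fit))
abline(h = 0, lty = 2)

coef(fit)
cor(distance, price)
```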
Question 5
1 / 1 pts
Using the olympics data with the filter function, determine how many women in the dataset are from the USA.
Correct!
7
Quiz Score: 5 out of 5
Evaluate Quiz 6 Due Sep 26 at 23:59
Points 5
Questions 5
Available Sep 20 at 9:00 - Nov 24 at 23:59 2 months
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 11 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Sep 26 at 15:01 This attempt took 11 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Jason Chin, what Australian law case could be an example of the Prosecutor's Fallacy?
Correct!
Kathleen Folbigg
Question 2
1 / 1 pts
If event A and event B are complementary and P(event A) = 0.6, then what is P(event B)? Please answer correct to 1 dp.
Correct!
Correct Answer
0.4
0.4 margin of error +/- 0.1
Great!
Question 3
1 / 1 pts
Is this statement true or false? "If distinct objects are drawn at random, then each object has a different chance of being selected."
Correct!
False
Fantastic!
Question 4
1 / 1 pts
In a group of 20 friends, 9 like pizza, 11 like coke and 5 like both. If you know someone in the group likes coke, what is the chance they don’t like pizza?
Correct!
6/11
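The conditional probability only needs the people who like coke. A short sketch with illustrative variable names:

```r
n_group <- 20   # group size; not needed once we condition on liking coke
n_pizza <- 9
n_coke  <- 11
n_both  <- 5

# People who like coke but not pizza
n_coke_only <- n_coke - n_both   # 6

# P(not pizza | coke) = (coke only) / (all who like coke)
p_not_pizza_given_coke <- n_coke_only / n_coke
p_not_pizza_given_coke           # 6/11, about 0.545
```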
Question 5
1 / 1 pts
The rules of a game are:
The game costs $10 to play.
You get to throw a fair die 20 times.
Every time you throw a 5 or 6 you win $1.
Set the seed to 1 using the code `set.seed(1)` and run a simulation of this game. How much profit or loss do you make?
Correct!
$2 loss
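One possible way to simulate the game is sketched below. The exact figure depends on how the simulation is written and on the R version (the default `sample()` algorithm changed in R 3.6.0), so this is not guaranteed to reproduce the recorded answer.

```r
set.seed(1)

cost  <- 10
rolls <- sample(1:6, size = 20, replace = TRUE)  # 20 throws of a fair die

winnings <- sum(rolls %in% c(5, 6))   # $1 for every 5 or 6
profit   <- winnings - cost           # negative values are a loss

profit
```

For comparison, the expected winnings are 20 × 1/3 ≈ $6.67, so on average the game loses about $3.33 per play.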
Quiz Score: 5 out of 5
Evaluate Quiz 7 Due Oct 10 at 23:59
Points 5
Questions 5
Available Oct 4 at 9:00 - Nov 24 at 23:59 about 2 months
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 55 minutes | 4 out of 5 |
Score for this quiz: 4 out of 5 Submitted Oct 10 at 15:24 This attempt took 55 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Andy Tran, if we could model the physics of a coin toss, what could we potentially predict?
Correct!
the side it will fall on
Question 2
1 / 1 pts
Is this statement true or false?
"For repeated simulations of any chance process, the simulation histogram converges to the probability histogram."
Correct!
False
Great job! (That was a tricky one)
Question 3
0 / 1 pts
Which of these statements is true?
You Answered
A probability histogram represents chance by data.
Oops! This is false - a probability histogram represents chance by area.
Correct Answer
A probability histogram represents chance by area.
Question 4
1 / 1 pts
Consider a box with mean M and standard deviation SD. 10 draws are taken at random with replacement from the box, and the sum is calculated. What is the EV and SE of the Sample Sum?
Correct!
10M and sqrt(10)SD
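In general, for n draws with replacement the sum has EV = n × M and SE = sqrt(n) × SD. A small sketch, where the helper `box_sum_ev_se()` and the example box are made up for illustration:

```r
# EV and SE of the sum of n draws (with replacement) from a box
box_sum_ev_se <- function(box, n) {
  m      <- mean(box)
  sd_pop <- sqrt(mean((box - m)^2))   # population SD of the box
  c(EV = n * m, SE = sqrt(n) * sd_pop)
}

# Hypothetical box, chosen only for illustration
res <- box_sum_ev_se(c(0, 1, 1, 2), n = 10)
res
```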
Question 5
1 / 1 pts
Suppose we have a 12 sided fair die and we toss it 20 times. What is the expected value of the sum of all tosses?
Correct!
130
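The expected value is the number of tosses times the mean face value. The sketch below also adds an optional simulation check; the seed and the number of repetitions are arbitrary choices, not part of the quiz.

```r
faces  <- 1:12
n_toss <- 20

ev_sum <- n_toss * mean(faces)   # 20 * 6.5 = 130

# Optional check by simulation
set.seed(2024)
sim_sums <- replicate(10000, sum(sample(faces, n_toss, replace = TRUE)))

ev_sum
mean(sim_sums)   # should be close to ev_sum
```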
Quiz Score: 4 out of 5
Evaluate Quiz 8 Due Oct 17 at 23:59
Points 5
Questions 5
Available Oct 11 at 9:00 - Nov 24 at 23:59 about 1 month
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 49 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Oct 17 at 21:52 This attempt took 49 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Mel Keep, what is 1 limitation with surveys?
Correct!
We only have a snapshot, not a perspective of how things are over time.
Question 2
1 / 1 pts
Is this statement true or false? Bootstrapping involves using the population to estimate properties of the sample.
Correct!
False
Great job!
Question 3
1 / 1 pts
Is this statement true or false? When a selection process is biased, we must take a larger sample to reduce this bias.
Correct!
False
Well done!
Question 4
1 / 1 pts
Consider 3 box models as follows, where the draws are made with replacement, and the sum of the draws is calculated.
BoxModel1: 100 draws from the box with 1, 2, 3, 4, 5
BoxModel2: 100 draws from the box with 2, 4, 6, 8, 10
BoxModel3: 10,000 draws from the box with 1, 2, 3, 4, 5
Denote the standard error (SE) for the sum of draws of the 3 models by SE1, SE2 and SE3 respectively. Which of the following statements is TRUE?
Correct!
SE2 = 2SE1 , SE3 = 10SE1
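The ratios can be verified directly from SE = sqrt(n) × SD of the box. Doubling every ticket doubles the SD, and multiplying the number of draws by 100 multiplies the SE by 10. The helper names below are illustrative:

```r
# Population SD of a box, and SE of the sum of n draws with replacement
pop_sd <- function(box) sqrt(mean((box - mean(box))^2))
se_sum <- function(box, n) sqrt(n) * pop_sd(box)

se1 <- se_sum(c(1, 2, 3, 4, 5),  n = 100)
se2 <- se_sum(c(2, 4, 6, 8, 10), n = 100)
se3 <- se_sum(c(1, 2, 3, 4, 5),  n = 10000)

c(SE1 = se1, SE2 = se2, SE3 = se3)
c(SE2_over_SE1 = se2 / se1, SE3_over_SE1 = se3 / se1)   # 2 and 10
```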
Question 5
1 / 1 pts
Using the same Oyster example from this week’s lab video, answer the following question: Using a bootstrap technique consisting of 5000 samples, what is the approximate probability of a sample of 1000 oysters containing less than or equal to 300 pearls? Set the seed to 99 before creating the bootstrap sample using the code `set.seed(99)`.
Correct!
17.5%
Quiz Score: 5 out of 5
Evaluate Quiz 9 Due Oct 24 at 23:59
Points 5
Questions 5
Available Oct 18 at 9:00 - Nov 24 at 23:59 about 1 month
Time Limit 120 Minutes
This quiz was locked Nov 24 at 23:59.
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 32 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Oct 24 at 22:23 This attempt took 32 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Lara Ford, what famous medical trial in the UK established the importance of early peanut exposure?
Correct!
LEAP
Question 2
1 / 1 pts
Suppose we have a Proportion Test, testing the percentage of International Students in a class. The null hypothesis H0 is: p = 0.3. We take a sample of 10, and the observed value is 2.
What is the test statistic?
Correct!
Good work! This is the test statistic based on the Sample Mean.
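The selected answer itself did not survive in this transcript. Purely as an illustration, the sketch below computes a test statistic of the form (OV - EV)/SE for these numbers, modelling H0 as a 0/1 box with 30% ones and working with the sample mean (proportion). Whether the quiz expected exactly this value is an assumption.

```r
p0 <- 0.3    # proportion under H0
n  <- 10     # sample size
x  <- 2      # observed count

# Sample-mean (proportion) version of observed value, expected value and SE
ov <- x / n
ev <- p0
se <- sqrt(p0 * (1 - p0)) / sqrt(n)

test_stat <- (ov - ev) / se
test_stat
```

Working with the sum instead (OV = 2, EV = 3, SE = sqrt(10) × sqrt(0.21)) gives the same statistic, since both numerator and denominator scale by n.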
Question 3
1 / 1 pts
What is the formula for the test statistic?
Correct!
(OV-EV)/SE
Good work!
Question 4
1 / 1 pts
You have a suspicion that a die is improperly weighted, so you throw it 100 times and get 20 1’s. You want to find out if there is enough evidence to claim that the die is unbalanced. Let p = P(die rolls a '1'). What are your hypotheses?
Correct!
H0 : p = 1/6 vs H1 : p ≠ 1/6
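The question only asks for the hypotheses, but carrying the test through shows how they are used. This sketch assumes a Normal approximation to the number of 1's, which the transcript does not specify:

```r
n  <- 100
x  <- 20       # number of 1's observed
p0 <- 1 / 6    # P(rolling a 1) for a fair die, under H0

ev <- n * p0
se <- sqrt(n) * sqrt(p0 * (1 - p0))

z <- (x - ev) / se
p_value <- 2 * pnorm(abs(z), lower.tail = FALSE)   # two-sided, matching H1: p != 1/6

c(z = z, p_value = p_value)
```

Under this approximation the p-value is well above 0.05, so 20 ones in 100 throws would not on its own be strong evidence that the die is unbalanced.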
Question 5
1 / 1 pts
Suppose we have a coin that we suspect is biased towards heads. Out of 50 tosses we had 34 heads. Using a simulation method with 10000 trials, find an approximate p-value (rounded to 2dp) where we have the following null and alternative hypotheses: Set the seed to 2 before running the simulation.
Correct!
0.01
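The hypotheses are missing from this transcript. The sketch below assumes H0: p = 0.5 against the one-sided alternative H1: p > 0.5, which matches the stated suspicion of a bias towards heads. The exact simulated value depends on how the simulation is coded and on the R version, so it may not reproduce the recorded answer digit for digit.

```r
set.seed(2)

n_toss  <- 50
n_heads <- 34
trials  <- 10000

# Number of heads in each simulated run of 50 fair tosses (H0: p = 0.5, assumed)
sim_heads <- replicate(trials, sum(sample(c(0, 1), n_toss, replace = TRUE)))

# One-sided p-value: how often chance alone gives 34 or more heads
p_value <- mean(sim_heads >= n_heads)
round(p_value, 2)
```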
Quiz Score: 5 out of 5
Evaluate Quiz 10 Due Oct 31 at 23:59
Points 5
Available after Oct 25 at 9:00
Questions 5
Time Limit 120 Minutes
Attempt History
Attempt | Time | Score |
---|---|---|
Attempt 1 (latest) | 49 minutes | 5 out of 5 |
Score for this quiz: 5 out of 5 Submitted Oct 29 at 22:26 This attempt took 49 minutes.
Question 1
1 / 1 pts
Sydney Data Stories: According to Dr Llewellyn Mills, what did his caffeine withdrawal trials reveal?
Correct!
the nocebo effect in withdrawal
Question 2
1 / 1 pts
Consider two datasets:
Dataset A = 3, 9, 9
Dataset B = 2, 5, 3
What is the mean of the differences A-B (to 1 dp)?
Correct!
Correct Answer
3.7
3.67 margin of error +/- 0.1
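The calculation pairs the values position by position and averages the differences:

```r
A <- c(3, 9, 9)
B <- c(2, 5, 3)

d <- A - B           # paired differences: 1, 4, 6
mean(d)              # 11/3, about 3.67
round(mean(d), 1)    # 3.7
```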
Question 3
1 / 1 pts
Is this statement true or false? "Statistical inference involves making decisions about the sample data given what we know about the population."
Correct!
False
Nice work!
Question 4
1 / 1 pts
Kettle Chips suspects that a machine which fills packets of 175g Sea Salt Chips may be wrongly ...