Study Guides (283,956)
CA (135,620)
UTSC (8,880)
Statistics (228)
STAB22H3 (176)
All (64)
Final

STAB22H3 Study Guide - Final Guide: Scantron Corporation, Test Statistic, Sampling Distribution
Exam
Premium

by OneClass2540294
25 Pages
88 Views
Fall 2018

Department
Statistics
Course Code
STAB22H3
Professor
All
Study Guide
Final

This preview shows pages 1-3. Sign up to view the full 25 pages of the document.
University of Toronto Scarborough
STAB22 Final Examination
December 2009
For this examination, you are allowed two handwritten letter-sized
sheets of notes (both sides) prepared by you, a non-programmable,
non-communicating calculator, and writing implements.
This question paper has 25 numbered pages, with statistical tables
at the back. Before you start, check to see that you have all
the pages. You should also have a Scantron sheet on which to
enter your answers. If any of this is missing, speak to an invigilator.
This examination is multiple choice. Each question has equal
weight, and there is no penalty for guessing. To ensure that
you receive credit for your work on the exam, fill in the bubbles
on the Scantron sheet for your correct student number (under
“Identification”), your last name, and as much of your first name
as fits.
Mark in each case the best answer out of the alternatives given
(which means the numerically closest answer if the answer is a
number and the answer you obtained is not given.)
If you need paper for rough work, use the back of the sheets of
this question paper.
Before you begin, two more things:
Check that the colour printed on your Scantron sheet matches
the colour of your question paper. If it does not, get a new
Scantron from an invigilator.
Complete the signature sheet, but sign it only when the in-
vigilator collects it. The signature sheet shows that you were
present at the exam.
At the end of the exam, you must hand in your Scantron sheet (or
you will receive a mark of zero for the examination). You will be
graded only on what appears on the Scantron sheet. You may take
away the question paper after the exam, but whether you do or
not, anything written on the question paper will not be considered
in your grade.
1
1. Heart problems can be examined via a small tube (called a catheter) threaded into the heart from a
vein in the patient’s leg. It is important that the company that manufactures the catheter maintains
a diameter of 2 mm. Suppose µdenotes the population mean diameter. Each day, quality control
personnel make measurements to test a null hypothesis that µ= 2.00 against an alternative that
µ6= 2.00, using α= 0.05. If a problem is discovered, the manufacturing process is stopped until the
problem is corrected.
What, in this context, is a type II error?
(a) Concluding that the mean catheter diameter is not 2 mm when it is actually less than 2 mm.
(b) * Concluding that the mean catheter diameter is satisfactory when in fact it is either bigger or
smaller than 2 mm.
(c) Using a sample size that is too small.
(d) Concluding that the mean catheter diameter is 2 mm when it actually is 2 mm.
(e) Concluding that the mean catheter diameter is unsatisfactory when in fact it is equal to 2 mm.
A Type II error is failing to reject the null when it is in fact wrong. In this case, the mean
is actually not 2, but we cannot reject the null hypothesis that it is 2. This is (b).
2. The Kentucky Derby is a famous horse race that has been run every year since the late 19th century.
The scatterplot below shows the speed (in miles per hour) of the winning horse, plotted against the
year. The regression line is shown on the plot.
What kind of association do you see between speed and year?
(a) no association
(b) negative and non-linear
(c) negative and linear
(d) positive and linear
(e) * positive and non-linear
The pattern is not absolutely clear, but it does seem that on average the speed is higher
when the year number is higher (that is, more recently). So the association is positive. Is
it linear? Well, the majority of observations are below the line at the left and right ends,
and above the line in the middle, so it looks as if a non-linear association would be a better
description of what’s going on. This is reasonable because you’d guess that there is a lot
of “room for improvement” in the early years, but now, any breaking of the speed record is
going to be only by a small amount.
3. Weighing large trucks is a slow business because the truck has to stop exactly on a scale. This is
called the “static weight”. The Minnesota Department of Transportation developed a new method,
called “weight-in-motion”, to weigh a truck as it drove over the scale without stopping. To test the
2
effectiveness of the “weight-in-motion” method, 10 trucks were each weighed using both methods. All
weights are measured in thousands of pounds.
A scatterplot of the results is shown below. Superimposed on the scatterplot is the line that the data
would follow if the weight-in-motion was always equal to the static weight. Use the scatterplot to
answer this question and the one following.
How would you describe the positive association?
(a) not useful since the data are not close to the line
(b) * approximately linear
(c) definitely curved
(d) no association of note
The line on the plot is a bit of a distraction, because the points form a more or less linear
pattern, just not around the line shown. (The trend in the points is not obviously a curve,
at least).
4. Question 3 described some data on two methods of weighing trucks. The Minnesota Department of
Transportation wants to predict the static weight of trucks from the weight-in-motion. Which of the
following statements best describes what they can do?
(a) * taking the weights-in-motion and modifying them in some linear way would accurately predict
the static weight.
(b) The weights-in-motion can be used to predict the static weights, but a non-linear transformation
would have to be applied to do it.
(c) The static weight is accurately predicted by the weight-in-motion itself.
(d) There is no way to use the weights-in-motion to predict the static weight.
Since there is a linear association (just not of the form y=x), the static weight can be
predicted reasonably well from the weight-in-motion, by multiplying the weight-in-motion
by something and adding something else — that is, by using the linear regression equation.
5. A factory hiring people to work on an assembly line gives job applicants a test of manual agility.
This test involves fitting strangely-shaped pegs into matching holes on a board. In the test, each job
applicant has 60 seconds to fit as many pegs into their holes as possible. For one job application cycle,
the results were as follows:
Male applicants Female applicants
Subjects 41 51
Mean pegs placed 17.9 19.4
SD of pegs placed 2.5 3.4
The factory wishes to see if there is evidence for a difference between males and females. Which is
more appropriate, a matched-pairs t-test or a two-sample t-test? Using the more appropriate test,
what P-value do you obtain?
3

Loved by over 2.2 million students

Over 90% improved by at least one letter grade.

Leah — University of Toronto

OneClass has been such a huge help in my studies at UofT especially since I am a transfer student. OneClass is the study buddy I never had before and definitely gives me the extra push to get from a B to an A!

Leah — University of Toronto
Saarim — University of Michigan

Balancing social life With academics can be difficult, that is why I'm so glad that OneClass is out there where I can find the top notes for all of my classes. Now I can be the all-star student I want to be.

Saarim — University of Michigan
Jenna — University of Wisconsin

As a college student living on a college budget, I love how easy it is to earn gift cards just by submitting my notes.

Jenna — University of Wisconsin
Anne — University of California

OneClass has allowed me to catch up with my most difficult course! #lifesaver

Anne — University of California
Description
University of Toronto Scarborough STAB22 Final Examination December 2009 For this examination, you are allowed two handwritten letter-sized sheets of notes (both sides) prepared by you, a non-programmable, non-communicating calculator, and writing implements. This question paper has 25 numbered pages, with statistical tables at the back. Before you start, check to see that you have all the pages. You should also have a Scantron sheet on which to enter your answers. If any of this is missing, speak to an invigilator. This examination is multiple choice. Each question has equal weight, and there is no penalty for guessing. To ensure that you receive credit for your work on the exam, ll in the bubbles on the Scantron sheet for your correct student number (under \Identication"), your last name, and as much of your rst name as ts. Mark in each case the best answer out of the alternatives given (which means the numerically closest answer if the answer is a number and the answer you obtained is not given.) If you need paper for rough work, use the back of the sheets of this question paper. Before you begin, two more things: Check that the colour printed on your Scantron sheet matches the colour of your question paper. If it does not, get a new Scantron from an invigilator. Complete the signature sheet, but sign it only when the in- vigilator collects it. The signature sheet shows that you were present at the exam. At the end of the exam, you must hand in your Scantron sheet (or you will receive a mark of zero for the examination). You will be graded only on what appears on the Scantron sheet. You may take away the question paper after the exam, but whether you do or not, anything written on the question paper will not be considered in your grade. 1 1. Heart problems can be examined via a small tube (called a catheter) threaded into the heart from a vein in the patients leg. It is important that the company that manufactures the catheter maintains a diameter of 2 mm. Suppose denotes the population mean diameter. Each day, quality control personnel make measurements to test a null hypothesis that = 2:00 against an alternative that 6= 2:00, using = 0:05. If a problem is discovered, the manufacturing process is stopped until the problem is corrected. What, in this context, is a type II error? (a) Concluding that the mean catheter diameter is not 2 mm when it is actually less than 2 mm. (b) * Concluding that the mean catheter diameter is satisfactory when in fact it is either bigger or smaller than 2 mm. (c) Using a sample size that is too small. (d) Concluding that the mean catheter diameter is 2 mm when it actually is 2 mm. (e) Concluding that the mean catheter diameter is unsatisfactory when in fact it is equal to 2 mm. A Type II error is failing to reject the null when it is in fact wrong. In this case, the mean is actually not 2, but we cannot reject the null hypothesis that it is 2. This is (b). 2. The Kentucky Derby is a famous horse race that has been run every year since the late 19th century. The scatterplot below shows the speed (in miles per hour) of the winning horse, plotted against the year. The regression line is shown on the plot. What kind of association do you see between speed and year? (a) no association (b) negative and non-linear (c) negative and linear (d) positive and linear (e) * positive and non-linear The pattern is not absolutely clear, but it does seem that on average the speed is higher when the year number is higher (that is, more recently). So the association is positive. Is it linear? Well, the majority of observations are below the line at the left and right ends, and above the line in the middle, so it looks as if a non-linear association would be a better description of whats going on. This is reasonable because youd guess that there is a lot of \room for improvement" in the early years, but now, any breaking of the speed record is going to be only by a small amount. 3. Weighing large trucks is a slow business because the truck has to stop exactly on a scale. This is called the \static weight". The Minnesota Department of Transportation developed a new method, called \weight-in-motion", to weigh a truck as it drove over the scale without stopping. To test the 2 eectiveness of the \weight-in-motion" method, 10 trucks were each weighed using both methods. All weights are measured in thousands of pounds. A scatterplot of the results is shown below. Superimposed on the scatterplot is the line that the data would follow if the weight-in-motion was always equal to the static weight. Use the scatterplot to answer this question and the one following. How would you describe the positive association? (a) not useful since the data are not close to the line (b) * approximately linear (c) denitely curved (d) no association of note The line on the plot is a bit of a distraction, because the points form a more or less linear pattern, just not around the line shown. (The trend in the points is not obviously a curve, at least). 4. Question 3 described some data on two methods of weighing trucks. The Minnesota Department of Transportation wants to predict the static weight of trucks from the weight-in-motion. Which of the following statements best describes what they can do? (a) * taking the weights-in-motion and modifying them in some linear way would accurately predict the static weight. (b) The weights-in-motion can be used to predict the static weights, but a non-linear transformation would have to be applied to do it. (c) The static weight is accurately predicted by the weight-in-motion itself. (d) There is no way to use the weights-in-motion to predict the static weight. Since there is a linear association (just not of the form y = x), the static weight can be predicted reasonably well from the weight-in-motion, by multiplying the weight-in-motion by something and adding something else | that is, by using the linear regression equation. 5. A factory hiring people to work on an assembly line gives job applicants a test of manual agility. This test involves tting strangely-shaped pegs into matching holes on a board. In the test, each job applicant has 60 seconds to t as many pegs into their holes as possible. For one job application cycle, the results were as follows: Male applicants Female applicants Subjects 41 51 Mean pegs placed 17.9 19.4 SD of pegs placed 2.5 3.4 The factory wishes to see if there is evidence for a dierence between males and females. Which is more appropriate, a matched-pairs t-test or a two-sample t-test? Using the more appropriate test, what P-value do you obtain? 3 (a) bigger than 0.05 (b) * between 0.01 and 0.02 (c) between 0.005 and 0.01 (d) less than 0.005 (e) between 0.02 and 0.05 This is a two-sample situation, because there is no way of matching up males and females (the giveaway being that there are dierent numbers of each). So, letting 1 be males and 2 females, we are testing 0 : 1= 2s. H :a 61 , 2ince we are looking for any dierence. This is a 2-sided test. I do this kind of problem by rst nding r 2 2 d = 2:5 + 3:4 = 0:6157 41 51 and then getting the test statistic as 17:9 19:4 t = = 2:436: 0:6157 We use 40 df, and look up the t without the minus sign in Table D. (Equally good is to do the subtraction the other way around to get a positive result.) The P-value one-sided would be between 0.005 and 0.01; our test is two-sided so we have to double that, between 0.01 and 0.02. 6. A discrete random variable X has the distribution shown below: Value 0 1 2 Probability 0.2 0.4 0.4 The random variable Y is dened as Y = 2X + 10. What is the mean (expected value) of Y ? (a) * 12.4 (b) 10 (c) 1.2 (d) 2.4 (e) there is not enough information The mean of Y is two times the mean of X, plus 10. The table at the top of the question enables us to work out the mean of X as 0(0:2) + 1(0:4) + 2(0:4) = 1:2, so the mean of Y is 2(1:2) + 10 = 12:4. Alternatively, the distribution of Y has the same probabilities as the distribution of X, but the possible values are 2(0) + 10 = 10, 2(1) + 10 = 12, 2(2) + 10 = 14 instead of 0, 1 and 2. Then you can nd the mean of Y directly as 10(0:2) + 12(0:4) + 14(0:4) = 12:4. Longer, but it still works. 7. A consumer magazine tested 14 (randomly chosen) brands of vanilla yogurt and measured the number of calories per serving of each one. Some Minitab output from the analysis is shown below. One-Sample T: Calories Variable N 90% CI Calories 14 (136.676, 179.038) One-Sample T: Calories 4
More Less
Unlock Document

Only pages 1-3 are available for preview. Some parts have been intentionally blurred.

Unlock Document
You're Reading a Preview

Unlock to view full version

Unlock Document

You've reached the limit of 4 previews this month

Create an account for unlimited previews.

Already have an account?

Log In


OR

Don't have an account?

Join OneClass

Access over 10 million pages of study
documents for 1.3 million courses.

Sign up

Join to view


OR

By registering, I agree to the Terms and Privacy Policies
Already have an account?
Just a few more details

So we can recommend you notes for your school.

Reset Password

Please enter below the email address you registered with and we will send you a link to reset your password.

Add your courses

Get notes from the top students in your class.


Submit