Allan J. Rossman and
Beth L. Chance
California Polytechnic State University
San Luis Obispo, CA 93407
Statistics Teaching and
Resource Library, November 5, 2001
© 2001 by
Allan J. Rossman and Beth L.
Chance,
all rights reserved. This text may be freely shared among
individuals, but it may not be republished in any medium without express
written consent from the author and advance notification of the editor.
This activity leads students to
appreciate the usefulness of simulations for approximating probabilities.
It also provides them with experience calculating probabilities based on
geometric arguments and using the bivariate normal distribution. We have
used it in courses in probability and mathematical statistics, as well as
in an introductory statistics course at the post-calculus level.
The scenario of the activity is easy to state and to understand: Tom and
Mary agree to meet for lunch at a certain restaurant, but their arrival
times are random variables. Furthermore, they agree to wait only for a
certain length of time for the other to arrive. The goal is to calculate
the probability that they meet. Students are first told to assume
independent arrival times uniformly distributed between noon and one
o’clock. They are first to approximate this probability through simulation
and then to calculate it exactly using geometric arguments. Then the
assumption about arrival times is relaxed to allow for different types of
distributions, including normal. Next the assumption of independence is
relaxed, providing students with experience performing bivariate normal
probability calculations.
In each case students are expected to approximate the solution through
simulation before solving it exactly. They are also expected to employ
graphical as well as algebraic problem-solving strategies, in addition to
their simulation analyses. Finally, students are asked to explain
intuitively why it makes sense for the probabilities to change as they do.
Key words: simulation, probability, geometry, independence, bivariate
normal distribution
Objectives
The objectives of this activity are:
 |
To help
students to appreciate the power of simulation for
approximating probabilities |
 |
To develop
students’ intuition about probabilistic reasoning
and their ability to express it verbally |
 |
To provide
experience using elementary geometric arguments to
solve probability problems
|
 |
To provide
experience with performing calculations related to
the bivariate normal probability distribution
|
 |
To lead
students to think about the effects of the
parameters of the bivariate normal probability
distribution on probability calculations
|
|
Activity Description
Students need access to software for
conducting simulations. Minitab instructions are provided in the prototype
activity, but any package could be used. A graphing calculator could be
used with fewer repetitions of each simulation.
This activity is designed to be completed in a 75-minute class period. The
first part of the activity, involving analysis of uniform arrival times,
can be completed in a 50-minute class period.
The simulation analysis of the case of independent, uniform arrival times
should yield a scatterplot such as the following, where solid circles
represent cases where Tom and Mary meet, open circles where they do not
meet (15 minute wait time):

The distributions of the differences in
arrival times and absolute differences, for one simulation of 1000
repetitions, are:


Because these arrival distributions are
independent and uniform, one can determine the probability of meeting as
the area of the region where they meet divided by the total area of the
square. It is easier to find the probability of not meeting, since those
two regions are each triangles of area .5(45)(45), so the total area of
the region where they do not meet is (45)(45) = 2025. The probability of
meeting is therefore 1-(45)(45)/(60)(60) = 7/16 = .4375. In a
calculus-based course, an instructor could also have students evaluate
this probability as a double integral over this region. One could also ask
students to first find the distribution of the difference between two
independent uniform distributions and then integrate that probability
density function.
When one extends the waiting time to 30
minutes, the probability of meeting rises to 1-(30/60)2 = 3/4 =
0.75. For a waiting time of m minutes, the
probability of meeting is 1 - [(60-m)/60]2, a sketch of which
appears below:

When the distributions of arrival times
are assumed to be independently normal with mean 30 minutes after noon and
standard deviation 10 minutes for each person, a simulation analysis
produces results such as the following:

As most students expect, the probability
of their meeting increases because they are both more likely to arrive
near 12:30. Students can also find that the simulation is consistent with
the theoretical result that the difference in arrival times follows a
normal distribution:


Assessment
A key to assessing whether students
learn what they should from this activity is to ask them to compare their
findings from one situation to the next and to explain why it makes sense
for the probabilities to change as they do. Asking questions about the
effects on the meeting probability of further alterations of the
probability distributions or parameters can further test students’
understanding. Examples include asking students to explain what happens to
the probability of meeting as:
 |
The time that
they agree to wait increases |
 |
The
probability distribution of their arrival
times changes from uniform to normal (with
same mean and standard deviation)
|
 |
The means of
their arrival times move farther apart |
 |
The standard
deviations of their arrival times become
smaller |
|
One can also assess students’ ability to
perform simulations in this problem-solving setting by changing the
probability distribution yet again, perhaps to beta distributions. Such an
exercise would be assigned either for homework or on an exam, if one had
access to computers during exams or was willing to include a take-home
exam component.
Another type of assessment follow-up to this activity could involve exact
calculations rather than simulation analyses. Specifically, students could
be told that the arrival times are normal and be expected to determine the
probability of meeting by using the result that the difference of two
normal distributions is itself normal. Moreover, students who have studied
multivariable calculus could be given a bivariate probability density
function for the arrival times and asked to use double integration to
calculate the probability of meeting.
One helpful built-in feature of this activity is that preceding the
theoretical analyses with simulations provides students with an immediate
way of checking the reasonableness of their analyses.
Teacher notes
This activity lends itself to use with a
variety of student audiences. We have used it with introductory students
at the post-calculus level, but many phases of it are also appropriate for
an introductory, algebra-based course. The parts involving uniform
distributions and independent normal distributions are appropriate for all
introductory students, but the bivariate normal analysis with a non-zero
correlation is best reserved for more mathematically inclined students.
In terms of where this activity “fits” into one’s syllabus, we believe
that it can be most effective as an early introduction to understanding
probability as long-term relative frequency and to the usefulness of
simulation. While it may be helpful for students to have some prior
experience with statistical software before engaging in this activity,
that experience is not essential.
We have used this activity as an in-class experience for students,
encouraging them to work in pairs. We believe that it could also work well
as an out-of-class assignment, with class time then devoted to students’
presenting and discussing their results and findings.
A further extension of the activity, appropriate for courses at the
post-calculus level, might use a more “generic” bivariate probability
density function and have students apply double integrals to calculate
probabilities. Another extension would be to use more unusual arrival time
distributions where simulation would provide a much more efficient
problem-solving strategy than integration.
Acknowledgements
This activity was developed under grant
#9950476 from the National Science Foundation. It is part of a project to
develop curricular materials for a data-oriented, active learning approach
to introductory statistics at the post-calculus level. The materials will
be published by Duxbury Press.
Editor's note:
Before 11-6-01, the "student's version" of an activity was called the
"prototype".