Question. 5)Write short notes on
a) Bernoulli
Trials
b) Standard
Normal distribution
c) Central
Limit theorem
Ans :
a) Bernoulli trials
The Bernoulli trials process is one of the
simplest, yet most important, of all random processes. It is an essential topic
in any course in probability or mathematical statistics. The process consists
of independent trials with two outcomes and with constant probabilities from
trial to trial. Thus it is the mathematical abstraction of coin tossing. The
process leads to several important probability distributions: the binomial,
geometric, and negative binomial.
In the theory
of probability and statistics, a Bernoulli trialis
an experiment whose outcome is random and can be either of two possible
outcomes, "success" and "failure".
In practice it refers to a
single experiment which can have one of two possible outcomes. These
events can be phrased into "yes or no" questions:
- Did
the coin land heads?
- Was
the newborn child a girl?
Therefore success and failure are labels for
outcomes, and should not be construed literally. Examples of Bernoulli trials
include
- Flipping
a coin. In this context, obverse ("heads") conventionally
denotes success and reverse ("tails") denotes failure.
A fair coin has the probability of success 0.5 by definition.
- Rolling
a die, where a six is "success" and everything else a
"failure".
- In
conducting a political opinion poll, choosing a voter at random to
ascertain whether that voter will vote "yes" in an upcoming
referendum.
Mathematical description
Mathematically, a Bernoulli trial can be
described by asample space Ω consisting of two values, s for
"success" and f for "failure". Therefore
the sample space is . Then a random variable X can be
defined on this sample space, that is, a function . In this case the random variable is very simple and
given by
If p is the probability of
observing a 1 and 1 − p the probability of
observing a 0 (the probability distribution ofX),
then X has a Bernoulli distribution and
the expected value and the variance of X are
given by
The standard deviation of X is This distribution is a special case of
the Binomial distribution.
A Bernoulli process consists of repeatedly
performing independent but identical Bernoulli trials.
The process of determining an expectation
value and deviation, based on a limited number of Bernoulli trials is
colloquially known as "checking if a coin is fair".
b) Standard Normal Distribution
The standard normal distribution is
a special case of the normal distribution. It is the distribution that
occurs when a normal random variable has a mean of zero and a
standard deviation of one.
Standard Score (aka, z Score)
The normal random variable of a standard
normal distribution is called a standard score or a z-score.
Every normal random variable X can be transformed into a z score
via the following equation:
z =
(X - μ) / σ
where X is a normal random
variable, μ is the mean mean of X, and σ is the standard deviation
of X.
Standard Normal Distribution Table
A standard normal distribution table shows
a cumulative probability associated with a particular z-score. Table
rows show the whole number and tenths place of the z-score. Table columns show
the hundredths place. The cumulative probability (often from minus infinity to
the z-score) appears in the cell of the table.
For example, a section of the standard normal
table is reproduced below. To find the cumulative probability of a z-score equal
to -1.31, cross-reference the row of the table containing -1.3 with the column
containing 0.01. The table shows that the probability that a standard normal
random variable will be less than -1.31 is 0.0951; that is, P(Z < -1.31) =
0.0951.
z
|
0.00
|
0.01
|
0.02
|
0.03
|
0.04
|
0.05
|
0.06
|
0.07
|
0.08
|
0.09
|
-3.0
|
0.0013
|
0.0013
|
0.0013
|
0.0012
|
0.0012
|
0.0011
|
0.0011
|
0.0011
|
0.0010
|
0.0010
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
-1.4
|
0.0808
|
0.0793
|
0.0778
|
0.0764
|
0.0749
|
0.0735
|
0.0722
|
0.0708
|
0.0694
|
0.0681
|
-1.3
|
0.0968
|
0.0951
|
0.0934
|
0.0918
|
0.0901
|
0.0885
|
0.0869
|
0.0853
|
0.0838
|
0.0823
|
-1.2
|
0.1151
|
0.1131
|
0.1112
|
0.1093
|
0.1075
|
0.1056
|
0.1038
|
0.1020
|
0.1003
|
0.0985
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
...
|
3.0
|
0.9987
|
0.9987
|
0.9987
|
0.9988
|
0.9988
|
0.9989
|
0.9989
|
0.9989
|
0.9990
|
0.9990
|
Of course, you may not be interested in the
probability that a standard normal random variable falls between minus infinity
and a given value. You may want to know the probability that it lies between a
given value and plus infinity. Or you may want to know the probability that a
standard normal random variable lies between two given values. These
probabilities are easy to compute from a normal distribution table. Here's how.
- Find
P(Z > a). The probability that a standard normal random variable (z) is
greater than a given value (a) is easy to find. The table shows the P(Z
< a). The P(Z > a) = 1 - P(Z < a).
Suppose, for example, that we want to know the probability that a z-score will be greater than 3.00. From the table (see above), we find that P(Z < 3.00) = 0.9987. Therefore, P(Z > 3.00) = 1 - P(Z < 3.00) = 1 - 0.9987 = 0.0013. - Find
P(a < Z < b). The probability that a standard normal random
variables lies between two values is also easy to find. The P(a < Z
< b) = P(Z < b) - P(Z < a).
For example, suppose we want to know the probability that a z-score will be greater than -1.40 and less than -1.20. From the table (see above), we find that P(Z < -1.20) = 0.1151; and P(Z < -1.40) = 0.0808. Therefore, P(-1.40 < Z < -1.20) = P(Z < -1.20) - P(Z < -1.40) = 0.1151 - 0.0808 = 0.0343.
c) Central Limit theorem
The
central limit theorem explains why many distributions tend to be close to the
normal distribution. The key ingredient is that the random variable being
observed should be the sum or mean of many independent identically distributed
random variables. One version of the theorem is
In this applet, we look at rolling dice
again. Let X be the number of spots showing when one die is
rolled. The mean value for rolling one die is 3.5, and the variance is . If Sn is the number
of spots showing when n dice are rolled, then if n is
``large'' the random variable
should
be approximately standard normal, so Sn itself
should be approximately normal with mean 3.5*n and variance 35n/12.
The red curve is the graph of the density function with these parameters.
No comments:
Post a Comment