Basic Concepts
If you roll a six-sided die, there are six possible outcomes, and each of these
outcomes is equally likely. A six is as likely to come up as a three, and likewise for the
other four sides of the die. What, then, is the probability that a one will come up? Since
there are six possible outcomes, the probability is 1/6. What is the probability that either
a one or a six will come up? The two outcomes about which we are concerned (a one or
a six coming up) are called favorable outcomes. Given that all outcomes are equally
likely, we can compute the probability of a one or a six using the formula:
Probability = (number of favorable outcomes) / (number of possible outcomes)
In this case there are two favorable outcomes and six possible outcomes. So the
probability of throwing either a one or six is 1/3. Don't be misled by our use of the term
"favorable," by the way. You should understand it in the sense of "favorable to the event
in question happening." That event might not be favorable to your well-being. You might
be betting on a three, for example.
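As a quick check of the favorable-over-possible formula, here is a small Python sketch; the `probability` helper is just illustrative, not part of the text:

```python
from fractions import Fraction

def probability(favorable, possible):
    # Probability = number of favorable outcomes / number of possible outcomes,
    # valid only when all outcomes are equally likely.
    return Fraction(favorable, possible)

die = range(1, 7)                            # six equally likely faces
favorable = [x for x in die if x in (1, 6)]  # a one or a six
print(probability(len(favorable), 6))        # 1/3
```

Using `Fraction` keeps the answer exact, matching the 1/3 computed by hand.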
The above formula applies to many games of chance. For example, what is the
probability that a card drawn at random from a deck of playing cards will be an ace?
Since the deck has four aces, there are four favorable outcomes; since the deck has 52
cards, there are 52 possible outcomes. The probability is therefore 4/52 = 1/13. What
about the probability that the card will be a club? Since there are 13 clubs, the
probability is 13/52 = 1/4.
Let's say you have a bag with 20 cherries: 14 sweet and 6 sour. If you pick a
cherry at random, what is the probability that it will be sweet? There are 20 possible
cherries that could be picked, so the number of possible outcomes is 20. Of these 20
possible outcomes, 14 are favorable (sweet), so the probability that the cherry will be
sweet is 14/20 = 7/10. There is one potential complication to this example, however. It
must be assumed that the probability of picking any of the cherries is the same as the
probability of picking any other. This wouldn't be true if (let us imagine) the sweet
cherries are smaller than the sour ones. (The sour cherries would come to hand more
readily when you sampled from the bag.) Let us keep in mind, therefore, that when we
assess probabilities in terms of the ratio of favorable to all potential cases, we rely
heavily on the assumption of equal probability for all outcomes.
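The equal-probability caveat can be illustrated with a short simulation. In this Python sketch the bias weights are hypothetical: we imagine each sour cherry is twice as likely to come to hand as each sweet one.

```python
import random

random.seed(1)  # reproducible simulation
cherries = ["sweet"] * 14 + ["sour"] * 6

# Equal-probability draws: the relative frequency approaches 14/20 = 0.7
fair = [random.choice(cherries) for _ in range(100_000)]
print(fair.count("sweet") / len(fair))      # close to 0.7

# Hypothetical bias: each sour cherry is twice as likely to be picked.
# The favorable/possible ratio 14/20 no longer applies; the true
# probability of a sweet cherry becomes 14/26.
weights = [1] * 14 + [2] * 6
biased = random.choices(cherries, weights=weights, k=100_000)
print(biased.count("sweet") / len(biased))  # close to 14/26, about 0.538
```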
Probability of A and B
When two events are independent, the probability of both occurring is the product
of the probabilities of the individual events. More formally, if events A and B are
independent, then the probability of both A and B occurring is:
P(A and B) = P(A) x P(B)
where P(A and B) is the probability of events A and B both occurring, P(A) is the
probability of event A occurring, and P(B) is the probability of event B occurring.
If you flip a coin twice, what is the probability that it will come up heads both
times? Event A is that the coin comes up heads on the first flip and Event B is that the
coin comes up heads on the second flip. Since both P(A) and P(B) equal 1/2, the
probability that both events occur is P(A and B) = 1/2 x 1/2 = 1/4.
Conditional Probabilities
Suppose you draw two cards from a deck, one after the other, without replacement. Once the first card chosen is an ace, the probability that the second card chosen
is also an ace is called the conditional probability of drawing an ace. In this case, the
"condition" is that the first card is an ace. Symbolically, we write this as:
P(ace on second draw | ace on first draw)
The vertical bar "|" is read as "given," so the above expression is short for: "The
probability that an ace is drawn on the second draw given that an ace was drawn on the
first draw." What is this probability? After an ace is drawn on the first draw, there
are 3 aces left out of 51 total cards. This means that the probability that one of these
aces will be drawn is 3/51 = 1/17.
If Events A and B are not independent, then P(A and B) = P(A) x P(B|A).
Applying this to the problem of two aces, the probability of drawing two aces from a
deck is 4/52 x 3/51 = 1/221.
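The two-aces result can be verified by brute force, enumerating every ordered two-card draw. A Python sketch (the rank-plus-suit card labels are an arbitrary encoding):

```python
from fractions import Fraction
from itertools import permutations

# Encode the deck as rank + suit, e.g. "AS" = ace of spades
deck = [rank + suit for rank in "A23456789TJQK" for suit in "SHDC"]  # 52 cards

pairs = list(permutations(deck, 2))  # every ordered two-card draw: 52 * 51
both_aces = [p for p in pairs if p[0][0] == "A" and p[1][0] == "A"]
print(Fraction(len(both_aces), len(pairs)))  # 1/221
```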
Birthday Problem
If there are 25 people in a room, what is the probability that at least two of them
share the same birthday? If your first thought is that it is 25/365 = 0.068, you will be
surprised to learn it is much higher than that. This problem requires the application of
the sections on P(A and B) and conditional probability.
This problem is best approached by asking what is the probability that no two
people have the same birthday. Once we know this probability, we can simply subtract it
from 1 to find the probability that two people share a birthday.
If we choose two people at random, what is the probability that they do not share
a birthday? Of the 365 days on which the second person could have a birthday, 364 of
them are different from the first person's birthday. Therefore the probability is 364/365.
Let's define P2 as the probability that the second person drawn does not share a
birthday with the person drawn previously. P2 is therefore 364/365. Now define P3 as
the probability that the third person drawn does not share a birthday with anyone drawn
previously given that there are no previous birthday matches. P3 is therefore a
conditional probability. If there are no previous birthday matches, then two of the 365
days have been "used up," leaving 363 non-matching days. Therefore P3 = 363/365. In
like manner, P4 = 362/365, P5 = 361/365, and so on up to P25 = 341/365.
In order for there to be no matches, the second person must not match any
previous person and the third person must not match any previous person, and the
fourth person must not match any previous person, etc. Since P(A and B) = P(A)P(B),
all we have to do is multiply P2, P3, P4 ...P25 together. The result is 0.431. Therefore
the probability of at least one match is 0.569.
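The chain of conditional probabilities above is easy to multiply out in code. A minimal Python sketch:

```python
p_no_match = 1.0
for k in range(1, 25):               # persons 2 through 25
    p_no_match *= (365 - k) / 365    # P2 = 364/365, ..., P25 = 341/365
print(round(p_no_match, 3))          # 0.431
print(round(1 - p_no_match, 3))      # 0.569
```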
Gambler's Fallacy
A fair coin is flipped five times and comes up heads each time. What is the
probability that it will come up heads on the sixth flip? The correct answer is, of course,
1/2. But many people believe that a tail is more likely to occur after throwing five heads.
Their faulty reasoning may go something like this: "In the long run, the number of heads
and tails will be the same, so the tails have some catching up to do."
In this section, we shall develop a few counting techniques. Such techniques will
enable us to count quantities such as the number of possible outcomes of an event, or
the number of ways several choices can be combined, without having to list all of the
items.
Before we learn some of the basic principles of counting, let's see some of the
notation we'll need.
Addition Rule
If two events E1 and E2 are mutually exclusive (they cannot both happen), then
n(E1 or E2) = n(E1) + n(E2)
where
n(E) = Number of outcomes of event E
Multiplication Rule
Now consider the case when two events E1 and E2 are to be performed and the
events E1 and E2 are independent events i.e. one does not affect the other's outcome.
Example
Say the only clean clothes you've got are 2 t-shirts and 4 pairs of jeans. How
many different combinations can you choose?
Answer
2 × 4 = 8 possible combinations
We could write this as n(E1) × n(E2) = 2 × 4 = 8, where E1 is choosing a t-shirt
and E2 is choosing a pair of jeans.
Suppose that event E1 can result in any one of n(E1) possible outcomes; and for
each outcome of the event E1, there are n(E2) possible outcomes of event E2.
Together there will be n(E1) × n(E2) possible outcomes of the two events.
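The multiplication rule can be checked by listing every pairing explicitly. A Python sketch of the t-shirts-and-jeans example:

```python
from itertools import product

tshirts = ["t-shirt 1", "t-shirt 2"]
jeans = ["jeans 1", "jeans 2", "jeans 3", "jeans 4"]

outfits = list(product(tshirts, jeans))  # every (t-shirt, jeans) pairing
print(len(outfits))  # 8 = n(E1) x n(E2) = 2 x 4
```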
Probability Rules
There are three main rules associated with basic probability: the addition rule,
P(A or B) = P(A) + P(B) - P(A and B); the multiplication rule, P(A and B) = P(A) x P(B|A);
and the complement rule, P(not A) = 1 - P(A). You can think of the complement rule as
the 'subtraction rule' if it helps you to remember it.
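As a sanity check, all three rules can be verified on a concrete experiment. This Python sketch uses one roll of a fair die, with two events chosen purely for illustration:

```python
from fractions import Fraction

outcomes = set(range(1, 7))  # one roll of a fair die
A = {2, 4, 6}                # event A: an even number
B = {5, 6}                   # event B: a number greater than 4

def p(event):
    return Fraction(len(event), len(outcomes))

# Addition rule: P(A or B) = P(A) + P(B) - P(A and B)
assert p(A | B) == p(A) + p(B) - p(A & B)

# Multiplication rule: P(A and B) = P(A) x P(B|A)
p_B_given_A = Fraction(len(A & B), len(A))
assert p(A & B) == p(A) * p_B_given_A

# Complement rule: P(not A) = 1 - P(A)
assert p(outcomes - A) == 1 - p(A)
print("all three rules check out")
```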
Types of Distributions
Bernoulli Distribution
A Bernoulli distribution has only two possible outcomes and a single trial, for
example one toss of a coin. Here, the occurrence of a head denotes success, and the
occurrence of a tail denotes failure.
For a fair coin, probability of getting a head = 0.5 = probability of getting a tail,
since the two outcomes are equally likely.
The probability of success (p) need not be the same as the probability of failure,
though. Consider an uneven contest, say a fight between me and Undertaker, where my
probability of winning is low.
Here, the probability of success = 0.15 and the probability of failure = 0.85. The
expected value is exactly what it sounds like. If I punch you, I may expect you to punch
me back. Basically, the expected value of any distribution is the mean of the distribution.
The expected value of a random variable X from a Bernoulli distribution is found as
follows:
E(X) = 1 x p + 0 x (1 - p) = p
There are many examples of the Bernoulli distribution, such as whether it is going
to rain tomorrow or not (where rain denotes success and no rain denotes failure), or
winning (success) versus losing (failure) a game.
Uniform Distribution
When you roll a fair die, the outcomes are 1 to 6. The probabilities of getting
these outcomes are equally likely and that is the basis of a uniform distribution. Unlike
Bernoulli Distribution, all the n number of possible outcomes of a uniform distribution are
equally likely.
The graph of a uniform distribution is flat: every outcome has the same
probability, so the shape of the curve is rectangular, which is why the uniform
distribution is also called the rectangular distribution.
The number of bouquets sold daily at a flower shop is uniformly distributed with a
maximum of 40 and a minimum of 10.
Let’s try calculating the probability that the daily sales will fall between 15 and 30.
The probability that daily sales will fall between 15 and 30 is (30-15)*(1/(40-10)) =
0.5. Similarly, the probability that daily sales are greater than 20 is (40-20)*(1/(40-10)) =
0.667.
The standard uniform density has parameters a = 0 and b = 1, so the PDF for the
standard uniform density is given by:
f(x) = 1 for 0 ≤ x ≤ 1, and f(x) = 0 otherwise
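The bouquet probabilities above can be reproduced with a one-line density calculation. A Python sketch (the `uniform_prob` helper is illustrative, with the shop's a = 10 and b = 40 as defaults):

```python
def uniform_prob(x1, x2, a=10, b=40):
    # P(x1 < X <= x2) = (x2 - x1) * 1/(b - a) for X uniform on [a, b]
    return (x2 - x1) / (b - a)

print(uniform_prob(15, 30))            # 0.5
print(round(uniform_prob(20, 40), 3))  # 0.667
```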
Binomial Distribution
Suppose that you won a coin toss today, and this indicates a successful event.
You toss again, but you lose this time. If you win a toss today, this does not mean that
you will win the toss tomorrow. Let's assign a random variable, say X, to the number of
times you won the toss. What can be the possible values of X? It can be any number
from 0 up to the number of times you tossed the coin.
There are only two possible outcomes. Head denoting success and tail denoting
failure. Therefore, probability of getting a head = 0.5 and the probability of failure can be
easily computed as: q = 1- p = 0.5.
A distribution where only two outcomes are possible, such as success or failure,
gain or loss, win or lose, and where the probability of success is the same for every
trial, is called a Binomial Distribution.
The outcomes need not be equally likely. Remember the example of a fight
between me and Undertaker? So, if the probability of success in an experiment is 0.2
then the probability of failure can be easily computed as q = 1 – 0.2 = 0.8.
Each trial is independent since the outcome of the previous toss doesn’t
determine or affect the outcome of the current toss. An experiment with only two
possible outcomes repeated n number of times is called binomial. The parameters of a
binomial distribution are n and p where n is the total number of trials and p is the
probability of success in each trial.
On the basis of the above explanation, the properties of a Binomial Distribution are:
1. Each trial is independent.
2. There are only two possible outcomes in a trial: success or failure.
3. A total of n identical trials are conducted.
4. The probability of success is the same for every trial.
When the probability of success does not equal the probability of failure, the
graph of the binomial distribution is skewed to one side; when the probability of
success equals the probability of failure, the graph is symmetric.
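The binomial probabilities behind these graphs can be computed directly from n and p. A Python sketch using the standard binomial PMF (the n = 10 example values are chosen for illustration):

```python
from math import comb

def binom_pmf(k, n, p):
    # P(X = k) for n independent trials with success probability p
    return comb(n, k) * p**k * (1 - p)**(n - k)

print(round(binom_pmf(5, 10, 0.5), 3))  # 0.246 (peak of the symmetric case)
print(round(binom_pmf(2, 10, 0.2), 3))  # 0.302 (skewed case with p = 0.2)

# When p = 0.5 the distribution is symmetric: P(X = k) = P(X = n - k)
assert binom_pmf(3, 10, 0.5) == binom_pmf(7, 10, 0.5)
```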
Normal Distribution
A normal distribution is symmetric about its mean, with a bell-shaped curve. It is
characterized by two parameters, the mean (µ) and the standard deviation (σ), and
many natural measurements are approximately normally distributed.
Poisson Distribution
Suppose you work at a call center, approximately how many calls do you get in a
day? It can be any number. The total number of calls at a call center in a day can be
modeled by a Poisson distribution. Some more examples are the number of emergency
calls recorded at a hospital in a day, the number of customers arriving at a store in an
hour, or the number of printing errors on a page of a book.
You can now think of many examples following the same course. Poisson
Distribution is applicable in situations where events occur at random points of time and
space wherein our interest lies only in the number of occurrences of the event.
1. Any successful event should not influence the outcome of another successful
event.
2. The probability of exactly one success in a short interval is proportional to the
length of the interval.
3. The probability of more than one success in a very small interval approaches
zero as the interval shrinks.
Let µ denote the mean number of events in an interval of length t. Then µ = λt,
where λ is the rate at which events occur per unit of time or space. The mean µ is the
parameter of this distribution, and the probability of observing k events in the interval is
P(X = k) = e^(-µ) µ^k / k!
As the mean µ increases, the curve of the Poisson distribution shifts to the right.
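The Poisson PMF is straightforward to evaluate. A Python sketch (the mean of 2 calls per interval is a made-up value for illustration):

```python
from math import exp, factorial

def poisson_pmf(k, mu):
    # P(X = k) = e^(-mu) * mu^k / k!  with mu = lambda * t
    return exp(-mu) * mu**k / factorial(k)

mu = 2  # assumed mean: an average of 2 calls per interval
print(round(poisson_pmf(0, mu), 3))  # 0.135
print(round(poisson_pmf(2, mu), 3))  # 0.271
```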
Exponential Distribution
Let's consider the call center example one more time. What about the interval of
time between the calls? Here, the exponential distribution comes to our rescue: it
models the interval of time between successive calls.
The exponential distribution is also widely used for survival analysis, from the
expected life of a machine to the expected life of a human. Its mean is 1/λ.
For survival analysis, λ is called the failure rate of a device at any time t, given
that it has survived up to t.
Also, the greater the rate λ, the faster the density curve drops; the lower the rate,
the flatter the curve.
P{X ≤ x} = 1 - e^(-λx), which corresponds to the area under the density curve to
the left of x.
P{X > x} = e^(-λx), which corresponds to the area under the density curve to the
right of x.
P{x1 < X ≤ x2} = e^(-λx1) - e^(-λx2), which corresponds to the area under the
density curve between x1 and x2.
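The three areas can be computed and cross-checked in a few lines. A Python sketch (λ = 0.5 is an arbitrary rate chosen for illustration):

```python
from math import exp

rate = 0.5  # lambda, an arbitrary rate chosen for illustration

def cdf(x):            # P(X <= x) = 1 - e^(-lambda * x)
    return 1 - exp(-rate * x)

def tail(x):           # P(X > x) = e^(-lambda * x)
    return exp(-rate * x)

def between(x1, x2):   # P(x1 < X <= x2) = e^(-lambda * x1) - e^(-lambda * x2)
    return tail(x1) - tail(x2)

# The left and right areas always cover the whole curve
assert abs(cdf(2) + tail(2) - 1) < 1e-12
print(round(between(1, 3), 3))  # 0.383
```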
Population vs sample
The population is the entire group that you want to draw conclusions about.
The sample is the specific group of individuals that you will collect data from.
The population can be very broad or quite narrow: maybe you want to make inferences about
the whole adult population of your country; maybe your research focuses on customers
of a certain company, patients with a specific health condition, or students in a single
school.
Sampling frame
The sampling frame is the actual list of individuals that the sample will be drawn
from. Ideally, it should include the entire target population (and nobody who is not part
of that population).
Example: You are doing research on working conditions in a company of 1,000
employees. Your population is all 1,000 employees of the company; your sampling
frame is the company's HR database, which lists the names and contact details of every
employee.
Sample size
The number of individuals in your sample depends on the size of the population,
and on how precisely you want the results to represent the population as a whole.
You can use a sample size calculator to determine how big your sample should
be. In general, the larger the sample size, the more accurately and confidently you can
make inferences about the whole population.
Probability sampling methods
Probability sampling means that every member of the population has a chance of
being selected. It is mainly used in quantitative research. If you want to produce results
that are representative of the whole population, you need to use a probability sampling
technique.
1. Simple Random Sampling
In a simple random sample, every member of the population has an equal chance of
being selected. Your sampling frame should include the whole population.
To conduct this type of sampling, you can use tools like random number generators or
other techniques that are based entirely on chance.
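Drawing a simple random sample with a random number generator might look like this in Python (the employee list is a hypothetical sampling frame):

```python
import random

random.seed(42)  # for reproducibility
# Hypothetical sampling frame: the company's 1,000 employees
employees = [f"employee_{i:04d}" for i in range(1, 1001)]

sample = random.sample(employees, k=100)  # each employee equally likely, no repeats
print(len(sample), len(set(sample)))      # 100 distinct people
```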
Example
You want to select a simple random sample of 100 employees. You assign a
number from 1 to 1000 to every employee in the company database, and use a random
number generator to select 100 numbers.
2. Systematic Sampling
Systematic sampling is similar to simple random sampling, but it is usually
slightly easier to conduct: every member of the population is listed with a number, but
instead of randomly generating numbers, individuals are chosen at regular intervals.
Example
All employees of the company are listed in alphabetical order. From the first 10
numbers, you randomly select a starting point: number 6. From number 6 onwards,
every 10th person on the list is selected (6, 16, 26, 36, and so on), and you end up with
a sample of 100 people.
If you use this technique, it is important to make sure that there is no hidden
pattern in the list that might skew the sample. For example, if the HR database groups
employees by team, and team members are listed in order of seniority, there is a risk
that your interval might skip over people in junior roles, resulting in a sample that is
skewed towards senior employees.
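A sketch of systematic sampling matching the example above (the employee list is hypothetical; the starting point is drawn at random from the first 10 positions):

```python
import random

random.seed(6)
# Hypothetical alphabetical list of 1,000 employees
employees = sorted(f"employee_{i:04d}" for i in range(1, 1001))

start = random.randrange(10)   # random starting point among the first 10
sample = employees[start::10]  # every 10th person from there on
print(len(sample))             # 100
```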
3. Stratified Sampling
This sampling method is appropriate when the population has mixed characteristics,
and you want to ensure that every characteristic is proportionally represented in the
sample.
You divide the population into subgroups (called strata) based on the relevant
characteristic (e.g. gender, age range, income bracket, job role).
From the overall proportions of the population, you calculate how many people should
be sampled from each subgroup. Then you use random or systematic sampling to
select a sample from each subgroup.
Example
The company has 800 female employees and 200 male employees. You want to
ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on each
group, selecting 80 women and 20 men, which gives you a representative sample of
100 people.
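The proportional allocation in this example can be sketched as follows (the stratum lists are hypothetical placeholders for the real employee records):

```python
import random

random.seed(0)
# Hypothetical strata matching the example: 800 women, 200 men
women = [f"woman_{i}" for i in range(800)]
men = [f"man_{i}" for i in range(200)]

sample_size = 100
total = len(women) + len(men)

# Sample each stratum in proportion to its share of the population
sample = (random.sample(women, sample_size * len(women) // total) +
          random.sample(men, sample_size * len(men) // total))
print(len(sample))  # 100: 80 women and 20 men
```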
4. Cluster Sampling
Cluster sampling also involves dividing the population into subgroups, but each
subgroup should have similar characteristics to the whole population. Instead of
sampling individuals from each subgroup, you randomly select entire subgroups.
If it is practically possible, you might include every individual from each sampled
cluster. If the clusters themselves are large, you can also sample individuals from within
each cluster using one of the techniques above.
This method is good for dealing with large and dispersed populations, but there is
more risk of error in the sample, as there could be substantial differences between
clusters. It’s difficult to guarantee that the sampled clusters are really representative of
the whole population.
Example
The company has offices in 10 cities across the country (all with roughly the
same number of employees in similar roles). You don’t have the capacity to travel to
every office to collect your data, so you use random sampling to select 3 offices – these
are your clusters.
Non-probability sampling methods
In a non-probability sample, individuals are selected based on non-random
criteria, and not every individual has a chance of being included.
1. Convenience Sampling
A convenience sample simply includes the individuals who happen to be most
accessible to the researcher.
This is an easy and inexpensive way to gather initial data, but there is no way to tell if
the sample is representative of the population, so it can’t produce generalizable results.
Example
You are researching opinions about student support services in your university,
so after each of your classes, you ask your fellow students to complete a survey on the
topic. This is a convenient way to gather data, but as you only surveyed students taking
the same classes as you at the same level, the sample is not representative of all the
students at your university.
2. Voluntary Response Sampling
Similar to a convenience sample, a voluntary response sample is mainly based
on ease of access; instead of the researcher choosing participants, people volunteer
themselves (for example, by responding to a public survey). Voluntary response
samples are always at least somewhat biased, as some people will inherently be more
likely to volunteer than others.
Example
You send out the survey to all students at your university and a lot of students
decide to complete it. This can certainly give you some insight into the topic, but the
people who responded are more likely to be those who have strong opinions about the
student support services, so you can’t be sure that their opinions are representative of
all students.
3. Purposive Sampling
This type of sampling involves the researcher using their judgement to select a
sample that is most useful to the purposes of the research.
It is often used in qualitative research, where the researcher wants to gain detailed
knowledge about a specific phenomenon rather than make statistical inferences. An
effective purposive sample must have clear criteria and rationale for inclusion.
Example
You want to know more about the opinions and experiences of disabled students
at your university, so you purposefully select a number of students with different support
needs in order to gather a varied range of data on their experiences with student
services.
4. Snowball Sampling
If the population is hard to access, snowball sampling can be used to recruit
participants via other participants. The number of people you have access to
“snowballs” as you get in contact with more people.
Example
You are researching experiences of homelessness in your city. Since there is no
list of all homeless people in the city, probability sampling is not possible. You meet one
person who agrees to participate, and they put you in contact with other people they
know in the same situation.
Sources:
https://www.investopedia.com/terms/p/probabilitydistribution.asp
https://www.analyticsvidhya.com/blog/2017/09/6-probability-distributions-data-science/
https://study.com/academy/lesson/basic-probability-theory-rules-formulas.html
http://onlinestatbook.com/2/probability/basic.html
https://www.scribbr.com/methodology/sampling-methods/
https://www.intmath.com/counting-probability/2-basic-principles-counting.php