Bayes' Theorem is a mathematical formula that helps determine the conditional probability of an event based on prior knowledge and new evidence.
It adjusts probabilities when new information comes in and helps make better decisions in uncertain situations.
Bayes' Theorem helps us update probabilities based on prior knowledge and new evidence. In this case, knowing that the pet is quiet (new information), we can use Bayes' Theorem to calculate the updated probability of the pet being a cat or a dog, based on how likely each animal is to be quiet.
Bayes Theorem and Conditional Probability
Bayes' theorem (also known as the Bayes Rule or Bayes Law) is used to determine the conditional probability of event A when event B has already occurred.
The general statement of Bayes’ theorem is “The conditional probability of an event A, given the occurrence of another event B, is equal to the product of the event of B, given A, and the probability of A divided by the probability of event B.” i.e.
For example, if we want to find the probability that a white marble drawn at random came from the first bag, given that a white marble has already been drawn, and there are three bags each containing some white and black marbles, then we can use Bayes’ Theorem.
Check: Bayes’s Theorem for Conditional Probability
For any two events A and B, Bayes's formula for the Bayes theorem is given by:
Formula for the Bayes theoremWhere,
- P(A) and P(B) are the probabilities of events A and B, also, P(B) is never equal to zero.
- P(A|B) is the probability of event A when event B happens,
- P(B|A) is the probability of event B when A happens.
Bayes Theorem Statement
Bayes's Theorem for n sets of events is defined as,
Let E1, E2,…, En be a set of events associated with the sample space S, in which all the events E1, E2,…, En have a non-zero probability of occurrence. All the events E1, E2,…, E form a partition of S. Let A be an event from space S for which we have to find the probability, then according to Bayes theorem,
P(E_i \mid A) = \frac{P(E_i) \cdot P(A \mid E_i)}{\sum_{k=1}^{n} P(E_k) \cdot P(A \mid E_k)}
for k = 1, 2, 3, …., n
Bayes Theorem Derivation
The proof of Bayes's, Theorem is given as, according to the conditional probability formula,
P(E_i \mid A) = \frac{P(E_i \cap A)}{P(A)}.....(i)
Then, by using the multiplication rule of probability, we get
P(E_i \cap A) = P(E_i) \cdot P(A \mid E_i)......(ii)
Now, by the total probability theorem,
P(A) = \sum_{k=1}^{n} P(E_k) \cdot P(A \mid E_k).....(iii)
Substituting the value of P(Ei∩A) and P(A) from eq (ii) and eq(iii) in eq(i) we get,
P(E_i \mid A) = \frac{P(E_i) \cdot P(A \mid E_i)}{\sum_{k=1}^{n} P(E_k) \cdot P(A \mid E_k)}
Bayes’ theorem is also known as the formula for the probability of “causes”. As we know, the Ei‘s are a partition of the sample space S, and at any given time, only one of the events Ei occurs. Thus, we conclude that the Bayes theorem formula gives the probability of a particular Ei, given that event A has occurred.
After learning about Bayes theorem in detail, let us understand some important terms related to the concepts we covered in the formula and derivation.
Hypotheses
- Hypotheses refer to possible events or outcomes in the sample space, they are denoted as E1, E2, …, En.
- Each hypothesis represents a distinct scenario that could explain an observed event.
Priori Probability
- Priori Probability P(Ei) is the initial probability of an event occurring before any new data is taken into account.
- It reflects existing knowledge or assumptions about the event.
- Example: The probability of a person having a disease before taking a test.
Posterior Probability
- Posterior probability (P(Ei∣A) is the updated probability of an event after considering new information.
- It is derived using the Bayes Theorem.
- Example: The probability of having a disease given a positive test result.
Conditional Probability
- The probability of an event A based on the occurrence of another event B is termed conditional Probability.
- It is denoted as P(A|B) and represents the probability of A when event B has already happened.
Joint Probability
- When the probability of two or more events occurring together and at the same time is measured, it is marked as Joint Probability.
- For two events A and B, it is denoted by joint probability is denoted as P(A∩B).
Random Variables
- Real-valued variables whose possible values are determined by random experiments are called random variables.
- The probability of finding such variables is the experimental probability.
Bayes Theorem Applications
Bayesian inference is very important and has found application in various activities, including medicine, science, philosophy, engineering, sports, law, etc., and Bayesian inference is directly derived from Bayes theorem.
Some of the Key Applications are:
- Medical Testing → Finding the real probability of having a disease after a positive test.
- Spam Filters → Checking if an email is spam based on keywords.
- Weather Prediction → Updating the chance of rain based on new data.
- AI & Machine Learning → Used in Naïve Bayes classifiers to predict outcomes.
Check, Bayes' Life Applications of Bayes theorem
Difference Between Conditional Probability and Bayes Theorem
The difference between Conditional Probability and Bayes's. The theorem can be understood with the help of the table given below.
Bayes Theorem | Conditional Probability |
---|
Bayes's Theorem is derived using the definition of conditional probability. It is used to find the reverse probability. | Conditional Probability is the probability of event A when event B has already occurred. |
Formula: P(A|B) = [P(B|A)P(A)] / P(B) | Formula: P(A|B) = P(A∩B) / P(B) |
Purpose: To update the probability of an event based on new evidence. | Purpose: To find the probability of one event based on the occurrence of another. |
Focus: Uses prior knowledge and evidence to compute a revised probability. | Focus: Direct relationship between two events. |
Theorem of Total Probability
Let E1, E2,…., En be mutually exclusive and exhaustive events of a sample space S, and let E be any event that occurs with some Ei. Then, prove that :
P(E) = n∑i=1P(E/Ei) . P(Ei)
Proof:
Let S be the sample space.
Since the events E1, E2,…,En are mutually exclusive and exhaustive, we have:
S = E1 ∪ E2 ∪ E3 ∪ . . . ∪ En and Ei ∩ Ej = ∅ for i ≠ j.
Now, consider the event E: E = E ∩ S
Substituting S with the union of Ei's:
⇒ E = E ∩ (E1 ∪ E2 ∪ E3 ∪ . . . ∪ En)
Using distributive law:
⇒ E = (E ∩ E1) ∪ (E ∩ E2) ∪ . . . ∪ (E ∩ En)
Since the events Ei are mutually exclusive, the intersections E∩Ei are also mutually exclusive. Therefore:
P(E) = P{(E ∩ E1) ∪ (E ∩ E2)∪ . . . ∪(E ∩ En)}
⇒ P(E) = P(E ∩ E1) + P(E ∩ E2) + . . . + P(E ∩ En)
{Therefore, (E ∩ E1), (E ∩ E2), . . . ,(E ∩ En)} are pairwise disjoint}
⇒ P(E) = P(E/E1) . P(E1) + P(E/E2) . P(E2) + . . . + P(E/En) . P(En) [by multiplication theorem]
⇒ P(E) = n∑i=1P(E/Ei) . P(Ei)
Articles Related to Bayes' Theorem-
Bayes Theorem for Programmers-
Solved Examples of Bayes's Theorem
Example 1: A person has undertaken a job. The probabilities of completion of the job on time with and without rain are 0.44 and 0.9, and 5, respectively. If the probability that it will rain is 0.45, then determine the probability that the job will be completed on time.
Solution:
Let E1 be the event that the mining job will be completed on time and E2 be the event that it rains. We have,
P(A) = 0.45,
P(no rain) = P(B) = 1 − P(A) = 1 − 0.45 = 0.55
By multiplication law of probability,
P(E1) = 0.44, and P(E2) = 0.95
Since, events A and B form partitions of the sample space S, by total probability theorem, we have
P(E) = P(A) P(E1) + P(B) P(E2)
⇒ P(E) = 0.45 × 0.44 + 0.55 × 0.95
⇒ P(E) = 0.198 + 0.5225 = 0.7205
So, the probability that the job will be completed on time is 0.7205
Example 2: There are three urns containing 3 white and 2 black balls, 2 white and 3 black balls, and 1 black and 4 white balls, respectively. There is an equal probability of each urn being chosen. One ball is equal probability chosen at random. What is the probability that a white ball will be drawn?
Solution:
Let E1, E2, and E3 be the events of choosing the first, second, and third urn respectively. Then,
P(E1) = P(E2) = P(E3) = 1/3
Let E be the event that a white ball is drawn. Then,
P(E/E1) = 3/5, P(E/E2) = 2/5, P(E/E3) = 4/5
By theorem of total probability, we have
P(E) = P(E/E1) . P(E1) + P(E/E2) . P(E2) + P(E/E3) . P(E3)
⇒ P(E) = (3/5 × 1/3) + (2/5 × 1/3) + (4/5 × 1/3)
⇒ P(E) = 9/15 = 3/5
Example 3: A card from a pack of 52 cards is lost. From the remaining cards of the pack, two cards are drawn and are found to be both hearts. Find the probability of the lost card being a heart.
Solution:
Let E1, E2, E3, and E4 be the events of losing a card of hearts, clubs, spades, and diamonds respectively.
Then P(E1) = P(E2) = P(E3) = P(E4) = 13/52 = 1/4.
Let E be the event of drawing 2 hearts from the remaining 51 cards. Then,
P(E|E1) = probability of drawing 2 hearts, given that a card of hearts is missing
⇒ P(E|E1) = 12C2 / 51C2 = (12 × 11)/2! × 2!/(51 × 50) = 22/425
P(E|E2) = probability of drawing 2 clubs ,given that a card of clubs is missing
⇒ P(E|E2) = 13C2 / 51C2 = (13 × 12)/2! × 2!/(51 × 50) = 26/425
P(E|E3) = probability of drawing 2 spades ,given that a card of hearts is missing
⇒ P(E|E3) = 13C2 / 51C2 = 26/425
P(E|E4) = probability of drawing 2 diamonds ,given that a card of diamonds is missing
⇒ P(E|E4) = 13C2 / 51C2 = 26/425
Therefore,
P(E1|E) = probability of the lost card is being a heart, given the 2 hearts are drawn from the remaining 51 cards
⇒ P(E1|E) = P(E1) . P(E|E1)/P(E1) . P(E|E1) + P(E2) . P(E|E2) + P(E3) . P(E|E3) + P(E4) . P(E|E4)
⇒ P(E1|E) = (1/4 × 22/425) / {(1/4 × 22/425) + (1/4 × 26/425) + (1/4 × 26/425) + (1/4 × 26/425)}
⇒ P(E1|E) = 22/100 = 0.22
Hence, The required probability is 0.22.
Example 4: Suppose 15 men out of 300 men and 25 women out of 1000 are good orators. An orator is chosen at random. Find the probability that a male person is selected.
Solution:
Given,
- Total Men = 300
- Total Women = 1000
- Good Orators among Men = 15
- Good Orators among Women = 25
Total number of good orators = 15 (from men) + 25 (from women) = 40
Probability of selecting a male orator:
P(Male Orator) = Numbers of male orators / total no of orators = 15/40 = 3/8
Example 5: A man is known to speak the lies 1 out of 4 times. He throws a die and reports that it is a six. Find the probability that it Bayes' actually a six.
Solution:
In a throw of a die, let
E1 = event of getting a six,
E2 = event of not getting a six and
E = event that the man reports that it is a six.
Then, P(E1) = 1/6, and P(E2) = (1 - 1/6) = 5/6
P(E|E1) = probability that the man reports that six occurs when six has actually occurred
⇒ P(E|E1) = probability that the man speaks the truth
⇒ P(E|E1) = 3/4
P(E|E2) = probability that the man reports that six occurs when six has not actually occurred
⇒ P(E|E2) = probability that the man does not speak the truth
⇒ P(E|E2) = (1 - 3/4) = 1/4
Probability of getting a six ,given that the man reports it to be six
P(E1|E) = P(E|E1) × P(E1)/P(E|E1) × P(E1) + P(E|E2) × P(E2) [by Bayes theorem]
⇒ P(E1|E) = (3/4 × 1/6)/{(3/4 × 1/6) + (1/4 × 5/6)}
⇒ P(E1|E) = (1/8 × 3) = 3/8
Hence the probability required is 3/8.
Similar Reads
Basic Arithmetic
What are Numbers?Numbers are symbols we use to count, measure, and describe things. They are everywhere in our daily lives and help us understand and organize the world.Numbers are like tools that help us:Count how many things there are (e.g., 1 apple, 3 pencils).Measure things (e.g., 5 meters, 10 kilograms).Show or
15+ min read
Arithmetic OperationsArithmetic Operations are the basic mathematical operationsâAddition, Subtraction, Multiplication, and Divisionâused for calculations. These operations form the foundation of mathematics and are essential in daily life, such as sharing items, calculating bills, solving time and work problems, and in
9 min read
Fractions - Definition, Types and ExamplesFractions are numerical expressions used to represent parts of a whole or ratios between quantities. They consist of a numerator (the top number), indicating how many parts are considered, and a denominator (the bottom number), showing the total number of equal parts the whole is divided into. For E
7 min read
What are Decimals?Decimals are numbers that use a decimal point to separate the whole number part from the fractional part. This system helps represent values between whole numbers, making it easier to express and measure smaller quantities. Each digit after the decimal point represents a specific place value, like t
10 min read
ExponentsExponents are a way to show that a number is multiplied by itself many times. They are everywhere in math and science, helping us write big and easily.Think of exponents like a shortcut for repeated multiplication:23 means 2 x 2 x 2 = 8 52 means 5 x 5 = 25So instead of writing the same number many t
8 min read
PercentageIn mathematics, a percentage is a figure or ratio that signifies a fraction out of 100, i.e., A fraction whose denominator is 100 is called a Percent. In all the fractions where the denominator is 100, we can remove the denominator and put the % sign.For example, the fraction 23/100 can be written a
5 min read
Algebra
Variable in MathsA variable is like a placeholder or a box that can hold different values. In math, it's often represented by a letter, like x or y. The value of a variable can change depending on the situation. For example, if you have the equation y = 2x + 3, the value of y depends on the value of x. So, if you ch
5 min read
Polynomials| Degree | Types | Properties and ExamplesPolynomials are mathematical expressions made up of variables (often represented by letters like x, y, etc.), constants (like numbers), and exponents (which are non-negative integers). These expressions are combined using addition, subtraction, and multiplication operations.A polynomial can have one
9 min read
CoefficientA coefficient is a number that multiplies a variable in a mathematical expression. It tells you how much of that variable you have. For example, in the term 5x, the coefficient is 5 â it means 5 times the variable x.Coefficients can be positive, negative, or zero. Algebraic EquationA coefficient is
8 min read
Algebraic IdentitiesAlgebraic Identities are fundamental equations in algebra where the left-hand side of the equation is always equal to the right-hand side, regardless of the values of the variables involved. These identities play a crucial role in simplifying algebraic computations and are essential for solving vari
14 min read
Properties of Algebraic OperationsAlgebraic operations are mathematical processes that involve the manipulation of numbers, variables, and symbols to produce new results or expressions. The basic algebraic operations are:Addition ( + ): The process of combining two or more numbers to get a sum. For example, 3 + 5 = 8.Subtraction (â)
3 min read
Geometry
Lines and AnglesLines and Angles are the basic terms used in geometry. They provide a base for understanding all the concepts of geometry. We define a line as a 1-D figure that can be extended to infinity in opposite directions, whereas an angle is defined as the opening created by joining two or more lines. An ang
9 min read
Geometric Shapes in MathsGeometric shapes are mathematical figures that represent the forms of objects in the real world. These shapes have defined boundaries, angles, and surfaces, and are fundamental to understanding geometry. Geometric shapes can be categorized into two main types based on their dimensions:2D Shapes (Two
2 min read
Area and Perimeter of Shapes | Formula and ExamplesArea and Perimeter are the two fundamental properties related to 2-dimensional shapes. Defining the size of the shape and the length of its boundary. By learning about the areas of 2D shapes, we can easily determine the surface areas of 3D bodies and the perimeter helps us to calculate the length of
10 min read
Surface Areas and VolumesSurface Area and Volume are two fundamental properties of a three-dimensional (3D) shape that help us understand and measure the space they occupy and their outer surfaces.Knowing how to determine surface area and volumes can be incredibly practical and handy in cases where you want to calculate the
10 min read
Points, Lines and PlanesPoints, Lines, and Planes are basic terms used in Geometry that have a specific meaning and are used to define the basis of geometry. We define a point as a location in 3-D or 2-D space that is represented using coordinates. We define a line as a geometrical figure that is extended in both direction
14 min read
Coordinate Axes and Coordinate Planes in 3D spaceIn a plane, we know that we need two mutually perpendicular lines to locate the position of a point. These lines are called coordinate axes of the plane and the plane is usually called the Cartesian plane. But in real life, we do not have such a plane. In real life, we need some extra information su
6 min read
Trigonometry & Vector Algebra
Trigonometric RatiosThere are three sides of a triangle Hypotenuse, Adjacent, and Opposite. The ratios between these sides based on the angle between them is called Trigonometric Ratio. The six trigonometric ratios are: sine (sin), cosine (cos), tangent (tan), cotangent (cot), cosecant (cosec), and secant (sec).As give
4 min read
Trigonometric Equations | Definition, Examples & How to SolveTrigonometric equations are mathematical expressions that involve trigonometric functions (such as sine, cosine, tangent, etc.) and are set equal to a value. The goal is to find the values of the variable (usually an angle) that satisfy the equation.For example, a simple trigonometric equation might
9 min read
Trigonometric IdentitiesTrigonometric identities play an important role in simplifying expressions and solving equations involving trigonometric functions. These identities, which include relationships between angles and sides of triangles, are widely used in fields like geometry, engineering, and physics. Some important t
10 min read
Trigonometric FunctionsTrigonometric Functions, often simply called trig functions, are mathematical functions that relate the angles of a right triangle to the ratios of the lengths of its sides.Trigonometric functions are the basic functions used in trigonometry and they are used for solving various types of problems in
6 min read
Inverse Trigonometric Functions | Definition, Formula, Types and Examples Inverse trigonometric functions are the inverse functions of basic trigonometric functions. In mathematics, inverse trigonometric functions are also known as arcus functions or anti-trigonometric functions. The inverse trigonometric functions are the inverse functions of basic trigonometric function
11 min read
Inverse Trigonometric IdentitiesInverse trigonometric functions are also known as arcus functions or anti-trigonometric functions. These functions are the inverse functions of basic trigonometric functions, i.e., sine, cosine, tangent, cosecant, secant, and cotangent. It is used to find the angles with any trigonometric ratio. Inv
9 min read
Calculus
Introduction to Differential CalculusDifferential calculus is a branch of calculus that deals with the study of rates of change of functions and the behaviour of these functions in response to infinitesimal changes in their independent variables.Some of the prerequisites for Differential Calculus include:Independent and Dependent Varia
6 min read
Limits in CalculusIn mathematics, a limit is a fundamental concept that describes the behaviour of a function or sequence as its input approaches a particular value. Limits are used in calculus to define derivatives, continuity, and integrals, and they are defined as the approaching value of the function with the inp
12 min read
Continuity of FunctionsContinuity of functions is an important unit of Calculus as it forms the base and it helps us further to prove whether a function is differentiable or not. A continuous function is a function which when drawn on a paper does not have a break. The continuity can also be proved using the concept of li
13 min read
DifferentiationDifferentiation in mathematics refers to the process of finding the derivative of a function, which involves determining the rate of change of a function with respect to its variables.In simple terms, it is a way of finding how things change. Imagine you're driving a car and looking at how your spee
2 min read
Differentiability of a Function | Class 12 MathsContinuity or continuous which means, "a function is continuous at its domain if its graph is a curve without breaks or jumps". A function is continuous at a point in its domain if its graph does not have breaks or jumps in the immediate neighborhood of the point. Continuity at a Point: A function f
11 min read
IntegrationIntegration, in simple terms, is a way to add up small pieces to find the total of something, especially when those pieces are changing or not uniform.Imagine you have a car driving along a road, and its speed changes over time. At some moments, it's going faster; at other moments, it's slower. If y
3 min read
Probability and Statistics
Basic Concepts of ProbabilityProbability is defined as the likelihood of the occurrence of any event. Probability is expressed as a number between 0 and 1, where, 0 is the probability of an impossible event and 1 is the probability of a sure event.Concepts of Probability are used in various real life scenarios : Stock Market :
6 min read
Bayes' TheoremBayes' Theorem is a mathematical formula that helps determine the conditional probability of an event based on prior knowledge and new evidence.It adjusts probabilities when new information comes in and helps make better decisions in uncertain situations.Bayes' Theorem helps us update probabilities
12 min read
Probability Distribution - Function, Formula, TableA probability distribution is a mathematical function or rule that describes how the probabilities of different outcomes are assigned to the possible values of a random variable. It provides a way of modeling the likelihood of each outcome in a random experiment.While a frequency distribution shows
15+ min read
Descriptive StatisticStatistics is the foundation of data science. Descriptive statistics are simple tools that help us understand and summarize data. They show the basic features of a dataset, like the average, highest and lowest values and how spread out the numbers are. It's the first step in making sense of informat
5 min read
What is Inferential Statistics?After learning basic statistics like how data points relate (covariance and correlation) and probability distributions, the next important step is Inferential Statistics. Unlike descriptive statistics, which just summarizes data, inferential statistics helps us make predictions and conclusions about
6 min read
Measures of Central Tendency in StatisticsCentral tendencies in statistics are numerical values that represent the middle or typical value of a dataset. Also known as averages, they provide a summary of the entire data, making it easier to understand the overall pattern or behavior. These values are useful because they capture the essence o
11 min read
Set TheorySet theory is a branch of mathematics that deals with collections of objects, called sets. A set is simply a collection of distinct elements, such as numbers, letters, or even everyday objects, that share a common property or rule.Example of SetsSome examples of sets include:A set of fruits: {apple,
3 min read
Practice