Skip to content
geeksforgeeks
  • Tutorials
    • Python
    • Java
    • Data Structures & Algorithms
    • ML & Data Science
    • Interview Corner
    • Programming Languages
    • Web Development
    • CS Subjects
    • DevOps And Linux
    • School Learning
    • Practice Coding Problems
  • Courses
    • DSA to Development
    • Get IBM Certification
    • Newly Launched!
      • Master Django Framework
      • Become AWS Certified
    • For Working Professionals
      • Interview 101: DSA & System Design
      • Data Science Training Program
      • JAVA Backend Development (Live)
      • DevOps Engineering (LIVE)
      • Data Structures & Algorithms in Python
    • For Students
      • Placement Preparation Course
      • Data Science (Live)
      • Data Structure & Algorithm-Self Paced (C++/JAVA)
      • Master Competitive Programming (Live)
      • Full Stack Development with React & Node JS (Live)
    • Full Stack Development
    • Data Science Program
    • All Courses
  • Number System and Arithmetic
  • Algebra
  • Set Theory
  • Probability
  • Statistics
  • Geometry
  • Calculus
  • Logarithms
  • Mensuration
  • Matrices
  • Trigonometry
  • Mathematics
Open In App
Next Article:
Bar Graphs and Histograms
Next article icon

Introduction of Statistics and its Types

Last Updated : 11 Apr, 2025
Comments
Improve
Suggest changes
Like Article
Like
Report

Statistics is a branch of mathematics that deals with collecting, organizing, and analyzing numerical data to solve real-world problems. It is widely regarded as a distinct scientific discipline due to its vast applications across various fields.

  • Helps make sense of complex data through quantitative models.
  • Plays a critical role in decision-making in fields like weather forecasting, stock market analysis, insurance, and data science.

Here are some examples of statistical concepts in action:


Table of Content

  • Statistics Terminologies
  • Statistics Examples
  • Types of Statistics
  • Descriptive Statistics
    • Measure of Central Tendency
  • Inferential Statistics
  • Hypothesis Testing
  • Correlation Coefficient
  • Coefficient of Variation
  • Data in Statistics
  • Representation of Data
  • Models of Statistics
  • Data Analysis
  • Types of Data Analysis
  • Applications of Statistics
  • Business Statistics
  • Scope of Statistics
  • Limitations of Statistics

Statistics in Mathematics is the field that focuses on the collection, analysis, interpretation, and presentation of data. By analyzing numerical data, it allows us to draw meaningful conclusions and insights from complex data sets.

  • According to Merriam-Webster: Statistics is the science of collecting, analyzing, interpreting, and presenting masses of numerical data.
  • According to Oxford English Dictionary: Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data.

Statistics Terminologies

Some of the most common terms you might come across in statistics are:

  • Population: It is actually a collection of a set of individual objects or events whose properties are to be analyzed.
  • Sample: It is the subset of a population.
  • Variable: It is a characteristic that can have different values.
  • Parameter: It is numerical characteristic of population.

Statistics Examples

Some real-life examples of statistics that you might have seen:

Example 1:In a class of 45 students, we calculate their mean marks to evaluate performance of that class.

Example 2: Before elections, you might have seen exit polls. Exit polls are opinion of population sample, that are used to predict election results.

Types of Statistics

There are 2 types of statistics:

  • Descriptive Statistics
  • Inferential Statistics

Types of statistics are explained in the image added below:

stat
Types of statistics

Now, let's learn the same in detail.

Descriptive Statistics

Descriptive statistics uses data that describes the population either through numerical calculated graphs or tables. It provides a graphical summary of data.

It is simply used for summarizing objects, etc. There are two categories in this as follows.

  • Measure of Central Tendency
  • Measure of Variability

Let's discuss both categories in detail.

Measure of Central Tendency

Measure of central tendency is also known as summary statistics that are used to represent the center point or a particular value of a data set or sample set. In statistics, three common measures of central tendency are:

  • Mean
  • Median
  • Mode

Statistics formula

StatisticFormulaDefinitionExample
Meanx̄=∑ x/n​​The arithmetic average of a set of values. It's calculated by adding up all the values in the data set and dividing by the number of values.Example: Consider the data set {2, 4, 6, 8, 10}.
(2+4+6+8+10/ 5) = (30/5)
Mean = 6
​
MedianMiddle value in an ordered data setThe middle value of a data set when it is arranged in ascending or descending order. If there's an odd number of observations, it's the value at the center position. If there's an even number, it's the average of the two middle values.Example: Data set {3, 6, 9, 12, 15}. Median = 9
ModeValue that appears most frequently in a data setThe value that occurs most frequently in a data set. There can be one mode (unimodal), multiple modes (multimodal), or no mode if all values occur with the same frequency.Example: Data set {2, 3, 4, 4, 5, 6, 6, 6, 7}. Mode = 6

Mean

It is the measure of the average of all values in a sample set. The mean of the data set is calculated using the following formula:

Mean = ∑\frac{x}{n}

For example:

Cars

Mileage

Cylinder

Swift

21.3

3

Verna

20.8

2

Santra

19

5

Mean = (Sum of all Terms)/(Total Number of Terms)

⇒ Mean = (21.3 + 20.8 + 19) /3 = 61.1/3
⇒ Mean = 20.366

Median

It is the measure of the central value of a sample set. In these, the data set is ordered from lowest to highest value and then finds the exact middle. The formula used to calculate the median of the data set is, suppose we are given 'n' terms in s data set:

If n is Even, Median = [(n/2)th term + {(n/2 )+ 1}th term]/2

If n is Odd, Median = \frac{(n + 1)}{2}

For example:

Cars

Mileage

Cylinder

Swift

21.3

3

Verna

20.8

2

Santra

19

5

i-20154

Data in at order: 15, 19, 20.8, 21.3

⇒ Median = (20.8 + 19) /2 = 39.8/2
⇒ Median = 19.9

Mode

It is the value most frequently arrived at in the sample set. The value repeated most of the time in the central set is the mode. The mode of the data set is calculated using the following formula:

Mode = Term with Highest Frequency

  • For example: {2, 3, 4, 2, 4, 6, 4, 7, 7, 4, 2, 4}
  • 4 is the most frequent term in this data set.
  • Thus, the mode is 4.

Measure of Variability

The measure of Variability is also known as the measure of dispersion and is used to describe variability in a sample or population. In statistics, there are three common measures of variability, as shown below:

1. Range of Data

It is a given measure of how to spread apart values in a sample set or data set.

Range = Maximum value - Minimum value

2. Variance

In probability theory and statistics, variance measures a data set's spread or dispersion. It is calculated by averaging the squared deviations from the mean. Variance is usually represented by the symbol σ2.

S^2 = \sum_{i=1}^{n} \frac{(x_i - \bar{x})^2}{n}

  • n represents total data points
  • ͞x represents the mean of data points
  • xi represents individual data points

Variance measures variability. The more spread out the data, the greater the variance compared to the average.

There are two types of variance:

  • Population variance: Often represented as σ²
  • Sample variance is often represented as s².

Note: The standard deviation is the square root of the variance.

3. Standard Deviation

Standard Deviation is a measure of how widely distributed a set of values are from the mean. It compares every data point to the average of all the data points.

Standard Deviation Formula

s = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2}

where,

  • s is Population Standard Deviation
  • xi is ith observation
  • x̄ is the Sample Mean
  • N is the Number of Observations

A low Standard Deviationmeans values are close to the average, while a high standard deviation means values spread out over a wider range. Standard deviation is like the distance between points but is applied differently. Using algebra on squares and square roots rather than absolute values, makes standard deviation convenient in various mathematical applications.

4. Interquartile Range (IQR)

The Interquartile Range is a measure of statistical dispersion, or how spread out the data points are in a data set. It is the range between the first quartile (Q1) and the third quartile (Q3) of a data set, which represents the middle 50% of the data.

Formula for IQR:

IQR= Q3−Q1

Where:

  • Q1 is the first quartile (25th percentile), which is the median of the lower half of the data.
  • Q3 is the third quartile (75th percentile), which is the median of the upper half of the data.

Inferential Statistics

Inferential Statistics makes inferences and predictions about the population based on a sample of data taken from the population. It generalizes a large dataset and applies probabilities to conclude.

It is simply used for explaining the meaning of descriptive stats. It is simply used to analyze, interpret results, and draw conclusions. Inferential Statistics is mainly related to and associated with hypothesis testing, whose main target is to reject the null hypothesis.

Types of Inferential Statistics

Various types of inferential statistics are used widely nowadays and are very easy to interpret. These are given below:

  • One sample test of difference/One sample hypothesis test
  • Confidence Interval
  • Contingency Tables and Chi-Square Statistic
  • T-test or Anova
  • Pearson Correlation
  • Bivariate Regression
  • Multi-variate Regression

Hypothesis Testing

Hypothesis testing is a type of inferential procedure that takes the help of sample data to evaluate and assess the credibility of a hypothesis about a population.

Inferential statistics are generally used to determine how strong a relationship is within the sample. However, it is very difficult to obtain a population list and draw a random sample. Inferential statistics can be done with the help of various steps as given below:

  • Obtain and start with a theory.
  • Generate a research hypothesis.
  • Operationalize or use variables
  • Identify or find out the population to which we can apply study material.
  • Generate or form a null hypothesis for these populations.
  • Collect and gather a sample of children from the population and simply run a study.
  • Then, perform all tests of statistical to clarify if the obtained characteristics of the sample are sufficiently different from what would be expected under the null hypothesis so that we can be able to find and reject the null hypothesis.

Correlation Coefficient

A correlation coefficient is a numerical estimate of the statistical connection that exists between two variables. The variables could be two columns from a data set or two elements of a multivariate random variable.

The Pearson correlation coefficient (r) measures linear correlation, ranging from -1 to 1. It shows the strength and direction of the relationship between two variables. A coefficient of 0 means no linear relationship, while -1 or +1 indicates a perfect linear relationship.

Here are some examples of correlations:

  1. Positive linear correlation: When the variable on the x-axis rises as the variable on the y-axis increases.
  2. Negative linear correlation: When the values of the two variables move in opposite directions.
  3. Nonlinear correlation: Also called curvilinear correlation.
  4. No correlation: When the two variables are entirely independent.

Coefficient of Variation

The Coefficient of Variation (CV) is a statistical measure of the relative variability of data. It represents the ratio of the standard deviation to the mean and is often expressed as a percentage.

Formula:

CV = σ/μ × 100

Where,

  • σ is Standard Deviation
  • μ is Arithmetic Mean

Data in Statistics

Data is the collection of numbers, words, or anything that can be arranged to form meaningful information. There are various types of data in the statistics that are added below.

Types of Data

Various types of Data used in statistics are,

  • Qualitative Data -Qualitative data is the descriptive data of any object. For example, Kabir is tall, Kaira is thin, etc.
  • Quantitative Data - Quantitative data is the numerical data of any object. For example, he ate three chapatis, and we are five friends.

Types of Quantitative Data

We have two types of quantitative data that include,

  1. Discrete Data: The data that have fixed value is called discrete data and can easily be counted.
  2. Continuous Data: The data that has no fixed value and has a range of data is called continuous data. It can be measured.

Representation of Data

We can easily represent the data using various graphs, charts, or tables. The various types of representing data sets are:

  • Bar Graph
  • Pie Chart
  • Line Graph
  • Pictograph
  • Histogram
  • Frequency Distribution

Read more about the Representation of Data in statistics from Types of Graphs in Statistics

Models of Statistics

Various models of Statistics are used to measure different forms of data. Some of the models of the statistics are added below:

  • Skewness in Statistics
  • ANOVA Statistics
  • Degree of Freedom
  • Regression Analysis
  • Mean Deviation for Ungrouped Data
  • Mean Deviation for Discrete Grouped Data
  • Exploratory Data Analysis
  • Causal Analysis
  • Associational Statistical Analysis

Let's learn about them in detail.

Skewness in Statistics

Skewness in statistics is defined as the measure of the asymmetry in a probability distribution that is used to measure the normal probability distribution of data.

Skewed data can be either positive or negative. If a data curve shifts from left to right is called positively skewed. If the curve moves from right to left, it is called left-skewed.

ANOVA Statistics

ANOVA is another name for the Analysis of Variance in statistics. In the ANOVA model, we use the difference of the mean of the data set from the individual data set to measure the dispersion of the data set.

Analysis of Variance (ANOVA) is a set of statistical tools created by Ronald Fisher to compare means. It helps analyze differences among group averages. ANOVA looks at two kinds of variation: the differences between group averages and the differences within each group. The test tells us if there are disparities among the levels of the independent variable, though it doesn't pinpoint which differences matter the most.

ANOVA relies on four key assumptions:

  1. Interval Scale: The dependent data should be measured at an interval scale, meaning the intervals between values are consistent.
  2. Normal Distribution: The population distribution should ideally be normal, resembling a bell curve.
  3. Homoscedasticity: This assumption states that the variances of the errors should be consistent across all levels of the independent variable.
  4. Multicollinearity: There shouldn't be a significant correlation among the independent variables, as this can skew results.

It operates with a null hypothesis (H0) and an alternative hypothesis (H1). The null hypothesis generally posits that there's no difference among the means of the samples, while the alternative hypothesis suggests at least one difference exists among the means of the samples.

Degree of Freedom

Degree of Freedom model in statistics measures the changes in the data set if there is a change in the value of the data set. We can move data in this model if we want to estimate any parameter of the data set.

Degree of Freedom formula is equal to the size of a sample of data minus one.

Df = N – 1

where,

  • Df is Degree of Freedom
  • N is Actual Sample Size

Regression Analysis

Regression Analysis model of the statistics is used to determine the relation between the variables. It gives the relation between the dependent variable and the independent variable.

There are various types of regression analysis techniques:

  1. Linear Regression: Used when the relationship between variables is linear, where changes in the dependent variable are proportional to the independent variable.
  2. Logistic Regression: Predicts categorical dependent variables (e.g., yes/no, 0/1) and is used in classification tasks like fraud detection or spam classification.
  3. Ridge Regression: Addresses multicollinearity by adding a penalty term to the regression equation, preventing overfitting, especially when there are more predictors than observations.
  4. Lasso Regression: Similar to ridge regression but shrinks some coefficients to zero, effectively performing variable selection.
  5. Polynomial Regression: Uses polynomial functions to capture nonlinear relationships between variables that simple linear regression cannot model.
  6. Bayesian Linear Regression: Integrates Bayesian statistics, allowing for parameter estimation with uncertainty and incorporating prior knowledge into the regression model.

How to create a Data and table

  • Choose the right chart type: Different chart types are better suited for different kinds of data. For example, bar charts are good for comparing categories, while line charts are good for showing trends over time.
  • Keep it simple: Don't overload your chart with too much information. Make sure the labels are clear and easy to read.
  • Use clear and concise titles: Your chart title should accurately reflect the information being presented.
  • Use color effectively: Color can be a great way to highlight important data points, but avoid using too many colors or colors that clash.

Mean Deviation for Ungrouped Data

Suppose we are given 'n' terms in a data set, x1, x2, x3, ..., xn, then the mean deviation about means and median is calculated using the formula,

Mean Deviation for Ungrouped Data = Sum of Deviation/Number of Observation
Mean of Ungrouped Data = ∑in (x - μ)/n

Mean Deviation for Discrete Grouped Data

Let there are x1, x2, x3, ..., xn term and their respective frequency are, f1, f2, f3, ..., fn then the mean is calculated using the formula,

a) Mean Deviation About Mean

Mean deviation about the mean of the data set is calculated using the following formula,

  • Mean Deviation (μ) = ∑i = 1n fi (xi - μ)/N

b) Mean Deviation About Median

Mean deviation about the median of the data set is calculated using the following formula,

  • Mean Deviation (μ) = ∑i = 1n fi (xi - M)/N

Exploratory Data Analysis

Exploratory data analysis (EDA) is a statistical approach for summarizing the main characteristics of data sets. It's an important first step in any data analysis.

Here are some steps involved in EDA:

  • Collect data
  • Find and understand all variables
  • Clean the dataset
  • Identify correlated variables
  • Choose the right statistical methods
  • Visualize and analyze results

Exploratory Data Analysis (EDA) uses graphs and visual tools to spot overall trends and peculiarities in data. These can be anything from outliers, which are data points that stand out, to unexpected characteristics of the data set.

Following are the four types of EDA:

  1. Univariate non-graphical
  2. Multivariate non-graphical
  3. Univariate graphical
  4. Multivariate graphical

Causal Analysis

Causal analysis is a process that aims to identify and understand the causes and effects of a problem. It involves:

  1. Identifying the relevant variables and collecting data.
  2. Analyzing the data using statistical techniques to determine whether there is a significant relationship between the variables.
  3. Concluding the causal relationship between the variables.

Causal analysis differs from simple correlation in that it investigates the underlying mechanisms and factors that drive changes in variable values rather than simply finding statistical links. It provides evidence of the causal relationships between variables.

Let's look at some examples where we might use causal analysis:

  • We want to know if adding more fertilizer makes plants grow better.
  • Can taking a specific medicine prevent someone from getting sick?

To determine cause and effect, we use randomized controlled trials, where different treatments are applied to groups and results are compared. However, confounding can occur when a third factor creates a false connection, leading to incorrect conclusions about cause and effect.

Associational Statistical Analysis

Associational statistical analysis is a method that researchers use to identify correlations between many variables. It can also be used to examine whether researchers can draw conclusions and make predictions about one data set based on the features of another.

Associational analysis examines how two or more features are related while considering other possible influencing factors.

Some measures of association are:

Chi-square Test for Association

Chi-square test of independence, also called the chi-square test of association, is a statistical method for determining the relationship between two variables that are categorical. It assesses if the variables are unrelated or related. Chi-square test determines the statistical significance of the relationship between variables rather than the intensity of the association. It determines if there is a substantial difference between the observed and expected data. By comparing the two datasets, we can determine whether the variables have a logical association.

For example, a chi-square test can be done to see if there is a statistically significant relationship between gender and the type of product bought. A p-value larger than 0.02 indicates that there is no statistically significant correlation.

Data Analysis

Data Analysis is all about making sense of information by using mathematical and logical methods. In simpler terms, it means looking carefully at lots of facts and figures, organizing them, summarizing them, and then checking to see if everything adds up correctly.

Types of Data Analysis

There are three types of data analysis, which are as follows:

  1. Descriptive Analysis
  2. Predictive Analysis
  3. Prescriptive Analysis

Descriptive Analysis

Examining numerical data and calculations aids in comprehending business operations. Descriptive analysis serves various purposes, such as:

  • Assessing customer satisfaction
  • Monitoring campaigns
  • Producing reports
  • Evaluating performance

Predictive Analysis

Uses historical data, statistical models, and machine learning algorithms to identify patterns and make predictions about future outcomes. Some applications of predictive analytics are:

  1. Sales Forecasting: Predictive analytics can help businesses forecast future sales trends based on historical data and market variables.
  2. Customer Churn Prediction: Businesses can use predictive analytics to identify customers who are likely to churn or stop using their services, allowing proactive measures to retain them.
  3. Healthcare Diagnostics: Predictive analytics can aid in diagnosing diseases and predicting patient outcomes based on medical history, symptoms, and other relevant data.

Prescriptive Analysis

Uses extensive methods and technologies to analyze data and information to determine the best course of action or plan.

For example, the prescriptive analysis may suggest specific products to online customers based on their previous behavior.

Applications of Statistics

Various applications of statistics in mathematics are added below.,

  • Statistics is used in mathematical computing.
  • Statistics is used in finding probability and chances.
  • Statistics is used in weather forcasting, etc.

Business Statistics

Business statistics is the process of collecting, analyzing, interpreting, and presenting data relevant to business operations and decision-making. It is a critical tool for organizations to gain insights into their performance, market dynamics, and customer behavior.

Scope of Statistics

Statistics is a branch of mathematics that deals with the collection, organization, analysis, interpretation, and presentation of data. It is used in a wide variety of fields, including:

  • Science: Statistics is used to design experiments, analyze data, and draw conclusions about the natural world.
  • Business: Statistics is used to market products, track sales, and make financial decisions.
  • Government: Statistics is used to track economic trends, measure the effectiveness of government programs, and allocate resources.
  • Healthcare: Statistics is used to develop new drugs, track the spread of diseases, and assess the effectiveness of medical treatments.
  • Sports: Statistics is used to analyze player performance, scout new talent, and predict the outcome of games.

Limitations of Statistics

While statistics is a powerful tool, it is important to be aware of its limitations. Some of the most important limitations include:

  • Data quality: The quality of statistical analysis is limited by the quality of the data. If the data is inaccurate, incomplete, or biased, the results of the analysis will also be inaccurate, incomplete, or biased.
  • Sampling: Statistics is often based on samples of data rather than the entire population. This means that the results of the analysis may not be generalizable to the entire population.
  • Assumptions: Statistical methods often rely on assumptions about the data, such as the assumption that the data is normally distributed. If these assumptions are not met, the results of the analysis may be misleading.
  • Causation vs. correlation: Statistics can show that two variables are correlated, but it cannot prove that one variable causes the other.
  • Misinterpretation: Statistics can be misused or misinterpreted, leading to false conclusions.

Solved Problems - Statistics

Example 1: Find the mean of the data set.

xi

fi

2

3

3

4

5

4

8

5

Solution:

xi

fi

fixi

2

3

6

3

4

12

5

4

20

8

5

40

Mean = (Σf ixi)/Σfi
Σfixi = (6 + 12 + 20 + 40) = 78, and
Σfi = 16
⇒ Mean = 78/16 = 4.875

Example 2: Find the Standard Deviation of 4, 7, 10, 13, and 16.

Solution:

Given,

  • xi = 4, 7, 10, 13, 16
  • N = 5

Σxi = (4 + 7 + 10 + 13 + 16) = 50
⇒ Mean(μ) = Σxi/N = 50/5 = 10

Standard Deviation = √(σ) = √{∑i = 1n (xi - μ)}/N
⇒ SD = √{1/5[(4 - 10)2 + (7 - 10)2 + (10 - 10)2 + (13 - 10)2 + (16 - 10)2]}
⇒ SD = √{1/5[36 + 9 + 0 + 9 + 36] = √{1/5[90]} = √18


Next Article
Bar Graphs and Histograms

M

madhurihammad
Improve
Article Tags :
  • Mathematics
  • School Learning
  • Class 9
  • Math-Statistics
  • Maths-Formulas
  • Maths-Class-9

Similar Reads

    Statistics in Maths
    Statistics is the science of collecting, organizing, analyzing, and interpreting information to uncover patterns, trends, and insights. Statistics allows us to see the bigger picture and tackle real-world problems like measuring the popularity of a new product, predicting the weather, or tracking he
    3 min read

    Introduction to Statistics

    Introduction of Statistics and its Types
    Statistics is a branch of mathematics that deals with collecting, organizing, and analyzing numerical data to solve real-world problems. It is widely regarded as a distinct scientific discipline due to its vast applications across various fields.Helps make sense of complex data through quantitative
    15+ min read
    Bar Graphs and Histograms
    A Bar graph or a Histogram is a tool used for visual representation of data. Representing the data in a bar graphs or histograms, makes it easy to understand the concepts and relationships among data.A Histogram is used to display the distribution of continuous data by grouping values into intervals
    6 min read
    Pie Chart
    Pie chart is a popular and visually intuitive tool used in data representation, making complex information easier to understand at a glance. This circular graph divides data into slices, each representing a proportion of the whole, allowing for a clear comparison of different categories making it ea
    11 min read
    Frequency Distribution - Table, Graphs, Formula
    A frequency distribution is a way to organize data and see how often each value appears. It shows how many times each value or range of values occurs in a dataset. This helps us understand patterns, like which values are common and which are rare. Frequency distributions are often shown in tables or
    11 min read
    Types of Frequency Distribution
    Frequency distribution is a method of organizing and summarizing data to show the frequency (count) of each possible outcome of a dataset. It is an essential tool in statistics for understanding the distribution and pattern of data. There are several types of frequency distributions used based on th
    10 min read

    Mean

    Mean in Statistics
    In statistics, three measures are defined as central tendencies that are: Mean, Median, and Mode, where the mean provides the average value of the dataset, the median provides the central value of the dataset, and the most frequent value in the dataset is the mode.Calculation of central tendency, su
    15+ min read
    How to find Mean of grouped data by direct method?
    Statistics involves gathering, organizing, analyzing, interpreting, and presenting data to form opinions and make decisions. Applications range from educators computing average student scores and government officials conducting censuses to demographic analysis. Understanding and utilizing statistica
    8 min read
    How to Calculate Mean using Step Deviation Method?
    Step Deviation Method is a simplified way to calculate the mean of a grouped frequency distribution, especially when the class intervals are uniform. In simple words, statistics implies the process of gathering, sorting, examining, interpreting and then understandably presenting the data to enable o
    6 min read

    Median

    Median in Statistics
    Median is defined as the middle term of the given set of data if the data is arranged either in ascending or descending order.Example: Suppose we have the height of 5 friends as 167 cm, 169 cm, 171 cm, 174 cm, 179 cm. Now clearly observing the data we see that 171 cm is the middle term in the given
    10 min read

    Mode

    Mode in Statistics | Definition, Formula, How to Calculate Mode
    In statistics, the mode is the data point that comes the most frequently among a group of data members. It is one of three measures of central tendency, alongside the mean and median. To determine the mode, count how frequently each number appears. The number that comes the most frequently is the mo
    11 min read
    How To Find the Mode If No Number is Repeated?
    In statistics, for a given data distribution the mode is the value or number that occurs most frequently. It is representative of the value or integer that occurs the maximum number of times. However, there may or may not be a modal value for a given set of datasets. This is because the given data s
    6 min read
    Empirical Formula
    Empirical Formula in chemistry is defined as the simplest ratio of the elements present in the compound. It does not take into account that these elements are connected with each other in any manner. This is explained in the formula of Glucose as we know that the formula for Glucose is C6H12O6 here
    6 min read
    Frequency Distribution - Table, Graphs, Formula
    A frequency distribution is a way to organize data and see how often each value appears. It shows how many times each value or range of values occurs in a dataset. This helps us understand patterns, like which values are common and which are rare. Frequency distributions are often shown in tables or
    11 min read
    Cumulative frequency Curve
    In statistics, graph plays an important role. With the help of these graphs, we can easily understand the data. So in this article, we will learn how to represent the cumulative frequency distribution graphically.Cumulative FrequencyThe frequency is the number of times the event occurs in the given
    9 min read
    Mean Deviation
    The mean deviation (also known as Mean Absolute Deviation, or MAD) of the data set is the value that tells us how far each data is from the center point of the data set. The center point of the data set can be the Mean, Median, or Mode. Thus, the mean of the deviation is the average of the absolute
    12 min read
    Standard Deviation - Formula, Examples & How to Calculate
    Standard deviation is a statistical measure that describes how much variation or dispersion there is in a set of data points. It helps us understand how spread out the values in a dataset are compared to the mean (average). A higher standard deviation means the data points are more spread out, while
    15+ min read
    Variance
    Variance is a number that tells us how spread out the values in a data set are from the mean (average). It shows whether the numbers are close to the average or far away from it.If the variance is small, it means most numbers are close to the mean. If the variance is large, it means the numbers are
    12 min read
geeksforgeeks-footer-logo
Corporate & Communications Address:
A-143, 7th Floor, Sovereign Corporate Tower, Sector- 136, Noida, Uttar Pradesh (201305)
Registered Address:
K 061, Tower K, Gulshan Vivante Apartment, Sector 137, Noida, Gautam Buddh Nagar, Uttar Pradesh, 201305
GFG App on Play Store GFG App on App Store
Advertise with us
  • Company
  • About Us
  • Legal
  • Privacy Policy
  • In Media
  • Contact Us
  • Advertise with us
  • GFG Corporate Solution
  • Placement Training Program
  • Languages
  • Python
  • Java
  • C++
  • PHP
  • GoLang
  • SQL
  • R Language
  • Android Tutorial
  • Tutorials Archive
  • DSA
  • Data Structures
  • Algorithms
  • DSA for Beginners
  • Basic DSA Problems
  • DSA Roadmap
  • Top 100 DSA Interview Problems
  • DSA Roadmap by Sandeep Jain
  • All Cheat Sheets
  • Data Science & ML
  • Data Science With Python
  • Data Science For Beginner
  • Machine Learning
  • ML Maths
  • Data Visualisation
  • Pandas
  • NumPy
  • NLP
  • Deep Learning
  • Web Technologies
  • HTML
  • CSS
  • JavaScript
  • TypeScript
  • ReactJS
  • NextJS
  • Bootstrap
  • Web Design
  • Python Tutorial
  • Python Programming Examples
  • Python Projects
  • Python Tkinter
  • Python Web Scraping
  • OpenCV Tutorial
  • Python Interview Question
  • Django
  • Computer Science
  • Operating Systems
  • Computer Network
  • Database Management System
  • Software Engineering
  • Digital Logic Design
  • Engineering Maths
  • Software Development
  • Software Testing
  • DevOps
  • Git
  • Linux
  • AWS
  • Docker
  • Kubernetes
  • Azure
  • GCP
  • DevOps Roadmap
  • System Design
  • High Level Design
  • Low Level Design
  • UML Diagrams
  • Interview Guide
  • Design Patterns
  • OOAD
  • System Design Bootcamp
  • Interview Questions
  • Inteview Preparation
  • Competitive Programming
  • Top DS or Algo for CP
  • Company-Wise Recruitment Process
  • Company-Wise Preparation
  • Aptitude Preparation
  • Puzzles
  • School Subjects
  • Mathematics
  • Physics
  • Chemistry
  • Biology
  • Social Science
  • English Grammar
  • Commerce
  • World GK
  • GeeksforGeeks Videos
  • DSA
  • Python
  • Java
  • C++
  • Web Development
  • Data Science
  • CS Subjects
@GeeksforGeeks, Sanchhaya Education Private Limited, All rights reserved
We use cookies to ensure you have the best browsing experience on our website. By using our site, you acknowledge that you have read and understood our Cookie Policy & Privacy Policy
Lightbox
Improvement
Suggest Changes
Help us improve. Share your suggestions to enhance the article. Contribute your expertise and make a difference in the GeeksforGeeks portal.
geeksforgeeks-suggest-icon
Create Improvement
Enhance the article with your expertise. Contribute to the GeeksforGeeks community and help create better learning resources for all.
geeksforgeeks-improvement-icon
Suggest Changes
min 4 words, max Words Limit:1000

Thank You!

Your suggestions are valuable to us.

What kind of Experience do you want to share?

Interview Experiences
Admission Experiences
Career Journeys
Work Experiences
Campus Experiences
Competitive Exam Experiences