Skip to content
geeksforgeeks
  • Tutorials
    • Python
    • Java
    • Data Structures & Algorithms
    • ML & Data Science
    • Interview Corner
    • Programming Languages
    • Web Development
    • CS Subjects
    • DevOps And Linux
    • School Learning
    • Practice Coding Problems
  • Courses
    • DSA to Development
    • Get IBM Certification
    • Newly Launched!
      • Master Django Framework
      • Become AWS Certified
    • For Working Professionals
      • Interview 101: DSA & System Design
      • Data Science Training Program
      • JAVA Backend Development (Live)
      • DevOps Engineering (LIVE)
      • Data Structures & Algorithms in Python
    • For Students
      • Placement Preparation Course
      • Data Science (Live)
      • Data Structure & Algorithm-Self Paced (C++/JAVA)
      • Master Competitive Programming (Live)
      • Full Stack Development with React & Node JS (Live)
    • Full Stack Development
    • Data Science Program
    • All Courses
  • Number System and Arithmetic
  • Algebra
  • Set Theory
  • Probability
  • Statistics
  • Geometry
  • Calculus
  • Logarithms
  • Mensuration
  • Matrices
  • Trigonometry
  • Mathematics
Open In App
Next Article:
Chi-Squared Distributions
Next article icon

Chi-Squared Distributions

Last Updated : 27 Jan, 2025
Comments
Improve
Suggest changes
Like Article
Like
Report

The Chi-Squared distribution (also chi-square or χ²-distribution) represents the distribution of the sum of the squares of k independent standard normal random variables. If Z1, Z2, . . ., Zk​ are independent standard normal random variables, the Chi-Squared variable is: X^2 = Z_1^2 + Z_2^2 + \ldots + Z_k^2

The Chi-Squared distribution is parameterised by the degrees of freedom (df), which corresponds to the number of independent random variables being summed.

  • The chi-square distribution is actually a series of distributions that vary in shape according to their degrees of freedom. As the degrees of freedom increase, the distribution becomes more symmetric and approaches a normal distribution.
  • The chi-square test is a hypothesis test designed to test for a statistically significant relationship between nominal and ordinal variables organized in a bivariate table. In other words, it tells us whether two variables are independent of one another.

Probability Density function (PDF) of Chi-Squared Distribution

The PDF of a Chi-Squared distribution with k degrees of freedom is:

f(x; k) = \frac{1}{2^{k/2} \Gamma(k/2)} x^{(k/2)-1} e^{-x/2}, \quad x \geq 0

where Γ is the Gamma function.

Properties of Chi-Squared Distribution

Some of the common properties of Chi-Squared Distribution are discussed below:

Non-Negativity

The Chi-Squared distribution is defined only for non-negative values (x \geq 0) because it is based on the sum of squared standard normal variables, which are always non-negative.

Degrees of Freedom

The shape of the Chi-Squared distribution depends on the number of degrees of freedom (k).

  • For small k, the distribution is positively skewed.
  • As k→∞ the distribution approaches a normal distribution (via the Central Limit Theorem).

Mean

The mean of the Chi-Squared distribution is equal to its degrees of freedom: Mean = k

Variance

The variance of the Chi-Squared distribution is twice its degrees of freedom: Variance = 2k

Standard Deviation (SD)

The standard deviation of the Chi-Squared distribution is the square root of the variance, so: SD= \sqrt{2 \times \text{df}}​

Skewness

The skewness decreases as the degrees of freedom increase: Skewness = \sqrt{\frac{8}{k}}

  • For small k, the distribution is heavily skewed to the right.
  • As k increases, the skewness approaches 0.

Kurtosis (Excess)

The kurtosis (excess) of the Chi-Squared distribution is: Excess Kurtosis = 12/k

  • This shows that the distribution becomes less peaked as k increases.

Additive Property

If X_1, X_2, \dots, X_m​ are independent Chi-Squared random variables with degrees of freedom k_1, k_2, \dots, k_m, then their sum is also a Chi-Squared random variable: X = X_1 + X_2 + \dots + X_m \sim \chi^2(k_1 + k_2 + \dots + k_m)

Moment-Generating Function (MGF)

The MGF of a Chi-Squared random variable with k degrees of freedom is: M(t) = (1 - 2t)^{-k/2}, \quad t < \frac{1}{2}

Chi-Square Distribution with 1 Degree of Freedom

The χ2(1) distribution is a special case of the χ2 distribution, which is based on the sum of the squares of independent standard normal variables. Specifically, if Z is a standard normal variable (mean = 0, standard deviation = 1), then:

X = Z^2 \quad \text{follows a} \quad \chi^2(1) \text{ distribution.}

Since squaring a standard normal variable always results in a non-negative value, the \chi^2(1) distribution is skewed to the right with its values ranging from 0 to positive infinity.

Chi-Squared Distributions in R Language

R is a popular high level programming language used for statistical analysis. It is open-source programming language as it has a huge community and users can contribute to the development as well. It has vast number of packages which allows the data miners to perform statistical analysis and data visualizations in an interactive manner.

R has 4 built in functions for chi-square distribution.

dchisq() function

The dchisq() function calculates the probability density of a chi-squared distribution. It can compute cumulative probabilities with lower.tail = TRUE for the left tail or lower.tail = FALSE for the right tail.

Syntax:

dchisq (x, df, ncp = 0, log= FALSE)

where x= vector of quantiles.
p= vector of probabilities.
n= number of observations
df= degrees of freedam
ncp= non-centrality parameter (non-negative).
log.p– logical; if TRUE, probabilities p are given as log(p).
lower.tail= this is a logical value, if TRUE (default),probabilities are P[X \leq x], otherwise P[X>x].

pchisq() function

The pchisq() function gives the cumulative distribution function (CDF) for the chi-squared distribution. The dchisq(x, df) function calculates the probability density of a chi-squared distribution for a value x with df degrees of freedom. It can also be used to determine the area under the chi-squared curve for specified intervals with a given number of degrees of freedom.

Syntax:

pchisq(q, df, ncp = 0, lower.tail = TRUE, log.p = FALSE)

qchisq() function

The qchisq() function returns the quantile function for the chi-squared distribution. When the non-centrality parameter (ncp) is set to 0, it computes for the central chi-squared distribution. This method gives the value of x at the qth percentile, where lower.tail = TRUE corresponds to the cumulative probability up to x.

syntax:

qchisq(p, df, ncp = 0, lower.tail = TRUE, log.p = FALSE)

rchisq() function:

The rchisq(n, df) function generates n random numbers from the chi-square distribution with df degrees of freedom. It is used to produce random deviates from the chi-square distribution.

Syntax:

rchisq (n, df, ncp = 0)

read more about Chi-Squared Distributions in R.

Chi-squared Distributions

Non-Central Chi-Square distribution

The non-central chi-square distribution is a generalization of the chi-square distribution, often used in power analyses. It introduces an additional parameter, λ known as the non-central parameter. This parameter shifts the distribution's peak to the right and increases the variance as λ increases.

The λ parameter influences the mean of the normal distributions that make up the chi-square distribution. For instance, a non-central chi-square distribution with λ=2 and k=3 can be generated by squaring and summing values from three normal distributions, each with a mean of 2 and a variance of 1.

Generalized Chi-squared Distribution

The Generalized Chi-squared Distribution is a more flexible version of the standard Chi-squared distribution. It's based on a mathematical expression called a quadratic form, which looks like z′Az.

Here’s what each part means:

  • z is a vector of random variables that follow a Gaussian distribution (i.e., they are normally distributed) with a mean of zero.
  • A is a matrix that helps define how the variables in z are related to each other (i.e., it defines their covariance or how they vary together).
  • z′Az represents a mathematical operation where you multiply the vector z by the matrix A and then by the vector z again. This produces a single number, which is the value from the distribution.

In simpler terms, the generalized chi-squared distribution describes how a set of correlated normal variables behave when combined in a specific way. This allows for more complex scenarios than the basic chi-squared distribution.

Read More,

  • Probability Distribution
  • Normal Distribution
  • Standard Normal Distribution

Next Article
Chi-Squared Distributions

S

somesh_barthwal
Improve
Article Tags :
  • Engineering Mathematics
  • Mathematics
  • Maths

Similar Reads

    Standard Normal Distribution
    Standard normal distribution, also known as the z-distribution, is a special type of normal distribution. In this distribution, the mean (average) is 0 and the standard deviation (a measure of spread) is 1. This creates a bell-shaped curve that is symmetrical around the mean.In this article we have
    8 min read
    Chi-square with Ordinal Data
    Chi-square test is a primordial technique employed by statisticians to evaluate the hypothesis concerned with an association between two variables. This article will take you through an understanding of the Chi-square test especially when used with ordinal data.In this article, we will learn the gen
    9 min read
    Mathematics | Beta Distribution Model
    The Beta Distribution is a continuous probability distribution defined on the interval [0, 1], widely used in statistics and various fields for modeling random variables that represent proportions or probabilities. It is particularly useful when dealing with scenarios where the outcomes are bounded
    11 min read
    Binomial Distribution Practice Problems
    Binomial Distribution is a fundamental concept in probability theory , It is a probability distribution that describes the number of successes in a fixed number of independent trials, where each trial has only two possible outcomes: success or failure.Binomial distribution is widely used across vari
    8 min read
    Poisson Distribution Practice Problems
    Poisson distribution is a probability distribution that models the number of events occurring within a fixed interval of time or space, where these events happen with a known constant mean rate and independently of the time since the last event. It is named after the French mathematician Siméon Deni
    4 min read
geeksforgeeks-footer-logo
Corporate & Communications Address:
A-143, 7th Floor, Sovereign Corporate Tower, Sector- 136, Noida, Uttar Pradesh (201305)
Registered Address:
K 061, Tower K, Gulshan Vivante Apartment, Sector 137, Noida, Gautam Buddh Nagar, Uttar Pradesh, 201305
GFG App on Play Store GFG App on App Store
Advertise with us
  • Company
  • About Us
  • Legal
  • Privacy Policy
  • In Media
  • Contact Us
  • Advertise with us
  • GFG Corporate Solution
  • Placement Training Program
  • Languages
  • Python
  • Java
  • C++
  • PHP
  • GoLang
  • SQL
  • R Language
  • Android Tutorial
  • Tutorials Archive
  • DSA
  • Data Structures
  • Algorithms
  • DSA for Beginners
  • Basic DSA Problems
  • DSA Roadmap
  • Top 100 DSA Interview Problems
  • DSA Roadmap by Sandeep Jain
  • All Cheat Sheets
  • Data Science & ML
  • Data Science With Python
  • Data Science For Beginner
  • Machine Learning
  • ML Maths
  • Data Visualisation
  • Pandas
  • NumPy
  • NLP
  • Deep Learning
  • Web Technologies
  • HTML
  • CSS
  • JavaScript
  • TypeScript
  • ReactJS
  • NextJS
  • Bootstrap
  • Web Design
  • Python Tutorial
  • Python Programming Examples
  • Python Projects
  • Python Tkinter
  • Python Web Scraping
  • OpenCV Tutorial
  • Python Interview Question
  • Django
  • Computer Science
  • Operating Systems
  • Computer Network
  • Database Management System
  • Software Engineering
  • Digital Logic Design
  • Engineering Maths
  • Software Development
  • Software Testing
  • DevOps
  • Git
  • Linux
  • AWS
  • Docker
  • Kubernetes
  • Azure
  • GCP
  • DevOps Roadmap
  • System Design
  • High Level Design
  • Low Level Design
  • UML Diagrams
  • Interview Guide
  • Design Patterns
  • OOAD
  • System Design Bootcamp
  • Interview Questions
  • Inteview Preparation
  • Competitive Programming
  • Top DS or Algo for CP
  • Company-Wise Recruitment Process
  • Company-Wise Preparation
  • Aptitude Preparation
  • Puzzles
  • School Subjects
  • Mathematics
  • Physics
  • Chemistry
  • Biology
  • Social Science
  • English Grammar
  • Commerce
  • World GK
  • GeeksforGeeks Videos
  • DSA
  • Python
  • Java
  • C++
  • Web Development
  • Data Science
  • CS Subjects
@GeeksforGeeks, Sanchhaya Education Private Limited, All rights reserved
We use cookies to ensure you have the best browsing experience on our website. By using our site, you acknowledge that you have read and understood our Cookie Policy & Privacy Policy
Lightbox
Improvement
Suggest Changes
Help us improve. Share your suggestions to enhance the article. Contribute your expertise and make a difference in the GeeksforGeeks portal.
geeksforgeeks-suggest-icon
Create Improvement
Enhance the article with your expertise. Contribute to the GeeksforGeeks community and help create better learning resources for all.
geeksforgeeks-improvement-icon
Suggest Changes
min 4 words, max Words Limit:1000

Thank You!

Your suggestions are valuable to us.

What kind of Experience do you want to share?

Interview Experiences
Admission Experiences
Career Journeys
Work Experiences
Campus Experiences
Competitive Exam Experiences