Isotonic Regression in Scikit Learn

Last Updated : 26 Apr, 2025

Isotonic regression is a regression technique that fits a monotonic function to the data, on the assumption that the predictor variable is monotonically related to the target variable. This means that as the value of the predictor variable increases, the fitted value of the target variable either never decreases or never increases, so it moves in a consistent, non-oscillating manner.

Mathematically, isotonic regression can be formulated as an optimization problem in which the goal is to find a monotonic function that minimizes the sum of the squared errors between the predicted and observed values of the target variable.

The optimization problem can be written as follows:

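minimize ∑_{i=1}^{n} (y_i - f(x_i))^2 subject to f(x_1) ≤ f(x_2) ≤ ... ≤ f(x_n)

where x_i and y_i are the predictor and target values for the i^{th} data point, respectively, with the data points ordered so that x_1 ≤ x_2 ≤ ... ≤ x_n, and f is the monotonic function that is being fit to the data. The constraint ensures that the function is monotonic (here non-decreasing; reversing the inequalities gives the non-increasing case).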

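One way to solve this optimization problem is an iterative approach known as the pool adjacent violators algorithm (PAVA), which is also what scikit-learn uses. It adds one predictor-target pair at a time in order of increasing predictor value. Whenever the new value breaks monotonicity, it is pooled with the preceding values and the pooled values are replaced by their average, so that the function remains monotonic at each step.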
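The iterative procedure described above can be sketched in a few lines of plain Python. This is a minimal illustration rather than scikit-learn's implementation: the function name `pava` is hypothetical, all points are given equal weight, and the target values are assumed to be already ordered by increasing predictor value.

```python
def pava(y):
    """Non-decreasing least-squares fit to y (equal weights).

    Assumes y is already ordered by increasing predictor value.
    """
    blocks = []  # each block stores [sum of values, number of values]
    for value in y:
        blocks.append([float(value), 1])
        # While the newest block's mean is below the previous block's mean,
        # the ordering is violated, so pool the two blocks into one.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count

    # Every point in a block is predicted by the block mean.
    fitted = []
    for total, count in blocks:
        fitted.extend([total / count] * count)
    return fitted


y = [1.0, 3.0, 2.0, 4.0, 3.0, 5.0]
print(pava(y))  # [1.0, 2.5, 2.5, 3.5, 3.5, 5.0]
```

The pairs (3, 2) and (4, 3) violate the ordering, so each pair is replaced by its mean, while the values that already respect the ordering are left unchanged.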

Applications of Isotonic Regression

Isotonic regression has a number of applications, including:

  1. Calibration of predicted probabilities: Isotonic regression can be used to adjust the predicted probabilities produced by a classifier so that they are more accurately calibrated to the true probabilities.
  2. Ordinal regression: Isotonic regression can be used to model ordinal variables, which are variables that can be ranked in order (e.g., "low," "medium," and "high").
  3. Non-parametric regression: Because isotonic regression does not make any assumptions about the functional form of the relationship between the predictor and target variables, it can be used as a non-parametric regression method.
  4. Imputing missing values: Isotonic regression can be used to impute missing values in a dataset by predicting the missing values based on the surrounding non-missing values.
  5. Outlier detection: Isotonic regression can be used to identify outliers in a dataset by identifying points that are significantly different from the overall trend of the data.
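As an illustration of the first application, the sketch below calibrates a handful of classifier scores with isotonic regression. The scores and labels are made-up toy values (an assumption for illustration only); in practice the calibrator would be fitted on held-out predictions, and scikit-learn also offers `CalibratedClassifierCV(method="isotonic")` for this purpose.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

# Hypothetical uncalibrated scores from a classifier and the true binary labels
scores = np.array([0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9])
labels = np.array([0, 0, 1, 0, 0, 1, 1, 1, 1])

# Constrain the calibrated values to [0, 1] and clip scores outside the fitted range
calibrator = IsotonicRegression(y_min=0.0, y_max=1.0, out_of_bounds="clip")
calibrator.fit(scores, labels)

# The calibrated probabilities are a non-decreasing function of the raw score
calibrated = calibrator.predict(scores)
print(calibrated)
```

Because the mapping is monotonic, the ranking of the examples by score is preserved (up to ties) while the values themselves move closer to the observed label frequencies.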

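In scikit-learn, isotonic regression can be performed using the 'IsotonicRegression' class from sklearn.isotonic. By default it fits a non-decreasing function to the data (the increasing parameter can be set to False for a non-increasing fit). The fitted values form constant blocks over the training points, and predictions for new inputs are obtained by linear interpolation between the fitted points.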
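The direction of the fit is controlled by the `increasing` parameter of `IsotonicRegression`. The short sketch below uses a small made-up decreasing dataset (an assumption for illustration) to show a non-increasing fit and the `"auto"` setting, which lets scikit-learn infer the direction from the data.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

x = np.arange(6)
y_down = np.array([9.0, 7.0, 8.0, 4.0, 5.0, 1.0])  # roughly decreasing toy data

# Fit a non-increasing function instead of the default non-decreasing one
ir_down = IsotonicRegression(increasing=False)
fit_down = ir_down.fit_transform(x, y_down)
print(fit_down)

# Let scikit-learn choose the direction; the choice is stored in increasing_
ir_auto = IsotonicRegression(increasing="auto")
fit_auto = ir_auto.fit_transform(x, y_down)
print(ir_auto.increasing_)
```

Here the pairs (7, 8) and (4, 5) break the decreasing order, so each pair is replaced by its average in the fitted values.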

Here is an example of how to use the IsotonicRegression class in scikit-learn to perform isotonic regression:

1. Create the sample data with the NumPy library

Python3
import numpy as np

# Sample dataset (no random seed is set, so the values differ between runs)
n = 20
x = np.arange(n)
print('Input:\n', x)
y = np.random.randint(0, 20, size=n) + 10 * np.log1p(np.arange(n))
print("Target :\n", y)

Output:

Input:
 [ 0  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19]
Target :
 [ 1.         22.93147181 20.98612289 20.86294361 27.09437912 31.91759469
 38.45910149 23.79441542 22.97224577 35.02585093 32.97895273 40.8490665
 39.64949357 45.3905733  39.08050201 43.72588722 31.33213344 36.90371758
 47.44438979 44.95732274]

2. Import IsotonicRegression from sklearn.isotonic and predict the target values

Python3
from sklearn.isotonic import IsotonicRegression

# Create an instance of the IsotonicRegression class
ir = IsotonicRegression()

# Fit the isotonic regression model and return the fitted values for x
y_ir = ir.fit_transform(x, y)
print('Isotonic Regression Predictions  :\n', y_ir)

Output:

Isotonic Regression Predictions :
 [ 1.         21.59351277 21.59351277 21.59351277 27.09437912 29.28583934
 29.28583934 29.28583934 29.28583934 34.00240183 34.00240183 39.5616248
 39.5616248  39.5616248  39.5616248  39.5616248  39.5616248  39.5616248
 46.20085626 46.20085626]

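This code fits an isotonic regression model to the sample data and returns the fitted values for the same inputs. Comparing them with the Target values above shows that the predictions never decrease as the input increases: wherever consecutive target values break the ordering (for example 22.93, 20.99 and 20.86), they are replaced by their average (21.59).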
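A fitted model can also be applied to inputs that were not in the training data by calling `predict`. The sketch below uses a small made-up dataset (an assumption for illustration) and sets `out_of_bounds="clip"`, so inputs outside the training range are mapped to the nearest boundary value; with the default `out_of_bounds="nan"` they would produce NaN instead.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression

x_train = np.array([0.0, 1.0, 2.0, 3.0])
y_train = np.array([1.0, 3.0, 2.0, 5.0])

ir = IsotonicRegression(out_of_bounds="clip")
ir.fit(x_train, y_train)  # fitted values at x_train are [1.0, 2.5, 2.5, 5.0]

# Two inputs outside the training range and two between training points
x_new = np.array([-1.0, 0.5, 2.5, 10.0])
pred = ir.predict(x_new)
print(pred)  # [1.   1.75 3.75 5.  ]
```

The values at 0.5 and 2.5 come from linear interpolation between the neighbouring fitted points, while -1.0 and 10.0 are clipped to the fitted values at the ends of the training range.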

3. Let's use linear regression to predict from the same data.

Python3
from sklearn.linear_model import LinearRegression

# Create an instance of the LinearRegression class
lr = LinearRegression()

# Fit the linear regression model to the data
lr.fit(x.reshape(-1, 1), y)

# Make predictions using the fitted model
y_lr = lr.predict(x.reshape(-1, 1))
print('Linear Regression Prediction :\n', y_lr)

Output:

Linear Regression Prediction :
 [17.69949296 19.24352614 20.78755933 22.33159252 23.8756257  25.41965889
 26.96369208 28.50772526 30.05175845 31.59579164 33.13982482 34.68385801
 36.2278912  37.77192438 39.31595757 40.85999076 42.40402394 43.94805713
 45.49209032 47.0361235 ]

4. Let's compare by plotting both predictions with matplotlib.

Python3
import matplotlib.pyplot as plt
from matplotlib.collections import LineCollection

# Vertical segments showing the gap between each target value and its isotonic fit
lines = [[[i, y[i]], [i, y_ir[i]]] for i in range(n)]
lc = LineCollection(lines)

# plt.figure(figsize=(10,4))
plt.plot(x, y, '.', markersize=10, label='data')
plt.plot(x, y_ir, '-', markersize=10, label='isotonic regression')
plt.plot(x, y_lr, '-', label='linear regression')

plt.gca().add_collection(lc)
plt.legend()  # add a legend

plt.title("Isotonic Regression")
plt.show()

Output: 

Figure: Isotonic Regression

Here, the blue dots represent the original target values for each input value. The orange line represents the isotonic regression predictions, which vary monotonically while following the actual target values. The green line represents linear regression, which is the best-fitting straight line for the input data.

Comparison with different regression algorithms:

The following Python code demonstrates how isotonic regression differs from other regression techniques using a sample dataset:

Python3
from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression
from sklearn.isotonic import IsotonicRegression
import numpy as np
import matplotlib.pyplot as plt

# Sample dataset
n = 20
x = np.arange(n)
print('Input:\n', x)
y = np.random.randint(0, 20, size=n) + 10 * np.log1p(np.arange(n))
print("Target :\n", y)

# Fit isotonic regression model
# create an instance of the IsotonicRegression class
ir = IsotonicRegression()
# fit the model and transform the data
y_ir = ir.fit_transform(x, y)

# Fit linear regression model
# create an instance of the LinearRegression class
lr = LinearRegression()
# fit the model to the data
lr.fit(x.reshape(-1, 1), y)
# make predictions using the fitted model
y_lr = lr.predict(x.reshape(-1, 1))

# Fit polynomial regression model
# create an instance of the PolynomialFeatures class with a degree of 2
poly = PolynomialFeatures(degree=2)
# transform the data
x_poly = poly.fit_transform(x.reshape(-1, 1))
# create an instance of the LinearRegression class
lr_poly = LinearRegression()
# fit the model to the transformed data
lr_poly.fit(x_poly, y)
# make predictions using the fitted model
y_poly = lr_poly.predict(x_poly)

# Plot the results
plt.plot(x, y, 'o', label='data')  # plot the original data
# plot the fitted isotonic regression model
plt.plot(x, y_ir, label='isotonic regression')
# plot the fitted linear regression model
plt.plot(x, y_lr, label='linear regression')
# plot the fitted polynomial regression model
plt.plot(x, y_poly, label='polynomial regression')
plt.legend()  # add a legend

# Add labels and title
plt.xlabel('X')  # add x-axis label
plt.ylabel('Y')  # add y-axis label
plt.title('Comparison of Regression Techniques')  # add title

plt.show()  # show the plot

Output:

Figure: Comparison of different Regression Techniques

The first block imports the necessary libraries and generates a sample dataset with 20 data points. The second block fits an isotonic regression model to the data using the IsotonicRegression class from the sklearn library. The fit_transform method is used to fit the model and return the fitted values for the same inputs. The third block fits a linear regression model to the data using the LinearRegression class from the sklearn library. The fourth block fits a polynomial regression model of degree 2 to the data by first transforming the data using the PolynomialFeatures class from the sklearn library, and then fitting a linear regression model to the transformed data. The last block plots the original data, as well as the three fitted models, using the matplotlib library.
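The visual comparison can be complemented with a numeric one. The sketch below recomputes the three fits and reports their mean squared error on the training data. The fixed seed and the use of `np.random.default_rng` are assumptions made here for reproducibility (the code above sets no seed), so the data will not match the values printed earlier.

```python
import numpy as np
from sklearn.isotonic import IsotonicRegression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.preprocessing import PolynomialFeatures

# Same data recipe as above, but with a fixed seed (an assumption)
rng = np.random.default_rng(0)
n = 20
x = np.arange(n)
y = rng.integers(0, 20, size=n) + 10 * np.log1p(x)

X = x.reshape(-1, 1)
y_ir = IsotonicRegression().fit_transform(x, y)
y_lr = LinearRegression().fit(X, y).predict(X)
X_poly = PolynomialFeatures(degree=2).fit_transform(X)
y_poly = LinearRegression().fit(X_poly, y).predict(X_poly)

# Mean squared error of each model on the data it was fitted to
mse = {
    "isotonic": mean_squared_error(y, y_ir),
    "linear": mean_squared_error(y, y_lr),
    "polynomial": mean_squared_error(y, y_poly),
}
for name, value in mse.items():
    print(f"{name}: {value:.3f}")
```

When the fitted line has a non-negative slope it is itself a non-decreasing function, so the isotonic fit can never have a larger training error than the linear fit. These are training errors only and say nothing about how well each model generalizes to new data.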


harshilsanghvi