Build, Test, and Deploy Model With AutoML

Last Updated : 24 May, 2024

Automated Machine Learning (AutoML) refers to a set of tools and methods that speed up the creation of machine learning models. It automates steps such as data preparation, feature selection, hyperparameter tuning, and model evaluation. By automating these intricate and time-consuming steps, AutoML platforms aim to make machine learning accessible to people and businesses without a strong background in data science.

Build, Test, and Deploy Model With AutoML

This walkthrough uses Vertex AI on Google Cloud Platform to demonstrate AutoML.

Before moving forward, make sure you have already created a Project and a Cloud Storage Bucket.

Step 1: Dataset

  • First, create a dataset. Inside Vertex AI, go to Datasets and click Create. Training reads its data from the Datasets section, so the data must be registered there first. This walkthrough uses Titanic.csv.
Fig 1: Creating the dataset
  • Enter the dataset name and select the data type. There are four types:
    • Image: if the dataset consists of images.
    • Tabular: if the dataset consists of rows and columns, for example a CSV file or a BigQuery table.
    • Text: if the dataset consists of text.
    • Video: if the dataset consists of videos.
Fig 2: Data source options for dataset

Now we can view & analyze the uploaded dataset

  • For classification, regression, or forecasting problems, choose Tabular -> Classification/Regression or Tabular -> Forecasting.
  • Upload the dataset from your local computer (chargeable), or use BigQuery directly if the dataset is already stored there.
  • After the upload completes, click Analyze. With a single click, it reports the unique and missing values in the dataset.
Fig 3: Analyze the dataset
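
The kind of summary that the Analyze step produces can be approximated locally before uploading. The following is a minimal sketch, not Vertex AI's own implementation: it counts unique and missing values per column using only the standard library. The `profile_columns` helper and the sample rows are made up for illustration; the column names merely follow the Titanic dataset.

```python
import csv
import io


def profile_columns(csv_text):
    """Return {column: {"unique": n, "missing": m}} for CSV data given as text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    stats = {name: {"values": set(), "missing": 0} for name in reader.fieldnames}
    for row in reader:
        for name in reader.fieldnames:
            value = (row[name] or "").strip()
            if value == "":
                # Empty cells are treated as missing values.
                stats[name]["missing"] += 1
            else:
                stats[name]["values"].add(value)
    return {
        name: {"unique": len(s["values"]), "missing": s["missing"]}
        for name, s in stats.items()
    }


# Tiny made-up sample with Titanic-style column names (not real rows).
SAMPLE = """PassengerId,Survived,Pclass,Sex,Age
1,0,3,male,22
2,1,1,female,
3,1,3,female,26
4,0,3,male,
"""

report = profile_columns(SAMPLE)
for column, counts in report.items():
    print(column, counts)
```

Running a check like this first makes it easier to spot columns that need cleaning before the dataset is uploaded.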

Step 2: AutoML in Vertex AI

  • Go to Training in Vertex AI and click Create.
  • Please note: AutoML does not perform well on messy or complex datasets. It needs clean, preprocessed data, so complete all preprocessing before uploading the dataset to Vertex AI.
  • Select the region where the training should run.
Fig 4: Creating the training for AutoML

Train new model

A new pop-up titled 'Train new model' opens.

1. Training Methods


  • First, select the dataset from the dropdown.
  • Then select the Objective, either Classification or Regression.
  • Then select the model training method: AutoML or Custom training.
    • Here, select AutoML for automatic training and click Continue.
    • Training runs on Google Cloud, so it continues in the background even if you get disconnected from the internet.

Fig 5: Selection options of AutoML


2. Model Details

  • Model details offers two options:
    • Train new model: select this when training a model for the first time.
    • Train new version: select this to retrain an existing model as a new version.
  • In other words, an existing model can be retrained by creating a new version of it; otherwise, train a new model.
  • To train a new model, provide a name and description, and select the Target Column from the dataset chosen above.
Fig 6: Selection of Target column
  • Data Split: there are three options:
    1. Random: randomly splits the data into 80% training, 10% validation, and 10% testing.
    2. Manual: lets you define the percentage of data assigned to training, validation, and testing yourself.
    3. Chronological: assigns the first 80% of the data to training, the next 10% to validation, and the last 10% to testing.
Fig 7: AutoML (advanced option)
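
The difference between the random and chronological options can be sketched in a few lines of plain Python. This is only an illustration of the 80/10/10 idea, not how Vertex AI performs the split internally; the helper names and the `day` field are hypothetical.

```python
import random


def _cut(rows, train_pct, validation_pct):
    """Slice an ordered list into train / validation / test parts."""
    n_train = len(rows) * train_pct // 100
    n_val = len(rows) * validation_pct // 100
    return rows[:n_train], rows[n_train:n_train + n_val], rows[n_train + n_val:]


def random_split(rows, seed=0, train_pct=80, validation_pct=10):
    """Shuffle the rows, then cut them 80/10/10 (test takes the remainder)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    return _cut(shuffled, train_pct, validation_pct)


def chronological_split(rows, time_key, train_pct=80, validation_pct=10):
    """Order the rows by time, then cut them so the newest rows end up in test."""
    ordered = sorted(rows, key=time_key)
    return _cut(ordered, train_pct, validation_pct)


# Ten made-up rows whose "day" values are deliberately out of order.
rows = [{"id": i, "day": day} for i, day in enumerate([5, 2, 9, 1, 7, 3, 10, 4, 8, 6])]

train, validation, test = chronological_split(rows, time_key=lambda r: r["day"])
print([r["day"] for r in train], [r["day"] for r in validation], [r["day"] for r in test])

r_train, r_validation, r_test = random_split(rows)
print(len(r_train), len(r_validation), len(r_test))
```

A chronological split keeps the most recent rows for testing, which matters when the model will be used on data that arrives after training.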

3. Join Featurestore

  • Join Featurestore is optional, so this step can be skipped.
  • A feature store makes features reusable across models. No features from a feature store are added in this walkthrough.

4. Training options

  • In training options, exclude any unnecessary independent variables by clicking the minus icon next to them.
  • Click Advanced Options. The weight column lets you give some rows more importance than others during training, if required. Choose the optimization objective as required. Here, Log loss is used.
Fig 8: Defining Optimization techniques
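
To make the Log loss objective and the effect of a weight column more concrete, here is a minimal sketch of weighted binary log loss in plain Python. It is the standard textbook formula, not code taken from Vertex AI, and the labels, probabilities, and weights below are made up.

```python
import math


def log_loss(labels, probabilities, weights=None, eps=1e-15):
    """Weighted binary log loss; labels are 0/1, probabilities are P(label == 1)."""
    if weights is None:
        weights = [1.0] * len(labels)
    total = 0.0
    for y, p, w in zip(labels, probabilities, weights):
        # Clip probabilities so that log(0) never occurs.
        p = min(max(p, eps), 1 - eps)
        total += -w * (y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / sum(weights)


labels = [1, 0, 1, 0]
probabilities = [0.9, 0.1, 0.8, 0.4]

unweighted = log_loss(labels, probabilities)
# Giving the last (worst predicted) row three times the weight raises the loss.
weighted = log_loss(labels, probabilities, weights=[1, 1, 1, 3])
print(unweighted, weighted)
```

Lower values are better: confident, correct predictions contribute almost nothing, while confident mistakes are penalized heavily.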

5. Compute and pricing

  • In compute and pricing, enter a training budget. The budget ranges from 1 to 72 node hours, where 1 means one node hour. Enable early stopping so that training ends once the model stops improving, which is good practice.
  • Click Start Training.
Fig 9: Compute and pricing
  • After the job completes, it is listed under Training.
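
The same choices made in the console (target column, data split, budget, early stopping) can also be expressed in code. The sketch below assumes the `google-cloud-aiplatform` Python SDK; the display names, the `build_training_config` helper, and the `run_automl_training` wrapper are hypothetical. The SDK is imported lazily inside the second function, which is defined but not called here, because running it needs credentials, a billing-enabled project, and a CSV in Cloud Storage.

```python
def build_training_config(target_column, budget_node_hours=1,
                          train=0.8, validation=0.1, test=0.1,
                          early_stopping=True):
    """Collect the console choices as keyword arguments for the training run."""
    if not 1 <= budget_node_hours <= 72:
        raise ValueError("budget must be between 1 and 72 node hours")
    if abs(train + validation + test - 1.0) > 1e-9:
        raise ValueError("split fractions must sum to 1")
    return {
        "target_column": target_column,
        "training_fraction_split": train,
        "validation_fraction_split": validation,
        "test_fraction_split": test,
        # The SDK expresses the budget in milli node hours: 1000 = 1 node hour.
        "budget_milli_node_hours": int(budget_node_hours * 1000),
        "disable_early_stopping": not early_stopping,
    }


def run_automl_training(project, location, gcs_csv_uri, config):
    """Assumed SDK usage; needs google-cloud-aiplatform and credentials."""
    from google.cloud import aiplatform  # lazy import keeps the helper above offline

    aiplatform.init(project=project, location=location)
    dataset = aiplatform.TabularDataset.create(
        display_name="titanic",          # hypothetical display name
        gcs_source=[gcs_csv_uri],
    )
    job = aiplatform.AutoMLTabularTrainingJob(
        display_name="titanic-automl",   # hypothetical display name
        optimization_prediction_type="classification",
        optimization_objective="minimize-log-loss",
    )
    return job.run(dataset=dataset, model_display_name="Titanic", **config)


config = build_training_config("Survived", budget_node_hours=1)
print(config)
```

Keeping the configuration in a small dictionary makes it easy to review the budget and split fractions before any billable job is started.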

Step 3: Model Registry

  • Once training completes, the model is available in Model Registry.
  • A model registry is a repository that stores trained machine learning models along with their versions.
  • Here, all models are displayed together with their versions.
  • Click the name of the model assigned during training, i.e. Titanic.
Fig 10: Model Registry

Step 4: Evaluate

  • The version of the model is shown. If a model has several versions, switch between them by clicking Versions.
  • Evaluation reports can be created for different versions to see which one is more accurate. The results are presented as visualizations.

Fig 11: Evaluation of model
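
Comparing two versions comes down to computing the same metrics for each on the same test labels. The following sketch does this for accuracy, precision, and recall in plain Python; the labels and the predictions of the two versions are made up, and the evaluation report in the console shows more metrics than these three.

```python
def classification_metrics(labels, predictions):
    """Accuracy, precision and recall for binary 0/1 labels and predictions."""
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)
    tn = sum(1 for y, p in pairs if y == 0 and p == 0)
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Made-up test labels and the predictions of two hypothetical model versions.
labels = [1, 0, 1, 1, 0, 0, 1, 0]
predictions = {
    "version 1": [1, 0, 0, 1, 0, 1, 1, 0],
    "version 2": [1, 0, 1, 1, 0, 0, 0, 0],
}

results = {name: classification_metrics(labels, preds) for name, preds in predictions.items()}
best = max(results, key=lambda name: results[name]["accuracy"])
for name, metrics in results.items():
    print(name, metrics)
print("most accurate:", best)
```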


Step 5: Deploy and Test

  • An endpoint is an API (Application Programming Interface) through which you interact with your deployed machine learning model. External applications, services, or users send data to the endpoint for inference (predictions or classifications) and receive the model's responses.
  • Click Deploy & Test -> Deploy to Endpoint.
  • Creating a container is needed only for manually (custom) trained models, not for AutoML models.
Fig 12: Deploy and Test

1. Define your endpoint

  • Create a new endpoint if you do not already have one.
  • Fill in the name and location.
  • There are two access types: Standard exposes the endpoint through a REST API, while Private is for your private cloud (if purchased).
  • Under Advanced, choose the encryption type: Google-managed encryption or customer-managed encryption. Google-managed is the default.

Fig 13: Creation of Endpoint



2. Model Settings

  • Set the Traffic Split. The values must add up to 100 (details in the next section).
  • Minimum number of compute nodes is the number of nodes that keep running continuously, even without traffic. This can increase cost but avoids dropped requests caused by node initialization.
Fig 14: Traffic Split

What is Traffic split in Vertex AI during Model deployment?

Traffic split refers to the distribution of inference requests (also known as traffic) across different versions of a deployed machine learning model. When you deploy multiple versions of a model, you can control how much traffic each version receives. For example, you might direct 80% of the traffic to the current production version and 20% to a new experimental version. In short, if you deploy multiple versions, set the traffic distribution between them accordingly.
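
The 80/20 example can be simulated in plain Python to show what a traffic split means in practice. This is a conceptual sketch only, not how Vertex AI routes requests internally; the version names and helper functions are hypothetical.

```python
import random


def validate_traffic_split(split):
    """Check that the percentages are non-negative and add up to 100."""
    if any(percent < 0 for percent in split.values()):
        raise ValueError("traffic percentages must not be negative")
    if sum(split.values()) != 100:
        raise ValueError("traffic percentages must add up to 100")
    return split


def route_request(split, rng):
    """Pick a model version with probability proportional to its traffic share."""
    versions = list(split)
    return rng.choices(versions, weights=[split[v] for v in versions], k=1)[0]


split = validate_traffic_split({"production": 80, "experimental": 20})

rng = random.Random(42)
counts = {"production": 0, "experimental": 0}
for _ in range(10_000):
    counts[route_request(split, rng)] += 1
print(counts)
```

Over many requests, the share handled by each version approaches the configured percentages, which is why a small share is a common way to try out a new version safely.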

  • Explainability options are particularly important for complex models, such as deep neural networks, that are often considered "black-box" models because of their intricate internal workings. Explainability helps data scientists, developers, and stakeholders gain confidence in the model's decisions. Turning it on is recommended so that it can be tested after deployment.
  • Click on Feature attributions.
Fig 15: Feature attributions

Model in Production

Once the model is in production, it needs continuous monitoring to ensure it performs as expected. Monitoring sends an email report to the given email address every x days.

  • Click Deploy.
  • After a successful deployment, the model can be tested.
  • To test the model, provide input values and check the predicted output.
Fig 16: Model Testing
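
The test form in the console sends the input values to the endpoint as a prediction request. The sketch below only shows how such a request could be shaped for the REST `predict` method and does not send anything. The project ID, region, endpoint ID, and feature values are placeholders, passing feature values as strings is an assumption, and a real call would also need an OAuth access token in the `Authorization` header.

```python
import json


def build_predict_request(project, location, endpoint_id, instances):
    """Return the URL and JSON body for an online prediction request (not sent)."""
    url = (
        f"https://{location}-aiplatform.googleapis.com/v1/"
        f"projects/{project}/locations/{location}/endpoints/{endpoint_id}:predict"
    )
    body = json.dumps({"instances": instances})
    return url, body


# Placeholder identifiers and one made-up passenger with Titanic-style features.
url, body = build_predict_request(
    project="my-project",
    location="us-central1",
    endpoint_id="1234567890",
    instances=[{"Pclass": "3", "Sex": "male", "Age": "22"}],
)
print(url)
print(body)
```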


Your model is now ready.

Calling the endpoint through its API is a large topic of its own, so the walkthrough concludes here.


swarnavo09