Skip to content
geeksforgeeks
  • Tutorials
    • Python
    • Java
    • Data Structures & Algorithms
    • ML & Data Science
    • Interview Corner
    • Programming Languages
    • Web Development
    • CS Subjects
    • DevOps And Linux
    • School Learning
    • Practice Coding Problems
  • Courses
    • DSA to Development
    • Get IBM Certification
    • Newly Launched!
      • Master Django Framework
      • Become AWS Certified
    • For Working Professionals
      • Interview 101: DSA & System Design
      • Data Science Training Program
      • JAVA Backend Development (Live)
      • DevOps Engineering (LIVE)
      • Data Structures & Algorithms in Python
    • For Students
      • Placement Preparation Course
      • Data Science (Live)
      • Data Structure & Algorithm-Self Paced (C++/JAVA)
      • Master Competitive Programming (Live)
      • Full Stack Development with React & Node JS (Live)
    • Full Stack Development
    • Data Science Program
    • All Courses
  • Aptitude
  • Engineering Mathematics
  • Discrete Mathematics
  • Operating System
  • DBMS
  • Computer Networks
  • Digital Logic and Design
  • C Programming
  • Data Structures
  • Algorithms
  • Theory of Computation
  • Compiler Design
  • Computer Org and Architecture
Open In App
Next Article:
Error Handling in Compiler Design
Next article icon

Symbol Table in Compiler

Last Updated : 25 Jan, 2025
Comments
Improve
Suggest changes
Like Article
Like
Report

Every compiler uses a symbol table to track all variables, functions, and identifiers in a program. It stores information such as the name, type, scope, and memory location of each identifier. Built during the early stages of compilation, the symbol table supports error checking, scope management, and code optimization for runtime efficiency. It plays a crucial role in ensuring the correct usage of identifiers according to language rules.

Role of Symbol Table in Compiler Phases

The symbol table acts as a bridge between the analysis and synthesis phases of the compiler. It collects information during the analysis phases and utilizes it during the synthesis phases to generate efficient code, ultimately enhancing compile-time performance.

It is used by various phases of the compiler as follows:-

  • Lexical Analysis: Creates new table entries in the table, for example, entries about tokens.
  • Syntax Analysis: Adds information regarding attribute type, scope, dimension, line of reference, use, etc in the table.
  • Semantic Analysis: Uses available information in the table to check for semantics i.e. to verify that expressions and assignments are semantically correct(type checking) and update it accordingly.
  • Intermediate Code Generation: Refers to symbol table for knowing how much and what type of run-time is allocated and table helps in adding temporary variable information.
  • Code Optimization: Uses information present in the symbol table for machine-dependent optimization.
  • Target Code generation: Generates code by using the address information of the identifier present in the table.
  • Symbol Table entries - Each entry in the symbol table is associated with attributes that support the compiler in different phases. 

Example of Using Symbol Table

Imagine a program that includes a series of mathematical expressions, such as:

  • A variable distance representing the distance traveled.
  • A constant pi representing the value of Pi.
  • A function calculateArea that computes the area of a circle.

Name

Type

scope

Memory address

value

Additional Info

distance

variable

Global

0x1000

Uninitialized

Data type: float

pi

constant

Global

0x1004

3.14159

Data type: float, read-only

calculateArea

function

Global

0x1008

N/A

Return type: float

radius

parameter

Local

0x2000

0x1000

Data type: float

In this example:

  • The symbol table records that distance is a global variable of type float that has not been initialized.
  • Pi is a global value of type float with a constant value of 3.14159 and is marked as read-only.
  • It registers the function calculateArea that returns a value of type float.
  • The parameter radius is declared as a local variable - in the scope of the function - also of type float.
  • It is this organization that serves the compiler when it does various tasks, such as checking for type errors, optimization of code, knowing the value of pi because it is a constant, declaring and using variables according to its scope.

Items stored in Symbol table

  • Variable names and constants
  • Procedure and function names
  • Literal constants and strings
  • Compiler generated temporaries
  • Labels in source languages

Information used by the compiler from Symbol table

  • Data type and name
  • Declaring procedures
  • Offset in storage
  • If structure or record then, a pointer to structure table.
  • For parameters, whether parameter passing by value or by reference
  • Number and type of arguments passed to function
  • Base Address

Operations of Symbol table

The basic operations defined on a symbol table include

Operations of Symbol Table

Operations on Symbol Table

Following operations can be performed on symbol table-

  • Insertion of an item in the symbol table.
  • Deletion of any item from the symbol table.
  • Searching of desired item from symbol table.

Implementation of Symbol table

Following are commonly used data structures for implementing symbol table:

List

We use a single array or equivalently several arrays, to store names and their associated information ,New names are added to the list in the order in which they are encountered . The position of the end of the array is marked by the pointer available, pointing to where the next symbol-table entry will go.  The search for a name proceeds backwards from the end of the array to the beginning. when the name is located the associated information can be found in the words following next. 

id1info1id2info2........id_ninfo_n
  • In this method, an array is used to store names and associated information.
  • A pointer "available" is maintained at end of all stored records and new names are added in the order as they arrive
  • To search for a name we start from the beginning of the list till available pointer and if not found we get an error "use of the undeclared name"
  • While inserting a new name we must ensure that it is not already present otherwise an error occurs i.e. "Multiple defined names"
  • Insertion is fast O(1), but lookup is slow for large tables – O(n) on average
  • The advantage is that it takes a minimum amount of space.

Linked List

  • This implementation is using a linked list. A link field is added to each record.
  • Searching of names is done in order pointed by the link of the link field.
  • A pointer "First" is maintained to point to the first record of the symbol table.
  • Insertion is fast O(1), but lookup is slow for large tables – O(n) on average

Hash Table

  • In hashing scheme, two tables are maintained - a hash table and symbol table and are the most commonly used method to implement symbol tables. A hash table is an array with an index range: 0 to table size – 1. These entries are pointers pointing to the names of the symbol table.
  • To search for a name we use a hash function that will result in an integer between 0 to table size – 1.
  • Insertion and lookup can be made very fast – O(1).
  • The advantage is quick to search is possible and the disadvantage is that hashing is complicated to implement.

Binary Search Tree

  • Another approach to implementing a symbol table is to use a binary search tree i.e. we add two link fields i.e. left and right child.
  • All names are created as child of the root node that always follows the property of the binary search tree.
  • Insertion and lookup are O(log 2 n) on average.

Please refer C++ Program to implement Symbol Table for implementation.

Advantages of Symbol Table 

  • The efficiency of a program can be increased by using symbol tables, which give quick and simple access to crucial data such as variable and function names, data kinds, and memory locations.
  • better coding structure Symbol tables can be used to organize and simplify code, making it simpler to comprehend, discover, and correct problems.
  • Faster code execution: By offering quick access to information like memory addresses, symbol tables can be utilized to optimize code execution by lowering the number of memory accesses required during execution.
  • Symbol tables can be used to increase the portability of code by offering a standardized method of storing and retrieving data, which can make it simpler to migrate code between other systems or programming languages.
  • Improved code reuse: By offering a standardized method of storing and accessing information, symbol tables can be utilized to increase the reuse of code across multiple projects.
  • Symbol tables can be used to facilitate easy access to and examination of a program's state during execution, enhancing debugging by making it simpler to identify and correct mistakes.

Disadvantages of Symbol Table

  • Increased memory consumption: Systems with low memory resources may suffer from symbol tables' high memory requirements.
  • Increased processing time: The creation and processing of symbol tables can take a long time, which can be problematic in systems with constrained processing power.
  • Complexity: Developers who are not familiar with compiler design may find symbol tables difficult to construct and maintain.
  • Limited scalability: Symbol tables may not be appropriate for large-scale projects or applications that require o the management of enormous amounts of data due to their limited scalability.
  • Upkeep: Maintaining and updating symbol tables on a regular basis can be time- and resource-consuming.
  • Limited functionality: It's possible that symbol tables don't offer all the features a developer needs, and therefore more tools or libraries will be needed to round out their capabilities.

Applications of Symbol Table

  • Resolution of variable and function names: Symbol tables are used to identify the data types and memory locations of variables and functions as well as to resolve their names.
  • Resolution of scope issues: To resolve naming conflicts and ascertain the range of variables and functions, symbol tables are utilized.
  • Symbol tables, which offer quick access to information such as memory locations, are used to optimize code execution.
  • Code generation: By giving details like memory locations and data kinds, symbol tables are utilized to create machine code from source code.
  • Error checking and code debugging: By supplying details about the status of a program during execution, symbol tables are used to check for faults and debug code.
  • Code organization and documentation: By supplying details about a program's structure, symbol tables can be used to organize code and make it simpler to understand.

Next Article
Error Handling in Compiler Design

A

Ankit87
Improve
Article Tags :
  • Misc
  • Technical Scripter
  • Compiler Design
  • GATE CS
Practice Tags :
  • Misc

Similar Reads

    Introduction of Compiler Design
    A compiler is software that translates or converts a program written in a high-level language (Source Language) into a low-level language (Machine Language or Assembly Language). Compiler design is the process of developing a compiler.The development of compilers is closely tied to the evolution of
    9 min read

    Compiler Design Basics

    Introduction of Compiler Design
    A compiler is software that translates or converts a program written in a high-level language (Source Language) into a low-level language (Machine Language or Assembly Language). Compiler design is the process of developing a compiler.The development of compilers is closely tied to the evolution of
    9 min read
    Compiler construction tools
    The compiler writer can use some specialized tools that help in implementing various phases of a compiler. These tools assist in the creation of an entire compiler or its parts. Some commonly used compiler construction tools include: Parser Generator - It produces syntax analyzers (parsers) from the
    4 min read
    Phases of a Compiler
    A compiler is a software tool that converts high-level programming code into machine code that a computer can understand and execute. It acts as a bridge between human-readable code and machine-level instructions, enabling efficient program execution. The process of compilation is divided into six p
    10 min read
    Symbol Table in Compiler
    Every compiler uses a symbol table to track all variables, functions, and identifiers in a program. It stores information such as the name, type, scope, and memory location of each identifier. Built during the early stages of compilation, the symbol table supports error checking, scope management, a
    8 min read
    Error Handling in Compiler Design
    During the process of language translation, the compiler can encounter errors. While the compiler might not always know the exact cause of the error, it can detect and analyze the visible problems. The main purpose of error handling is to assist the programmer by pointing out issues in their code. E
    5 min read
    Language Processors: Assembler, Compiler and Interpreter
    Computer programs are generally written in high-level languages (like C++, Python, and Java). A language processor, or language translator, is a computer program that convert source code from one programming language to another language or to machine code (also known as object code). They also find
    5 min read
    Generation of Programming Languages
    Programming languages have evolved significantly over time, moving from fundamental machine-specific code to complex languages that are simpler to write and understand. Each new generation of programming languages has improved, allowing developers to create more efficient, human-readable, and adapta
    6 min read

    Lexical Analysis

    Introduction of Lexical Analysis
    Lexical analysis, also known as scanning is the first phase of a compiler which involves reading the source program character by character from left to right and organizing them into tokens. Tokens are meaningful sequences of characters. There are usually only a small number of tokens for a programm
    6 min read
    Flex (Fast Lexical Analyzer Generator)
    Flex (Fast Lexical Analyzer Generator), or simply Flex, is a tool for generating lexical analyzers scanners or lexers. Written by Vern Paxson in C, circa 1987, Flex is designed to produce lexical analyzers that is faster than the original Lex program. Today it is often used along with Berkeley Yacc
    7 min read
    Introduction of Finite Automata
    Finite automata are abstract machines used to recognize patterns in input sequences, forming the basis for understanding regular languages in computer science. They consist of states, transitions, and input symbols, processing each symbol step-by-step. If the machine ends in an accepting state after
    4 min read
    Classification of Context Free Grammars
    A Context-Free Grammar (CFG) is a formal rule system used to describe the syntax of programming languages in compiler design. It provides a set of production rules that specify how symbols (terminals and non-terminals) can be combined to form valid sentences in the language. CFGs are important in th
    4 min read
    Ambiguous Grammar
    Context-Free Grammars (CFGs) is a way to describe the structure of a language, such as the rules for building sentences in a language or programming code. These rules help define how different symbols can be combined to create valid strings (sequences of symbols).CFGs can be divided into two types b
    7 min read

    Syntax Analysis & Parsers

    Introduction to Syntax Analysis in Compiler Design
    Syntax Analysis (also known as parsing) is the step after Lexical Analysis. The Lexical analysis breaks source code into tokens.Tokens are inputs for Syntax Analysis.The goal of Syntax Analysis is to interpret the meaning of these tokens. It checks whether the tokens produced by the lexical analyzer
    7 min read
    FIRST and FOLLOW in Compiler Design
    In compiler design, FIRST and FOLLOW are two sets used to help parsers understand how to process a grammar.FIRST Set: The FIRST set of a non-terminal contains all the terminal symbols that can appear at the beginning of any string derived from that non-terminal. In other words, it tells us which ter
    6 min read
    Parsing - Introduction to Parsers
    Parsing, also known as syntactic analysis, is the process of analyzing a sequence of tokens to determine the grammatical structure of a program. It takes the stream of tokens, which are generated by a lexical analyzer or tokenizer, and organizes them into a parse tree or syntax tree.The parse tree v
    6 min read
    Construction of LL(1) Parsing Table
    Parsing is an essential part of computer science, especially in compilers and interpreters. From the various parsing techniques, LL(1) parsing is best. It uses a predictive, top-down approach. This allows efficient parsing without backtracking. This article will explore parsing and LL(1) parsing. It
    6 min read

    Syntax Directed Translation & Intermediate Code Generation

    Syntax Directed Translation in Compiler Design
    Syntax-Directed Translation (SDT) is a method used in compiler design to convert source code into another form while analyzing its structure. It integrates syntax analysis (parsing) with semantic rules to produce intermediate code, machine code, or optimized instructions.In SDT, each grammar rule is
    8 min read
    S - Attributed and L - Attributed SDTs in Syntax Directed Translation
    In Syntax-Directed Translation (SDT), the rules are those that are used to describe how the semantic information flows from one node to the other during the parsing phase. SDTs are derived from context-free grammars where referring semantic actions are connected to grammar productions. Such action c
    4 min read
    Parse Tree and Syntax Tree
    Parse Tree and Syntax tree are tree structures that represent the structure of a given input according to a formal grammar. They play an important role in understanding and verifying whether an input string aligns with the language defined by a grammar. These terms are often used interchangeably but
    4 min read
    Intermediate Code Generation in Compiler Design
    In the analysis-synthesis model of a compiler, the front end of a compiler translates a source program into an independent intermediate code, then the back end of the compiler uses this intermediate code to generate the target code (which can be understood by the machine). The benefits of using mach
    6 min read
    Issues in the design of a code generator
    A code generator is a crucial part of a compiler that converts the intermediate representation of source code into machine-readable instructions. Its main task is to produce the correct and efficient code that can be executed by a computer. The design of the code generator should ensure that it is e
    7 min read
    Three address code in Compiler
    TAC is an intermediate representation of three-address code utilized by compilers to ease the process of code generation. Complex expressions are, therefore, decomposed into simple steps comprising, at most, three addresses: two operands and one result using this code. The results from TAC are alway
    6 min read
    Data flow analysis in Compiler
    Data flow is analysis that determines the information regarding the definition and use of data in program. With the help of this analysis, optimization can be done. In general, its process in which values are computed using data flow analysis. The data flow property represents information that can b
    6 min read

    Code Optimization & Runtime Environments

    Code Optimization in Compiler Design
    Code optimization is a crucial phase in compiler design aimed at enhancing the performance and efficiency of the executable code. By improving the quality of the generated machine code optimizations can reduce execution time, minimize resource usage, and improve overall system performance. This proc
    9 min read
    Introduction of Object Code in Compiler Design
    Let assume that you have a C program then, you give it to the compiler and compiler will produce the output in assembly code. Now, that assembly language code will be given to the assembler and assembler will produce some code and that code is known as Object Code. Object CodeObject Code is a key co
    6 min read
    Static and Dynamic Scoping
    The scope of a variable x in the region of the program in which the use of x refers to its declaration. One of the basic reasons for scoping is to keep variables in different parts of the program distinct from one another. Since there are only a small number of short variable names, and programmers
    6 min read
    Runtime Environments in Compiler Design
    A translation needs to relate the static source text of a program to the dynamic actions that must occur at runtime to implement the program. The program consists of names for procedures, identifiers, etc., that require mapping with the actual memory location at runtime. Runtime environment is a sta
    8 min read
    Linker
    A linker is an essential tool in the process of compiling a program. It helps combine various object modules (output from the assembler) into a single executable file that can be run on a system. The linker’s job is to manage and connect different pieces of code and data, ensuring that all reference
    8 min read
    Loader in C/C++
    The loader is the program of the operating system which loads the executable from the disk into the primary memory(RAM) for execution. It allocates the memory space to the executable module in the main memory and then transfers control to the beginning instruction of the program. The loader is an im
    3 min read

    Practice Questions

    Last Minute Notes - Compiler Design
    In computer science, compiler design is the study of how to build a compiler, which is a program that translates high-level programming languages (like Python, C++, or Java) into machine code that a computer's hardware can execute directly. The focus is on how the translation happens, ensuring corre
    13 min read
    Compiler Design - GATE CSE Previous Year Questions
    In this article, we are mainly focusing on the Compiler Design GATE Questions that have been asked in Previous Years, with their solutions. And where an explanation is required, we have also provided the reason. Topic-Wise Quizzes to Practice Previous Year's QuestionsLexical AnalysisParsingSyntax-Di
    1 min read
geeksforgeeks-footer-logo
Corporate & Communications Address:
A-143, 7th Floor, Sovereign Corporate Tower, Sector- 136, Noida, Uttar Pradesh (201305)
Registered Address:
K 061, Tower K, Gulshan Vivante Apartment, Sector 137, Noida, Gautam Buddh Nagar, Uttar Pradesh, 201305
GFG App on Play Store GFG App on App Store
Advertise with us
  • Company
  • About Us
  • Legal
  • Privacy Policy
  • In Media
  • Contact Us
  • Advertise with us
  • GFG Corporate Solution
  • Placement Training Program
  • Languages
  • Python
  • Java
  • C++
  • PHP
  • GoLang
  • SQL
  • R Language
  • Android Tutorial
  • Tutorials Archive
  • DSA
  • Data Structures
  • Algorithms
  • DSA for Beginners
  • Basic DSA Problems
  • DSA Roadmap
  • Top 100 DSA Interview Problems
  • DSA Roadmap by Sandeep Jain
  • All Cheat Sheets
  • Data Science & ML
  • Data Science With Python
  • Data Science For Beginner
  • Machine Learning
  • ML Maths
  • Data Visualisation
  • Pandas
  • NumPy
  • NLP
  • Deep Learning
  • Web Technologies
  • HTML
  • CSS
  • JavaScript
  • TypeScript
  • ReactJS
  • NextJS
  • Bootstrap
  • Web Design
  • Python Tutorial
  • Python Programming Examples
  • Python Projects
  • Python Tkinter
  • Python Web Scraping
  • OpenCV Tutorial
  • Python Interview Question
  • Django
  • Computer Science
  • Operating Systems
  • Computer Network
  • Database Management System
  • Software Engineering
  • Digital Logic Design
  • Engineering Maths
  • Software Development
  • Software Testing
  • DevOps
  • Git
  • Linux
  • AWS
  • Docker
  • Kubernetes
  • Azure
  • GCP
  • DevOps Roadmap
  • System Design
  • High Level Design
  • Low Level Design
  • UML Diagrams
  • Interview Guide
  • Design Patterns
  • OOAD
  • System Design Bootcamp
  • Interview Questions
  • Inteview Preparation
  • Competitive Programming
  • Top DS or Algo for CP
  • Company-Wise Recruitment Process
  • Company-Wise Preparation
  • Aptitude Preparation
  • Puzzles
  • School Subjects
  • Mathematics
  • Physics
  • Chemistry
  • Biology
  • Social Science
  • English Grammar
  • Commerce
  • World GK
  • GeeksforGeeks Videos
  • DSA
  • Python
  • Java
  • C++
  • Web Development
  • Data Science
  • CS Subjects
@GeeksforGeeks, Sanchhaya Education Private Limited, All rights reserved
We use cookies to ensure you have the best browsing experience on our website. By using our site, you acknowledge that you have read and understood our Cookie Policy & Privacy Policy
Lightbox
Improvement
Suggest Changes
Help us improve. Share your suggestions to enhance the article. Contribute your expertise and make a difference in the GeeksforGeeks portal.
geeksforgeeks-suggest-icon
Create Improvement
Enhance the article with your expertise. Contribute to the GeeksforGeeks community and help create better learning resources for all.
geeksforgeeks-improvement-icon
Suggest Changes
min 4 words, max Words Limit:1000

Thank You!

Your suggestions are valuable to us.

What kind of Experience do you want to share?

Interview Experiences
Admission Experiences
Career Journeys
Work Experiences
Campus Experiences
Competitive Exam Experiences