Data Abstraction and Data Independence
Last Updated : 15 Feb, 2024
Database systems comprise complex data structures. In order to make the system efficient in terms of retrieval of data, and reduce complexity in terms of usability of users, developers use abstraction i.e. hide irrelevant details from the users. This approach simplifies database design.
Level of Abstraction in a DBMS
There are mainly 3 levels of data abstraction:
- Physical or Internal Level
- Logical or Conceptual Level
- View or External Level
Physical or Internal Level
This is the lowest level of data abstraction. It tells us how the data is actually stored in memory. Access methods like sequential or random access and file organization methods like B+ trees and hashing are used for the same. Usability, size of memory, and the number of times the records are factors that we need to know while designing the database.
Suppose we need to store the details of an employee. Blocks of storage and the amount of memory used for these purposes are kept hidden from the user.
Logical or Conceptual Level
This level comprises the information that is actually stored in the database in the form of tables. It also stores the relationship among the data entities in relatively simple structures. At this level, the information available to the user at the view level is unknown.
We can store the various attributes of an employee and relationships, e.g. with the manager can also be stored.
The logical level thus describes the entire database in terms of a small number of relatively simple structures. Although implementation of the simple structures at the logical level may involve complex physical-level structures, the user of the logical level does not need to be aware of this complexity. This is referred to as physical data independence. Database administrators, who must decide what information to keep in the database, use the logical level of abstraction.
View or External Level
This is the highest level of abstraction. Only a part of the actual database is viewed by the users. This level exists to ease the accessibility of the database by an individual user. Users view data in the form of rows and columns. Tables and relations are used to store data. Multiple views of the same database may exist. Users can just view the data and interact with the database, storage and implementation details are hidden from them. Even though the logical level uses simpler structures, complexity remains because of the variety of information stored in a large database. Many users of the database system do not need all this information; instead, they need to access only a part of the database. The view level of abstraction exists to simplify their interaction with the system
Example: In case of storing customer data,
- Physical level - it will contains block of storages (bytes,GB,TB,etc)
- Logical level - it will contain the fields and the attributes of data.
- View level - it works with CLI or GUI access of database
Data Abstraction The main purpose of data abstraction is to achieve data independence in order to save the time and cost required when the database is modified or altered.
Data Independence
Data Independence is mainly defined as a property of DBMS that helps you to change the database schema at one level of a system without requiring to change the schema at the next level. it helps to keep the data separated from all program that makes use of it.
We have namely two levels of data independence arising from these levels of abstraction:
- Physical level data independence
- Logical level data independence
Data IndependencePhysical Level Data Independence
It refers to the characteristic of being able to modify the physical schema without any alterations to the conceptual or logical schema, done for optimization purposes, e.g., the Conceptual structure of the database would not be affected by any change in storage size of the database system server. Changing from sequential to random access files is one such example. These alterations or modifications to the physical structure may include:
- Utilizing new storage devices.
- Modifying data structures used for storage.
- Altering indexes or using alternative file organization techniques etc.
Logical Level Data Independence
It refers characteristic of being able to modify the logical schema without affecting the external schema or application program. The user view of the data would not be affected by any changes to the conceptual view of the data. These changes may include insertion or deletion of attributes, altering table structures entities or relationships to the logical schema, etc.
Similar Reads
Physical and Logical Data Independence 1. Physical Data Independence : Physical Data Independence is defined as the ability to make changes in the structure of the lowest level of the Database Management System (DBMS) without affecting the higher-level schemas. Hence, modification in the Physical level should not result in any changes in
2 min read
What is Data Independence in DBMS? Data independence is a property of a database management system by which we can change the database schema at one level of the database system without changing the database schema at the next higher level. In this article, we will learn in full detail about data independence and will also see its ty
5 min read
Difference between Physical and Logical Data Independence Data Independence therefore refers to the nature whereby one can change the structure of the database without having to change its implementation or data. There are two types of data independence: The first type is the Physical one as well as the second type is the Logical one. As will be seen, both
6 min read
Data Architecture Design and Data Management Data architecture design is like a detailed plan for how to handle data in a company, showing the steps for gathering, storing, accessing, and using data. This plan helps keep data neat and well-organized. Data management adds to this by taking care of data from start to finish, including collecting
7 min read
Difference between Data and Metadata The term 'data' may also refer to various forms such i.e. numbers, text, images, or audio that provide information concerning facts, phenomena, or even concepts. In other words, it is the basic information that is subjected to processing or interpretation to find meaning or generate products. Metada
6 min read