What is Data Acquisition?
Data Acquisition is the process of extracting the relevant business information, transforming data into a required business format and loading into the target system. A data acquisition defines Data extraction, Data Transformation and Data Loading.
Data Acquisition can be performed by two types of ETL (Extract, Transform, Load) types. They are
- Code based ETL : ETL applications which can be developed using programming languages like SQL, PLSL etc are code based ETL.
- Example : – BASE SAS, SAS ACCESS etc.
- GUI based ETL : ETL applications which are developed using graphical user interface, point and click techniques are known as GUI based ETL applications.
- Example :- Data stage, AB Initio, Informatica, ODI (Oracle Data Integration), data services, data manager, SSIS (SQL Server Integration Services).
What is Data Extraction?
Data Extraction is the process of reading the data from various types of source systems. The following are the different types of source used to define data extraction.
- ERP Sources.
- ORACLE applications.
- JD Edwards.
- People Soft.
- File Sources.
- XML Files.
- Flat files.
- Relational Sources.
- SQL server.
- Legacy sources.
- Main frames.
- As 400.
- Cobal files.
What is Data Transformation?
Data transformation is the process of transforming data and cleansing the data into the required business format. In this Data transformation stage, the following are the activities that take place.
- Data Cleansing.
- Data Scrubing.
- Data Aggregation.
- Data Merging.
What is Data Cleansing?
Data cleansing is the process of filtering the rows that contain null values, removes duplicate rows and removing blank (or) white spaces, it is also known as Data “Santization”.
What is Data Scrubbing?
Data scrubbing is the process of diverting new attributes or columns which are not available in source system.
What is Data aggregation?
Data aggregation is the process of calculation the summaries from detailed data.
What is Data merging?
Data merging is the process of Integrating the data from multiple OLTP source systems. Data merging operation can can be done in two ways :
It is the process of merging the records horizontally using Joins. Where Joints combines the fields from multiple data sources based on one or more fields.
It is the process of merging the records vertically when the two sources are having same metadata (Union). Union combines similar source definitions into a single result set.
What is Data Loading?
Data loading is the process of inserting the data into a target systems. Data loading can be done in two types.
Initial load or full Load
Initial load is the process of inserting source data rows into empty target table. At initial load, all the source data gets loaded into target.
Incremental load or Delta load
Incremental load is the process of inserting only new records after Initial load happens.