Data Integration

The methods to extract, transform, process and load data from and to structured or unstructured sources

Big Data
Big data is a catch-phrase used to describe a massive volume of both structured and unstructured data that is so large it is difficult to process using traditional database and software techniques.
ETL Concepts
ETL is an acronym for Extraction, Transformation and Loading. This is an all-encompassing domain that deals with the technologies involved in extracting data from disparate systems, treating and cleaning those data, transforming the data and finally load it to one or more target systems.
ETL Design Pattern
ETL Design Pattern is a framework of generally reusable solution to the commonly occurring problems during Extraction, Transformation and Loading (ETL) activities of data in a data warehousing environment. This section contains number of articles that deal with various commonly occurring design patterns in any data warehouse design.
Data Health Analysis is an emerging domain that deals with measuring and establishing the quality of the collected data quantitatively.
Data Cleansing
Data cleansing, data cleaning or data scrubbing is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database. Used mainly in databases, the term refers to identifying incomplete, incorrect, inaccurate, irrelevant, etc. parts of the data and then replacing, modifying, or deleting this dirty data or coarse data.
Master Data Management Tutorial
Master data management (MDM) comprises the processes, governance, policies, standards and tools that consistently define and manage the critical data of an organization to provide a single point of reference
ETL Informatica
This section contains a number of articles and tutorials on Informatica PowerCenter™ and PowerMart™ ETL tool.
SAP BusinessObjects Data Services
SAP Data Services delivers a single enterprise-class solution for Data Integration, Data Quality, Data Profiling, and Text Data Processing that allows us to integrate, transform, improve, and deliver trusted data to critical business processes.
This section contains a number of articles and tutorials on Microsoft SQL Server Integration Services. SSIS is a platform for data integration and workflow applications.
Pentaho Data Integration
This section contains a number of articles and tutorials on Pentaho Data Integration™ popularly known as Kettle ETL tool.