Data Processing
ETL Concepts
ETL is an acronym for Extraction, Transformation and Loading. It is a broad domain covering the technologies used to extract data from disparate systems, clean and transform that data, and finally load it into one or more target systems.
ETL Design Pattern
An ETL design pattern is a generally reusable solution to a commonly occurring problem in the Extraction, Transformation and Loading (ETL) of data in a data warehousing environment. This section contains a number of articles on design patterns that commonly occur in data warehouse design.
Data Health Analysis
Data Health Analysis is an emerging domain that deals with measuring and establishing the quality of the collected data quantitatively.
Data Cleansing
Data cleansing, also called data cleaning or data scrubbing, is the process of detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database. Used mainly in databases, the term refers to identifying incomplete, incorrect, inaccurate, or irrelevant parts of the data and then replacing, modifying, or deleting this dirty or coarse data.
Master Data Management
Master data management (MDM) comprises the processes, governance, policies, standards and tools that consistently define and manage the critical data of an organization to provide a single point of reference.
Informatica
This section contains a number of articles and tutorials on the Informatica PowerCenter™ and PowerMart™ ETL tools.
SAP Data Services
SAP Data Services delivers a single enterprise-class solution for Data Integration, Data Quality, Data Profiling, and Text Data Processing that allows us to integrate, transform, improve, and deliver trusted data to critical business processes.
SSIS
This section contains a number of articles and tutorials on Microsoft SQL Server Integration Services. SSIS is a platform for data integration and workflow applications.
Big Data
Big data is a catch-phrase used to describe a massive volume of both structured and unstructured data that is so large it is difficult to process using traditional database and software techniques.
Pentaho Data Integration
This section contains a number of articles and tutorials on Pentaho Data Integration™, popularly known as the Kettle ETL tool.