Source Data Analysis

Profile, Measure, and Assess Risks

A thorough look at data generated at the point of origin

Baseline’s Source Data Analysis provides you with a comprehensive assessment, profile, and comparison of data on one or more key source systems.

Baseline uses profiling tools and statistical measurement to give you a structured way to understand what data exists, how proprietary it is, its accuracy, its relevance, and whether it should be included in your company’s systems of record catalog.

The service can be delivered in combination with other Baseline offerings, such as the Data Quality Audit, to give you a sustained approach to data acquisition, definition, standardization, and reconciliation. Or it can be delivered separately as a litmus test for evaluating data on a source system prior to the often costly activity of acquiring it.

» Your Challenges
» The Problem
» The Baseline Approach
» Your Value
» Why Baseline

Your Challenges

  • Data quality issues
  • Multiple sources generating same data
  • Non-existent documentation of source systems
  • Data models and file structures not understood
  • Lack of knowledge about core data elements
  • Mismatch between source system data and developers’ definition
  • Analyzing multiple data sources for a high profile IT project

» Back to top of page

The Problem

Inability to understand source data imperils many an IT project

There have been so many IT projects brought to their knees due to the inability of developers to understand source data. The causes range from inadequate source system documentation to the lack of resources available to support operational systems to source system owners refusing to share information from proprietary or homegrown systems.

Few companies have source system owners who are specifically responsible for deploying quality data to systems and users across the organization. Moreover, the acknowledged owners of these systems aren’t necessarily intimate with the data. Therefore, source systems often remain a “no man’s land” when it comes to understanding and using key business data.

The ripple effects can be significant, often imperiling project timeframes and objectives.
Before you begin your next IT project – whether business intelligence, data warehousing, application development, or data integration – understanding source data issues could be critical to your success.

» Back to top of page

The Baseline Approach

Combining data profiling with business program risk assessment

It all begins with the “systems of origin” that generate data in the first place. These systems, while essential to business operations, are often the culprits when it comes to lack of standardized, reconciled, or clean data, and the downstream effects on the business.

Delivering data-enabled solutions to business means understanding your company’s data quality, data integration, and data management challenges. And this means getting to the “root cause” of the issue—going straight to the source, the operational systems that originate data.

Baseline’s Source Data Analysis service completes a comprehensive analysis of your data using a profiling tool compatible with your current operational and technical environment. The profiling tool does the “heavy lifting”—sifting through the data and doing the statistical comparisons.

Then Baseline generates a source system profile that includes measurements of important qualifiers such as the percentage of null values in key fields, the degree of data replication, the percentage of formatting discrepancies, and the distribution of values.

We conclude the Source Data Analysis with a risk assessment at both a micro- and macro-level—how data and metadata within the source system may be compromised and how this places business programs at risk. The Source Data Analysis alerts your IT project sponsors to the potential impact of poorly understood, missing, inaccurate, or proprietary source data.

» Back to top of page

Your Value

Data assessment as an early warning system can streamline IT projects and avoid wasted work effort

Source Data Analysis helps you avoid wasted work effort caused by incorrect assumptions about the source system and its data. IT projects benefit from streamlining data acquisition, simplification of ETL and data cleanup, and rerouting applications to other systems that may generate more relevant, accurate, or timely information.

Upon completion of the Source Data Analysis, you are fully ready to decide if:

  • The operational system(s) in questions are candidates to serve as data sources for a data warehouse, business intelligence, customer data integration (CDI), or master data management program.
  • Source system data should be replicated across more than one source.
  • Source systems are outdated and need to be replaced.
  • ETL jobs accessing the source systems should be modified, replaced, or targeted to alternative systems.
  • Data generated by the source system is relevant to the target system or application.

» Back to top of page

Why Baseline

Helping you make data assessment a permanent part of your IT infrastructure

Baseline advocates understanding the root causes of data issues before turning to automation. Source Data Analysis offers your company a thorough look at the data generated at the point of origin, often the most difficult type of data assessment a company can undertake.

Moreover, Baseline can help you leverage Source Data Analysis to address organizational challenges that historically plague many companies.
A common recommendation from the Source Data Analysis is the creation of a new role, Source Data Steward, for systems that are either highly complex or in demand by other business applications.

Another typical outgrowth is that companies incorporate data assessment—a data profiling tool and analysis process—as a permanent part of their source system infrastructures.

Baseline Consulting excels at helping companies deploy integrated and meaningful business information to departments, individuals, and subsidiaries. We not only understand the applications that use data, but we also specialize in the methods to acquire and integrate data.

» Back to top of page

 

To request more information, contact us via e-mail or call us at 1-818-906-7638.
 

August 18, 2008. TDWI Conference, San Diego. BI from Both Sides with Jill Dyché.

September 22, 2008. IDQ Conference, San Antonio. How to Use Six Sigma to Improve Data Quality & Quantify Data Quality Improvement with Joy Medved

» See our full schedule
 

The Need for Data Profiling in CDI.
Discover how data profiling takes on a whole new dimension when it’s applied to your CDI solution. As CDI becomes central to a company’s overall information strategy, Data Profiling becomes a crucial step in providing “a single version of the truth.”
» Read the article

» Browse our Articles &
   White Papers
 

© 2008 Baseline Consulting Group, Inc. All Rights Reserved.