Question

  Asked: May 11 2005   2:03 AM GMT
  Asked by: szppt8


Conversion of de-duplicated Customer Numbers due to source system change


CRM, Customer relationship management applications, Data warehousing applications, IBM, Oracle

We have a Data Warehouse which de-duplicates Customers from 3 operational systems to enable more accurate customer reporting, data mining & CRM activities. We have several month-end snapshots history.
Our organisation is building a new Operation Data Store which will now take over the de-dupe & key persistency functions. The ODS uses more up to date techniques for de-duping so it gets different results.
We want to align the DW with the ODS so operations and analytics functions have the same view of the customer but because the de-dupe results are different, we can?t just have a straight swap of keys. For example, DW matches 3 source IDs together, yet ODS matches 4 source IDs together. We estimate that about 5% of source IDs will change keys each month.
What?s the best way to convert our Customer Keys to:
(a) minimise impact to history (b) retain usability and (c) retain a proper star schema.
We will consider different solutions for conversion of history versus ongoing processing.

Subscribe to Alerts! Get questions and answers delivered to your Inbox.


E-mail me updates on this question



   SUBSCRIBE

hidden modal window

Answer Wiki (Improve, edit or add to this answer)


 RATE THIS ANSWER
0
Click to Vote:
  •   0
  •  0



With respect to "history," there would seem to to two cases: 1) situations in which you want to be able to report history based on "truth" as you then saw it, and 2) situations in which you want to report based on the improved version of "truth" made possible by better "deduping." As an example of the first case, if someone used the existing data warehouse for some form of government reporting, and there might be a need for future drilldown, you want to have consistency, so keep an "old model" DW running. On the other hand, if you are using the data warehouse for, say, some form of sales compensation based on this year versus last year, you probably want to run the previous year's data through the "better" deduping process so that you have apples to apples comparison between last year and this year. Going forward, you presumably want to move to the better version of deduping.

It isn't obvious why a difference in deduping would impact your star schema.
  • AddThis Social Bookmark Button

Browse more Questions and Answers on CRM, DataManagement and DataCenter.

Looking for relevant CRM Whitepapers? Visit the SearchCRM.com Research Library.


Discuss This Answer


You must be logged-in to discuss a question. Log-in/Register