Mario

Duplicate Entity Identification

Is it possible to run a duplicate entity identification analysis across multiple attributes, using algorithms such as Soundex or Metaphone?

  • Hello,
    this is actually the focus of our main product, DQC.

    In DQ Analyzer, there are functions to transform the data using Soundex, Metaphone or Double Metaphone. We have found Soundex a bit outdated for analysis, but combining the Metaphone variants can be very handy, depending on the data.

    You can use the functions anywhere in the Plan created in the Create Profile dialog, and you can add the result of the transformation as another column for analysis. If you do not have much data, you can roll up or analyze on the concatenation of the phonetically encoded names and other attributes; that key is strong enough for blocking during the matching itself.
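
To illustrate the idea of a blocking key built from phonetic codes plus other attributes, here is a minimal Python sketch. It is not DQ Analyzer's API or expression syntax. It uses a hand-written American Soundex only because that fits in a few lines (Metaphone or Double Metaphone would slot into the same place), and the field names `first_name`, `last_name` and `zip` are hypothetical.

```python
from collections import defaultdict

# Standard American Soundex digit groups; vowels, H, W and Y carry no digit.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name):
    """Return the 4-character Soundex code of name, or "" if it has no letters."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    encoded = letters[0]
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = _CODES.get(ch, "")
        if code and code != prev:
            encoded += code
        # H and W do not separate equal codes; vowels do.
        if ch not in "HW":
            prev = code
    return (encoded + "000")[:4]


def blocking_key(record):
    """Concatenate phonetic codes of the names with another attribute.

    The field names are assumptions made for this example.
    """
    return "|".join((soundex(record["first_name"]),
                     soundex(record["last_name"]),
                     record["zip"].strip()))


def candidate_groups(records):
    """Group records by blocking key; keep groups with more than one record."""
    blocks = defaultdict(list)
    for record in records:
        blocks[blocking_key(record)].append(record)
    return [group for group in blocks.values() if len(group) > 1]


records = [
    {"first_name": "Jon", "last_name": "Smith", "zip": "12345"},
    {"first_name": "John", "last_name": "Smyth", "zip": "12345"},
    {"first_name": "Mary", "last_name": "Jones", "zip": "54321"},
]
groups = candidate_groups(records)
```

In this sketch the two Smith/Smyth records fall into the same block, so only they would need a detailed comparison in the matching step; the third record is never compared against them.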
