Re: slightly OT - cleaning up "dirty" keys?
sybrandb_at_yahoo.com wrote:
> Try the SOUNDEX function available in Oracle on your data.
> It is heavily English-oriented, but that doesn't seem to be a problem
> in your case.
> SOUNDEX will provide the 'phonetic' representation of a name.
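
For readers unfamiliar with what that 'phonetic' representation looks like, here is a rough sketch of the classic American Soundex rules in Python. It is an illustration only: the function name `soundex` and the table `CODES` are made up for this example, and Oracle's built-in SOUNDEX may differ in edge cases (non-ASCII input, empty strings).

```python
# Sketch of classic American Soundex; not Oracle's actual implementation.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(name):
    """Return a 4-character code: first letter plus up to three digits."""
    # Assumption: anything outside A-Z is simply ignored.
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not break a run of equal codes
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept
    return (result + "000")[:4]  # pad with zeros, truncate to 4


for n in ("Smith", "Smyth", "Robert", "Rupert"):
    print(n, soundex(n))
```

Names that sound alike in English collapse to the same code, e.g. "Smith" and "Smyth" both give S530, and "Robert" and "Rupert" both give R163.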
That would be a useful "distance" measure.
It would let me match user input against dirty data, but it doesn't help me cluster (and re-normalise) the existing cruddy data.
BugBear
Received on Thu Mar 02 2006 - 04:12:50 CST