Dave's Work at Language Analysis Systems, Inc.

Here's a little comic strip to illustrate some of the issues we dealt with in designing multi-lingual name-search algorithms when I worked at Language Analysis Systems back in the 90s:



This obviously refers to bizarre spellings from Poland, Hungary, and elsewhere in Eastern Europe like the following:
KASPRZYK   PRZYTYCKA   KRZYSZTOF   CZAJKOWSKI   VCZRNIK   MIKOLAJCZYK   HUSZTHY   ESEPGRCSEPREGI   DRCZHHONY
Names like these often have several variant spellings, which can be difficult to retrieve using standard search algorithms like Soundex. For example, you may recognize the fourth item above as the Polish spelling of the famous Russian composer's name (Tchaikovsky, or Tschaikowsky in the German transliteration). We have prototyped a number of interesting search algorithms that deal with this problem using linguistic principles.

For example, check out this cool phonetic name-search demo at our company website! Highlight one of the names below (strange spellings of the names of people in my extended family) and copy it with Ctrl-C before going over to the page. Then you can just paste it into the search window with Ctrl-V. Or just type in your own name, or a name you know to be problematic in automatic searches.
HAIRUS      WULF         RAWSZ        FREKEY
GARFELT     BURGESSIN    GAWRFEELD    BOGMYRE
STEPHAN     HENSON       HENTDSEN     HENDERSON
BURGISON    GARFELT      PEGMYRE      HERISZ
HARRISON    GARRISON     WULFE        FREAKY
TEPHANEY    MAIKUL       LEESUH       AIMEE      MATTIEU
LENSEA      SZTIFFUKNEE  SITKNEE      DALOOR
DILLON      QANDOES      DANA         BOBANA     TENA
FRECK       FRACK        LOGAN        RAWBURDT
LOWGUN      STIEWEN      SCHTIEWAN    DEYVUD
KNIGHTLEY   KNIGHT       KNECHT       NIKTE
WRIGHT      WREN         RHETT        REDD     
OUELSENNE   TSCHADMANN   EMHEMMID     HENTSEN
BRIGHT
P.S. You may want to set the maximum-number-of-returns button to 60 instead of the default 20 so that you get back more hits.
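
To see why Soundex struggles with variant spellings like these, here is a minimal sketch of standard American Soundex in Python. It is only the textbook baseline, not one of the algorithms we prototyped, and the function name `soundex` and the table `CODES` are my own choices for illustration.

```python
# Standard American Soundex: keep the first letter, then encode
# consonants as digits, collapsing runs of the same digit.
CODES = {}
for digit, letters in enumerate(["BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"], start=1):
    for ch in letters:
        CODES[ch] = str(digit)


def soundex(name):
    """Return the 4-character Soundex code for a name (hypothetical helper)."""
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    first = letters[0]
    result = first
    prev = CODES.get(first, "")  # the first letter's code also suppresses repeats
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not break a run of same-coded consonants
        code = CODES.get(ch, "")  # vowels and Y have no code
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated consonant is coded again
    return (result + "000")[:4]  # pad with zeros, truncate to 4 characters


for spelling in ["CZAJKOWSKI", "TSCHAIKOWSKY", "TCHAIKOVSKY"]:
    print(spelling, soundex(spelling))
# CZAJKOWSKI C220
# TSCHAIKOWSKY T222
# TCHAIKOVSKY T221
```

Because Soundex always keeps the first letter as written, the Polish spelling lands in a different bucket from the other two, so a search keyed on one code never sees the others. The same thing happens with pairs from the list above, such as KNIGHT (K523) and NIKTE (N230).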

Updated April 1st, 2002