When importing new addresses into my DB, I do a spellcheck to see if the street already exists (i.e. the new street is only spelled wrong).
We are currently using the Levenshtein method in a MySQL query to find similar street names. The problem is street numbers. Having street numbers in the address really slows down the similarity search / spellchecking.
Example:
Street abc 34
Street abc 37
Street abc 39
These street names are spelled correctly, but the Levenshtein method thinks they are misspellings of each other because only the street numbers differ.
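To illustrate, here is a minimal sketch in Python (not our actual MySQL code; `levenshtein` below is just a textbook implementation written for this example). It shows that two correctly spelled addresses with different numbers get the same distance as a genuine misspelling, so the distance alone cannot tell them apart.

```python
def levenshtein(a: str, b: str) -> int:
    """Textbook dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty prefix of b
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


# Same street, different numbers: distance 1
print(levenshtein("Street abc 34", "Street abc 37"))
# Same number, street name really misspelled: also distance 1
print(levenshtein("Street abc 34", "Stret abc 34"))
```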
We have developed a PHP function that takes everything from the first digit onwards and puts it in another column.
This works fine for streets that have the street number at the end, but it will not work for countries where the street number comes first.
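Roughly, the split works like the sketch below. This is a Python approximation for illustration, not our actual PHP function, and the name `split_at_first_digit` is made up for this example. It also shows how the approach breaks when the number comes first.

```python
import re
from typing import Tuple


def split_at_first_digit(address: str) -> Tuple[str, str]:
    """Split an address into (street name, everything from the first digit on)."""
    match = re.search(r"\d", address)
    if match is None:
        # No digit at all: the whole string is the street name.
        return address.strip(), ""
    street = address[:match.start()].strip()
    number = address[match.start():].strip()
    return street, number


# Number at the end: the street name is isolated as intended.
print(split_at_first_digit("Street abc 34"))
# Number at the start: the street name comes back empty and the
# whole address ends up in the "number" part.
print(split_at_first_digit("34 Street abc"))
```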
I'm wondering if anyone else has worked on similar problems?
Update
The solution is for a store locator website, and I'm currently working on the module that will import store lists.
One possible solution is to use the Google Maps API and see whether it returns a geocoded address.