I have an array of 17,000 strings. Many of the strings have similar matches, for example:
User Report XYZ123 Bob Smith User Report YEI723 User Report User Report Number of Hits 27 Frank's Weekly Transaction Report Transaction Report 123
What is the best way to find the top "similar strings"? For instance, using the example above, I would want to see "User Report" and "Transaction Report" as two of the top "similar strings".