In the field of text mining, one of the useful tools is to find the similarity percentage between two words for clustering or other purposes. Actually, I am not so familiar with text mining but it sounds quite interesting topic and I would like to do more study to find out about this field. However, I was working on a piece of code to find how much two names similar to each other and if the percentage is more than X (e.g. 80 %)
, it is considered the names almost identical, else it prompts user.
I did little bit Googling and found bunch of useful materials as well as a sample Java code from Stack Overflow which I copied here with slight modifications,
For further reading regarding text mining please refer to the following links,