Categories
Java

Finding similarity percentage between Strings

In the field of text mining, one of the useful tools is to find the similarity percentage between two words for clustering or other purposes. Actually, I am not so familiar with text mining but it sounds quite interesting topic and I would like to do more study to find out about this field. However, I was working on a piece of code to find how much two names similar to each other and if the percentage is more than X (e.g. 80 %) , it is considered the names almost identical, else it prompts user.

I did little bit Googling and found bunch of useful materials as well as a sample Java code from Stack Overflow which I copied here with slight modifications,

For further reading regarding text mining please refer to the following links,

Exit mobile version