SimMetrics
From Wikipedia, the free encyclopedia
SimMetrics is an open source extensible library of similarity or distance metrics (also known as string metrics).
The SimMetrics open source library includes the following metrics
- Levenshtein distance,
- Block distance or city block distance or L2 distance,
- Cosine similarity,
- Jaccard index,
- Needleman-Wunsch algorithm or Sellers algorithm,
- Smith-Waterman algorithm,
- Gotoh distance or Smith-Waterman-Gotoh distance,
- Monge Elkan distance,
- Jaro distance,
- Jaro-Winkler,
- SoundEx distance,
- Matching coefficient,
- Dice’s coefficient,
- Jaccard similarity or Jaccard coefficient or Tanimoto coefficient,
- Overlap coefficient,
- Euclidean distance,
- Q-gram distance,
- and more.
SimMetrics provides a library of floating-point based (0.0-1.0) similarity measures between pairs of string data as well as the unnormalised metric output.
[edit] External links
- [1][2]
- Second String similar library

