GetReuse SDK provides functions for calculation of similarity of data. You can link GetReuse SDK libraries with your software, and so add similarity measurement functionality into your software.
As a proof of concept and as a test of suitability you can use:
Sample programs from a GetReuse SDK distribution
GetReuse software
Supported platforms:
The GetReuse SDK is written in a portable C programming language, so on your request we can compile and provide libraries for other platforms.
A value of the similarity is a value in range from 0 to 1. The closer value of the similarity to 1, the more similar are texts; the closer value of the similarity to 0, the more different are texts; the similarity 1 means that texts are equal; the similarity 0.90 roughly means that texts share 90% of common data and have 10% of unique data.
The formula for the calculation of the similarity is based on the scientific research (see page “References”). Any other “good” method of calculations should produce results that are equivalent in some terms to the GetReuse results.
© 2003, 2004 DataHansa software. All rights reserved.
All registered names are the property of their respective owners.