|
NAMEtext_simlarity.pl - Measure the pair-wise similarity between files or stringsSYNOPSIStext_similarity.pl --type Text::Similarity::Overlaps --normalize --string '.......this is one' '????this is two' text_similarity.pl --type Text::Similarity::Overlaps --no-normalize --string '.......this is one' '????this is two' text_similarity.pl --type Text::Similarity::Overlaps --string 'sir winston churchill' 'Churchill, Winston Sir' text_similarity.pl --type Text::Similarity::Overlaps ../GPL.txt ../FDL.txt text_similarity.pl --verbose --type Text::Similarity::Overlaps ../GPL.txt ../FDL.txt text_similarity.pl --verbose --stoplist stoplist.txt --type Text::Similarity::Overlaps ../GPL.txt ../FDL.txt text_similarity.pl [[--verbose] [--stoplist=FILE] [--no-normalize] [--string]] --type=TYPE | --help | --version] FILE1 FILE2 DESCRIPTIONThis script is a simple command-line interface to the Text::Similarity Perl modules. A method for computing similarity must be specified via the --type option, and then that method is used to measure the similarity of two strings or two files.Text::Similarity::Overlaps measures similarity by counting the number of words that overlap (match) between the two inputs, without regard to order. So, all of the following strings would have the same pairwise similarity (they would each have a raw score of 4 relative to each other, meaning that 4 words are overlapping or matching). winston churchill was here here was winston churchill winston was here churchill By default Text::Similarity::Overlaps returns a normalized F-measure between 0 and 1. Normalization can be turned off by specifying --no-normalize. It returns various other overlap based scores if you specify --verbose. OPTIONS
AUTHORSTed Pedersen, University of Minnesota, Duluth tpederse at d.umn.edu Jason Michelizzi Ying Liu, University of Minnesota, Twin Cities liux0395 at umn.edu Last modified by: $Id: text_similarity.pl,v 1.1.1.1 2013/06/26 02:38:12 tpederse Exp $ BUGS
COPYRIGHT AND LICENSECopyright (C) 2004-2010, Jason Michelizzi, Ted Pedersen and Ying LiuThis program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA
Visit the GSP FreeBSD Man Page Interface. |