AI::Categorizer::Collection::Files - One document per file
my $c = new AI::Categorizer::Collection::Files
(path => '/tmp/docs/training',
category_file => '/tmp/docs/cats.txt');
print "Total number of docs: ", $c->count_documents, "\n";
while (my $document = $c->next) {
...
}
$c->rewind; # For further operations
This implements a Collection class in which each document exists as a single
file on a filesystem. The documents can exist in a single directory, or in
several directories.
This is a subclass of the abstract AI::Categorizer::Collection class, so any
methods mentioned in its documentation are available here.
- new()
- Creates a new Collection object and returns it. In addition to the
parameters accepted by the superclass, the following parameters are
accepted:
- path
- Indicates a location on disk where the documents can be found. The path
may be specified as a string giving the name of a directory, or as a
reference to an array of such strings if the documents are located in more
than one directory.
- recurse
- Indicates whether subdirectories of the directory (or directories) in the
"path" parameter should be descended
into. If set to a true value, they will be descended into. If false, they
will be ignored. The default is false.
Ken Williams, ken@mathforum.org
Copyright 2002-2003 Ken Williams. All rights reserved.
This library is free software; you can redistribute it and/or
modify it under the same terms as Perl itself.
AI::Categorizer::Collection(3)