Lingua::JA::Summarize::Extract - summary generator for Japanese
use strict;
use warnings;
use utf8;
use Lingua::JA::Summarize::Extract;
my $text = '日本語の文章を適当に書く。';
my $summary = Lingua::JA::Summarize::Extract->extract($text);
print $summary->as_string;
print "$summary";
# cuts short to 20 length
$summary->length(20);
print "$summary";
# mecab charset
my $extractor = Lingua::JA::Summarize::Extract->new({ mecab_charset => 'utf8' });
Lingua::JA::Summarize::Extract is a summary generator for Japanese text. The
extraction method can be changed with the plug-in mechanism.
- new([options])
- a object is made by using the options.
- extract(text[, options])
- text is summarized. blessed by using options if called direct. return to
Lingua::JA::Summarize::Extract::ResultSet object.
the content of processing can be changed by passing the constructor the options.
- plugins
- the processing of split of word and line and the scoring etc. can be done
by using another modules. please pass it by the ARRAY reference.
- rate
- the weight at scoring can be changed.
thing to refer to POD of each plugin when you want to examine
other options.
Kazuhiro Osawa <ko@yappo.ne.jp>
<http://gensen.dl.itc.u-tokyo.ac.jp/>, <http://www.ryo.com/getsen/>
This library is free software; you can redistribute it and/or modify it under
the same terms as Perl itself.
Hey! The above document had some coding errors, which are explained
below:
- Around line 86:
- Non-ASCII character seen before =encoding in
''日本語の文章を適当に書く。';'.
Assuming UTF-8