Lingua::JA::Summarize::Extract::Plugin::Parser::Ngram - an N-gram word parser
use strict;
use warnings;
use utf8;
use Lingua::JA::Summarize::Extract;
my $text = '日本語の文章を適当に書く。';
my $summary = Lingua::JA::Summarize::Extract->extract($text); # default plugin
print "$summary";
Parses the text into words using N-grams. The value of N can be set separately
for katakana, kanji, and Latin characters.
- latin_gram
  - N for Latin characters
- kana_gram
  - N for katakana characters
- han_gram
  - N for kanji characters
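To illustrate what a per-script N means, here is a minimal sketch of N-gram splitting. It is not the plugin's actual implementation: the helper name ngram_words, the fallback sizes, and the rule for runs shorter than N are all assumptions made for this example. Only the option names latin_gram, kana_gram, and han_gram come from the list above.

```perl
use strict;
use warnings;
use utf8;

# Hypothetical helper, not part of the module: find runs of Latin letters,
# katakana, or kanji and cut each run into overlapping N-grams.
sub ngram_words {
    my ($text, %opt) = @_;

    # Fallback sizes are assumptions for this sketch only;
    # the plugin's real defaults may differ.
    my $latin_n = $opt{latin_gram} // 4;
    my $kana_n  = $opt{kana_gram}  // 3;
    my $han_n   = $opt{han_gram}   // 2;

    my @words;
    # U+30A0..U+30FF is the Katakana block (it includes the long-vowel mark).
    while ($text =~ /([A-Za-z]+)|([\x{30A0}-\x{30FF}]+)|(\p{Han}+)/g) {
        my ($run, $n) = defined $1 ? ($1, $latin_n)
                      : defined $2 ? ($2, $kana_n)
                      :              ($3, $han_n);

        # Assumption: a run no longer than N is kept whole.
        if (length($run) <= $n) {
            push @words, $run;
            next;
        }
        # Slide a window of width N over the run, one character at a time.
        push @words, substr($run, $_, $n) for 0 .. length($run) - $n;
    }
    return @words;
}

binmode STDOUT, ':encoding(UTF-8)';
print join(' ', ngram_words('日本語のテキスト', han_gram => 2, kana_gram => 3)), "\n";
# prints: 日本 本語 テキス キスト
```

In this sketch, characters outside the three classes (hiragana, punctuation, digits) are simply skipped, which is why the particle の does not appear in the output.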
Kazuhiro Osawa <ko@yappo.ne.jp>
This library is free software; you can redistribute it and/or modify it under
the same terms as Perl itself.