Lingua::ZH::Segment - Chinese Text Segmentation
This document describes version 0.01 of Lingua::ZH::Segment, released March 10,
2005.
use Lingua::ZH::Segment;
print segment('降龍18掌'); # 降 龍 18 掌
This module currently only break chinese text into single character (Chinese
word), it will not break up any alphabet.
Currently, only "segment" is available.
Cheng-Lung Sung <clsung@tw.freebsd.org>
Hsin-Chan Chien for inspiring me about Encode::Guess.
Copyright 2005 by Cheng-Lung Sung <clsung@tw.freebsd.org>
This program is free software; you can redistribute it and/or
modify it under the same terms as Perl itself.
See <http://www.perl.com/perl/misc/Artistic.html>
Hey! The above document had some coding errors, which are explained
below:
- Around line 24:
- Non-ASCII character seen before =encoding in
'segment('降龍18掌');'. Assuming UTF-8