GSP
Quick Navigator

Search Site

Unix VPS
A - Starter
B - Basic
C - Preferred
D - Commercial
MPS - Dedicated
Previous VPSs
* Sign Up! *

Support
Contact Us
Online Help
Handbooks
Domain Status
Man Pages

FAQ
Virtual Servers
Pricing
Billing
Technical

Network
Facilities
Connectivity
Topology Map

Miscellaneous
Server Agreement
Year 2038
Credits
 

USA Flag

 

 

Man Pages
Keywords(3) User Contributed Perl Documentation Keywords(3)

Lingua::ZH::Keywords - Extract keywords from Chinese text

    # Exports keywords() by default
    use Lingua::ZH::Keywords;

    print join(",", keywords($text));       # Prints five keywords
    print join(",", keywords($text, 10));   # Prints ten keywords

This is a very simple algorithm which removes stopwords from the text, and then counts up what it considers to be the most important keywords. The "keywords" subroutine returns a list of keywords in order of relevance.

The stopwords list is accessible as @Lingua::ZH::Keywords::StopWords.

If the input $text is an Unicode string, the returned keywords will also be Unicode strings; otherwise they are assumed to be Big5-encoded bytestrings.

Lingua::ZH::TaBE, Lingua::EN::Keywords

Algorithm adapted from the Lingua::EN::Keywords module by Simon Cozens, <simon@simon-cozens.org<gt>.

Autrijus Tang <autrijus@autrijus.org>

Copyright 2003 by Autrijus Tang <autrijus@autrijus.org>.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

See <http://www.perl.com/perl/misc/Artistic.html>

2003-01-20 perl v5.32.1

Search for    or go to Top of page |  Section 3 |  Main Index

Powered by GSP Visit the GSP FreeBSD Man Page Interface.
Output converted with ManDoc.