Scrappy::Queue - Scrappy HTTP Request Flow-Control System
#!/usr/bin/perl
use Scrappy::Queue;
my $queue = Scrappy::Queue->new;
$queue->add($url);
while (my $url = $queue->next) {
... $queue->add(...);
}
Scrappy::Queue provides a system for saving URLs to a recordset/queue and
iterating of them using the Scrappy framework.
The list method return the list of URLs in the queue. This is returned in list
context.
my $queue = Scrappy::Queue->new;
...
my @list = $queue->list;
The add method adds new URLs to the queue. Duplicate URLs will be ignored.
my $queue = Scrappy::Queue->new;
$queue->add($url);
The clear method completely empties the queue and resets the cursor (loop
position).
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
$queue->clear;
The reset method resets the cursor (loop position).
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
while (my $url = $queue->next) {
$queue->reset if ...; # beware the infinate loop
}
$queue->reset;
The current method returns the URL in the current loop position.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
while (my $url = $queue->next) {
last if ...;
}
print 'great' if $url eq $queue->current;
The next method moves the cursor to the next loop position and returns the URL.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
while (my $url = $queue->next) {
...
}
The previous method moves the cursor to the previous loop position and returns
the URL.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
while (my $url = $queue->next) {
...
}
print $queue->previous;
The first method moves the cursor to the first loop position and returns the
URL.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
print $queue->first;
The last method moves the cursor to the last loop position and returns the URL.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
print $queue->last;
The index method moves the cursor to the specified loop position and returns the
URL. The loop position is a standard array index position.
my $queue = Scrappy::Queue->new;
$queue->add(...);
$queue->add(...);
$queue->add(...);
print $queue->index(1);
The cursor method returns the current loop position.
my $queue = Scrappy::Queue->new;
print $queue->cursor;
Al Newkirk <awncorp@cpan.org>
This software is copyright (c) 2010 by awncorp.
This is free software; you can redistribute it and/or modify it
under the same terms as the Perl 5 programming language system itself.