NAME
Boulder::Unigene - Fetch Unigene data records as parsed Boulder Stones

SYNOPSIS
    use Boulder::Unigene;

    # parse a file of Unigene records
    my $ug = new Boulder::Unigene(-accessor => 'File',
                                  -param    => '/data/unigene/Hs.dat');
    while (my $s = $ug->get) {
      print $s->Identifier;
      print $s->Gene;
    }

    # parse flatfile records yourself
    open (UG, "/data/unigene/Hs.dat") or die "Cannot open Hs.dat: $!";
    local $/ = "*RECORD*";
    while (<UG>) {
      my $s = Boulder::Unigene->parse($_);
      # etc.
    }

DESCRIPTION
Boulder::Unigene provides retrieval and parsing services for NCBI Unigene records. It returns Unigene entries in Stone format, allowing easy access to the various fields and values. Boulder::Unigene is a descendant of Boulder::Stream, and provides a stream-like interface to a series of Stone objects.

Access to Unigene is provided by one accessor, which gives access to a local Unigene database. When you create a new Boulder::Unigene stream, you provide the accessor, along with accessor-specific parameters that control what entries to fetch. The accessor is File, as used in the SYNOPSIS; its parameter there is the path to a local flat file.
It is also possible to parse a single Unigene entry from a text string stored in a scalar variable, returning a Stone object.

Boulder::Unigene methods
This section lists the public methods that the Boulder::Unigene class makes available.

OUTPUT TAGS
The tags returned by the parsing operation are taken from the names shown in the flat file Hs.dat, since the database producer does not yet provide a better description of them.

Top-Level Tags
These are tags that appear at the top level of the parsed Unigene entry.

STS Multiple records in the form ^STS ACC=XXXXXX NAME=YYYYYY
TXMAP Multiple records in the form ^TXMAP XXXXXXX; MARKER=YYYYY; RHPANEL=ZZZZ

The TXMAP tag points to a Stone record that contains multiple subtags. Each subtag is the name of a feature which points, in turn, to a Stone that describes the feature's location and other attributes.

Each feature will contain one or more of the following subtags:
PROTSIM Multiple records in the form ^PROTSIM ORG=XXX; PROTID=DBID:YYY; PCT=ZZZ; ALN=QQQQ

Here DBID is PID to indicate a GenPept identifier, SP to indicate a SWISSPROT identifier, PIR to indicate a PIR identifier, and PRF to indicate ???
SEQUENCE Multiple records in the form ^SEQUENCE ACC=XXX; NID=YYYY; PID = CLONE= END= LID=
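
The record forms above are lists of KEY=VALUE pairs separated by semicolons. As an illustration only, the following sketch splits one PROTSIM line of the documented form into a hash using plain Perl. The helper name parse_protsim_line and the sample values are hypothetical; this is not the parser used by Boulder::Unigene, and it assumes the leading "^" in the form shown above may or may not be present in the input.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of Boulder::Unigene: split one PROTSIM
# line of the documented form into a hash of field => value.
sub parse_protsim_line {
    my ($line) = @_;
    $line =~ s/\s+$//;                  # drop trailing whitespace/newline
    $line =~ s/^\^?PROTSIM\s+//;        # drop the tag, with or without "^"
    my %field;
    for my $pair (split /\s*;\s*/, $line) {
        my ($key, $value) = split /\s*=\s*/, $pair, 2;
        next unless defined $value;     # skip anything that is not KEY=VALUE
        $field{$key} = $value;
    }
    # PROTID has the form DBID:YYY; expose the two halves separately.
    if (defined $field{PROTID}) {
        @field{qw(DBID ID)} = split /:/, $field{PROTID}, 2;
    }
    return \%field;
}

# Made-up sample values, for illustration only.
my $rec = parse_protsim_line(
    'PROTSIM ORG=Mus musculus; PROTID=SP:P12345; PCT=98; ALN=120');
print "$rec->{ORG} $rec->{DBID} $rec->{ID} $rec->{PCT}\n";
```

When the module itself is used, the same values should be read from the Stone returned by get or parse rather than re-parsed by hand; the sketch only shows how the documented line format breaks down into fields.
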

SEE ALSO
Boulder, Boulder::Blast, Boulder::Genbank

AUTHOR
Lincoln Stein <lstein@cshl.org>.
Luca I.G. Toldo <luca.toldo@merck.de>

Copyright (c) 1997 Lincoln D. Stein
Copyright (c) 1999 Luca I.G. Toldo

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. See DISCLAIMER.txt for disclaimers of warranty.