NAME

Convert::BinHex - extract data from Macintosh BinHex files

ALPHA WARNING: this code is currently in its Alpha release. Things may change drastically until the interface is hammered out: if you have suggestions or objections, please speak up now!

SYNOPSIS

Simple functions:

    use Convert::BinHex qw(binhex_crc macbinary_crc);

    # Compute HQX7-style CRC for data, pumping in old CRC if desired:
    $crc = binhex_crc($data, $crc);

    # Compute the MacBinary-II-style CRC for the data:
    $crc = macbinary_crc($data, $crc);

Hex to bin, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Hex2Bin") which keeps internal conversion state:

    # Create and use a "translator" object:
    my $H2B = Convert::BinHex->hex2bin;    # get a converter object
    while (<STDIN>) {
        print STDOUT $H2B->next($_);       # convert some more input
    }
    print STDOUT $H2B->done;               # no more input: finish up

Hex to bin, OO interface. The following operations must be done in the order shown!

    # Read data in piecemeal:
    $HQX = Convert::BinHex->open(FH=>\*STDIN) || die "open: $!";
    $HQX->read_header;                  # read header info
    @data = $HQX->read_data;            # read in all the data
    @rsrc = $HQX->read_resource;        # read in all the resource

Bin to hex, low-level interface. Conversion is actually done via an object ("Convert::BinHex::Bin2Hex") which keeps internal conversion state:

    # Create and use a "translator" object:
    my $B2H = Convert::BinHex->bin2hex;    # get a converter object
    while (<STDIN>) {
        print STDOUT $B2H->next($_);       # convert some more input
    }
    print STDOUT $B2H->done;               # no more input: finish up

Bin to hex, file interface. Yes, you can convert to BinHex as well as from it!
    # Create new, empty object:
    my $HQX = Convert::BinHex->new;

    # Set header attributes:
    $HQX->filename("logo.gif");
    $HQX->type("GIFA");
    $HQX->creator("CNVS");

    # Give it the data and resource forks (either can be absent):
    $HQX->data(Path => "/path/to/data");       # here, data is on disk
    $HQX->resource(Data => $resourcefork);     # here, resource is in core

    # Output as a BinHex stream, complete with leading comment:
    $HQX->encode(\*STDOUT);

PLANNED!!!! Bin to hex, "CAP" interface. Thanks to Ken Lunde for suggesting this.

    # Create new, empty object from CAP tree:
    my $HQX = Convert::BinHex->from_cap("/path/to/root/file");
    $HQX->encode(\*STDOUT);

DESCRIPTION

BinHex is a format used by Macintosh for transporting Mac files safely through electronic mail, as short-lined, 7-bit, semi-compressed data streams. This module provides a means of converting those data streams back into binary data.

FORMAT

(Some text taken from RFC-1741.) Files on the Macintosh consist of two parts, called forks: the data fork and the resource fork.
Additional information regarding Macintosh files is stored by the Finder in a hidden file, called the "Desktop Database".

Because of the complications in storing different parts of a Macintosh file in a non-Macintosh filesystem that only handles consecutive data in one part, it is common to convert the Macintosh file into some other format before transferring it over the network.

The BinHex format squashes that data into transmittable ASCII as follows: the file is written out as a structured 8-bit bytestream, the bytestream is compressed to reduce duplicate bytes, and the compressed stream is encoded as 7-bit ASCII.
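The compression layer can be illustrated with a short sketch. It assumes the run-length scheme commonly documented for BinHex 4.0, in which the byte 0x90 is a marker: a byte followed by 0x90 and a count N stands for N copies of that byte in total, and 0x90 followed by 0 stands for a literal 0x90. The subroutine name rle_expand_sketch is hypothetical and is not part of this module. Unlike the real converters, it expects the whole compressed string at once and keeps no state between calls.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of Convert::BinHex.
# Assumption: 0x90 is the run marker, "X 0x90 N" means N copies of X in total,
# and "0x90 0x00" means a literal 0x90 byte.
sub rle_expand_sketch {
    my ($in) = @_;
    my @bytes = unpack 'C*', $in;
    my $out   = '';
    my $last;                                  # most recently emitted byte
    while (@bytes) {
        my $byte = shift @bytes;
        if ($byte != 0x90) {                   # ordinary byte: copy it through
            $last = chr $byte;
            $out .= $last;
            next;
        }
        my $count = shift @bytes;
        die "truncated run marker\n" unless defined $count;
        if ($count == 0) {                     # escaped literal 0x90
            $last = chr 0x90;
            $out .= $last;
        }
        else {                                 # one copy is already in $out
            die "run with no preceding byte\n" unless defined $last;
            $out .= $last x ($count - 1);
        }
    }
    return $out;
}

# "B" followed by the marker and a count of 4 expands to four "B"s.
print rle_expand_sketch("AB\x90\x04C"), "\n";
```

A streaming converter would additionally have to remember $last, and whether a marker was seen at the very end of a chunk, between calls.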
FUNCTIONS

CRC computation
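To make "pumping in the old CRC" concrete, here is a minimal, self-contained sketch of an incremental 16-bit CRC. It assumes the CCITT polynomial 0x1021 with an initial value of 0, which is the scheme commonly described for BinHex 4.0. The subroutine name crc16_sketch is hypothetical and is not part of Convert::BinHex, and no claim is made that it matches binhex_crc() in every detail.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of Convert::BinHex.
# Assumption: CRC-16 with polynomial 0x1021, initial value 0, MSB first.
sub crc16_sketch {
    my ($data, $crc) = @_;
    $crc = 0 unless defined $crc;          # start a fresh CRC if none is passed
    for my $byte (unpack 'C*', $data) {
        $crc ^= $byte << 8;                # fold the next byte into the high bits
        for (1 .. 8) {
            $crc = ($crc & 0x8000) ? (($crc << 1) ^ 0x1021) : ($crc << 1);
            $crc &= 0xFFFF;                # keep the register 16 bits wide
        }
    }
    return $crc;
}

# Feeding the data in pieces gives the same result as feeding it all at once.
my $all_at_once = crc16_sketch("123456789");
my $piecemeal   = crc16_sketch("6789", crc16_sketch("12345"));
printf "%04X %04X\n", $all_at_once, $piecemeal;
```

Because the only state carried between calls is the 16-bit value itself, a caller can checksum a fork of any size while reading it in small chunks.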
OO INTERFACE

Conversion
Construction
Get/set header information
Decode, high-level
Encode, high-level
SUBMODULES

Convert::BinHex::Bin2Hex

A BINary-to-HEX converter. This kind of conversion requires a certain amount of state information; it cannot be done by just calling a simple function repeatedly. Use it like this:

    # Create and use a "translator" object:
    my $B2H = Convert::BinHex->bin2hex;    # get a converter object
    while (<STDIN>) {
        print STDOUT $B2H->next($_);       # convert some more input
    }
    print STDOUT $B2H->done;               # no more input: finish up

    # Re-use the object:
    $B2H->rewind;                          # ready for more action!
    while (<MOREIN>) { ...

On each iteration, "next()" (and "done()") may return either a decent-sized non-empty string (indicating that more converted data is ready for you) or an empty string (indicating that the converter is waiting to amass more input in its private buffers before handing you more stuff to output). Note that "done()" always converts and hands you whatever is left.

This may have been a good approach. It may not. Someday, the converter may also allow you to give it an object that responds to read(), or a FileHandle, and it will do all the nasty buffer-filling on its own, serving you stuff line by line:

    # Someday, maybe...
    my $B2H = Convert::BinHex->bin2hex(\*STDIN);
    while (defined($_ = $B2H->getline)) {
        print STDOUT $_;
    }

Someday, maybe. Feel free to voice your opinions.

Convert::BinHex::Hex2Bin

A HEX-to-BINary converter. This kind of conversion requires a certain amount of state information; it cannot be done by just calling a simple function repeatedly. Use it like this:

    # Create and use a "translator" object:
    my $H2B = Convert::BinHex->hex2bin;    # get a converter object
    while (<STDIN>) {
        print STDOUT $H2B->next($_);       # convert some more input
    }
    print STDOUT $H2B->done;               # no more input: finish up

    # Re-use the object:
    $H2B->rewind;                          # ready for more action!
    while (<MOREIN>) { ...
On each iteration, "next()" (and "done()") may return either a decent-sized non-empty string (indicating that more converted data is ready for you) or an empty string (indicating that the converter is waiting to amass more input in its private buffers before handing you more stuff to output). Note that "done()" always converts and hands you whatever is left.

Note that this converter does not find the initial "BinHex version" comment. You have to skip that yourself. It only handles data between the opening and closing ":".

Convert::BinHex::Fork

A fork in a Macintosh file.

    # How to get them...
    $data_fork = $HQX->data;         # get the data fork
    $rsrc_fork = $HQX->resource;     # get the resource fork

    # Make a new fork:
    $FORK = Convert::BinHex::Fork->new(Path => "/tmp/file.data");
    $FORK = Convert::BinHex::Fork->new(Data => $scalar);
    $FORK = Convert::BinHex::Fork->new(Data => \@array_of_scalars);

    # Get/set the length of the data fork:
    $len = $FORK->length;
    $FORK->length(170);              # this overrides the REAL value: be careful!

    # Get/set the path to the underlying data (if in a disk file):
    $path = $FORK->path;
    $FORK->path("/tmp/file.data");

    # Get/set the in-core data itself, which may be a scalar or an arrayref:
    $data = $FORK->data;
    $FORK->data($scalar);
    $FORK->data(\@array_of_scalars);

    # Get/set the CRC:
    $crc = $FORK->crc;
    $FORK->crc($crc);

UNDER THE HOOD

Design issues
How it works

Since BinHex is a layered format, consisting of...

    A Macintosh file [the "BIN"]...
    Encoded as a structured 8-bit bytestream, then...
    Compressed to reduce duplicate bytes, then...
    Encoded as 7-bit ASCII [the "HEX"]

...there is a layered parsing algorithm to reverse the process. Basically, it works in a similar fashion to stdio's fread():

    0. There is an internal buffer of decompressed (BIN) data, initially empty.
    1. Application asks to read() n bytes of data from object.
    2. If the buffer is not full enough to accommodate the request:
        2a. The read() method grabs the next available chunk of input data (the HEX).
        2b. HEX data is converted and decompressed into as many BIN bytes as possible.
        2c. BIN bytes are added to the read() buffer.
        2d. Go back to step 2a until the buffer is full enough or we hit end-of-input.

The conversion-and-decompression algorithms need their own internal buffers and state (since the next input chunk may not contain all the data needed for a complete conversion/decompression operation). These are maintained in the object, so parsing two different input streams simultaneously is possible.

WARNINGS

Only handles "Hqx7" files, as per RFC-1741.

Remember that Macintosh text files use "\r" as end-of-line: this means that if you want a textual file to look normal on a non-Mac system, you probably want to do this to the data:

    # Get the data, and output it according to normal conventions:
    foreach ($HQX->read_data) { s/\r/\n/g; print }

AUTHOR AND CREDITS

Maintained by Stephen Nelson <stephenenelson@mac.com>

Written by Eryq, http://www.enteract.com/~eryq / eryq@enteract.com

Support for native-Mac conversion, plus invaluable contributions in Alpha Testing, plus a few patches, plus the baseline binhex/debinhex programs, were provided by Paul J. Schinder (NASA/GSFC).

Ken Lunde (Adobe) suggested incorporating the CAP file representation.

LICENSE

Copyright (c) 1997 by Eryq. All rights reserved.
This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

This software comes with NO WARRANTY of any kind. See the COPYING file in the distribution for details.