Provided by: libemail-folder-perl_0.860-3_all
NAME
Email::Folder::Mbox - reads raw RFC822 mails from an mbox file
VERSION
version 0.860
SYNOPSIS
This isa Email::Folder::Reader - read about its API there.
DESCRIPTION
Does exactly what it says on the tin - fetches raw RFC822 mails from an mbox. The mbox format is described at http://www.qmail.org/man/man5/mbox.html We attempt to read an mbox as through it's the mboxcl2 variant, falling back to regular mbox mode if there is no "Content-Length" header to be found. OPTIONS The new constructor takes extra options. "fh" When filename is set to "FH" than Email::Folder::Mbox will read mbox archive from filehandle "fh" instead from disk file "filename". "eol" This indicates what the line-ending style is to be. The default is "\n", but for handling files with mac line-endings you would want to specify "eol => "\x0d"" "jwz_From_" The value is taken as a boolean that governs what is used match as a message separator. If false we use the mutt style /^From \S+\s+(?:Mon|Tue|Wed|Thu|Fri|Sat|Sun)/ /^From (?:Mon|Tue|Wed|Thu|Fri|Sat|Sun)/; If true we use /^From / In deference to this extract from <http://www.jwz.org/doc/content-length.html> Essentially the only safe way to parse that file format is to consider all lines which begin with the characters ``From '' (From-space), which are preceded by a blank line or beginning-of-file, to be the division between messages. That is, the delimiter is "\n\nFrom .*\n" except for the very first message in the file, where it is "^From .*\n". Some people will tell you that you should do stricter parsing on those lines: check for user names and dates and so on. They are wrong. The random crap that has traditionally been dumped into that line is without bound; comparing the first five characters is the only safe and portable thing to do. Usually, but not always, the next token on the line after ``From '' will be a user-id, or email address, or UUCP path, and usually the next thing on the line will be a date specification, in some format, and usually there's nothing after that. But you can't rely on any of this. Defaults to false. "unescape" This boolean value indicates whenever lines which starts with /^>+From / should be unescaped (= removed leading '>' char). This is needed for mboxrd and mboxcl variants. But there is no way to detect for used mbox variant, so default value is false. "seek_to" Seek to an offset when opening the mbox. When used in combination with ->tell you may be able to resume reading, with a trailing wind. "next_message" This returns next message as string "next_messageref" This returns next message as ref to string "tell" This returns the current filehandle position in the mbox. "next_from" This returns the From_ line for next message. Call it before ->next_message. "messageid" This returns the messageid of last read message. Call if after ->next_message.
AUTHORS
• Simon Wistow <simon@thegestalt.org> • Richard Clamp <richardc@unixbeard.net> • Pali <pali@cpan.org>
COPYRIGHT AND LICENSE
This software is copyright (c) 2006 by Simon Wistow. This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.