class RMail::Mailbox::MBoxReader
Class that can parse Unix mbox style mailboxes. These mailboxes separate individual messages with a line beginning with the string “From ”.
Typical usage:
File.open("file.mbox") { |file| RMail::Mailbox::MBoxReader.new(file).each_message { |input| message = RMail::Parser.read(input) # do something with the message end }
Or see RMail::Mailbox.parse_mbox for a more convenient interface.
Public Class Methods
Creates a new MBoxReader that reads from `input' with lines that end with `line_separator'.
`input' can either be an IO source (an object that responds to the “read” method in the same way as a standard IO object) or a String.
`line_separator' defaults to $/, and useful values are probably limited to “n” (Unix) and “rn” (DOS/Windows).
# File lib/rmail/mailbox/mboxreader.rb, line 61 def initialize(input, line_separator = $/) super(input) @end_of_message = false @chunk_minsize = 0 @sep = line_separator @tail = nil # This regexp will match a From_ header, or some prefix. re_string = RMail::Parser::PushbackReader. maybe_contains_re("#{@sep}From ") @partial_from_re = Regexp.new(re_string) # This regexp will match an entire From_ header. @entire_from_re = /\A#{@sep}From .*?#{@sep}/ end
Public Instance Methods
Yield self until eof, calling next after each yield.
This method makes it simple to read messages successively out of the mailbox. See the class description for a code example.
# File lib/rmail/mailbox/mboxreader.rb, line 128 def each_message while !eof yield self self.next end end
Returns true if the next call to #read_chunk will return nil.
# File lib/rmail/mailbox/mboxreader.rb, line 120 def eof parent_eof and @tail.nil? end
Advances to the next message to be read. Call this after read returns nil.
Note: Once read returns nil, you can call eof before or after calling next to tell if there actually is a next message to read.
# File lib/rmail/mailbox/mboxreader.rb, line 112 def next @end_of_message = false @tail = nil end
Reads some data from the current message and returns it. The `size' argument is just a suggestion, and the returned string can be larger or smaller. When `size' is nil, then the entire message is returned.
Once all data from the current message has been read, read returns nil and next must be called to begin reading from the next message. You can use eof to tell if there is any more data to be read from the input source.
# File lib/rmail/mailbox/mboxreader.rb, line 88 def read_chunk(size) chunk = read_chunk_low(size) if chunk if chunk.length > @sep.length @tail = chunk[-@sep.length .. -1] else @tail ||= '' @tail << chunk end elsif @tail if @tail[-@sep.length .. -1] != @sep chunk = @sep end @tail = nil end chunk end