This is a simple PO file to ruby hash parser, which complies with GNU PO file specification. Tested with the msgcat (GNU gettext-tools) 0.18.3 tool.
The parser is probably as optimized for speed as possible with pure ruby, while parsing all different PO types into seperate keys. It was written as a "core-replacement" for an equivalent parslet PEG parser for arashm/PoParser and benchmarked about 500 times faster.
The parser can be used in two ways:
SimplePoParser.parse(file_path) # parses a PO file and returns array of hashes
SimplePoParser.parse_message(message) # parses a single PO message and returns a hash
A PO message is parsed into a hash with meaningful keys for each type of line. The values are strings if only one line of such content was parsed, otherwise it's an array of strings. Each string is representing one line of content in the PO file.
{
:translator_comment => "" || ["", ""...],
:extracted_comment => "" || ["", ""...],
:reference => "" || ["", ""...],
:flag => "" || ["", ""...],
:previous_msgctxt => "" || ["", ""...],# msgctxt of the message used for the fuzzy translation
:previous_msgid => "" || ["", ""...], # msgid of the messaged used for the fuzzy translation
:previous_msgid_plural => "" || ["", ""...],
:msgctxt => "" || ["", ""...],
:msgid => "" || ["", ""...],
:msgid_plural => "" || ["", ""...],
:msgstr => "" || ["", ""...], # for singular messages
"msgstr[N]" => "" || ["", ""...] # for plural messages, there N is the plural number starting from 0
}
License: MIT - Copyright (c) 2017 Dennis-Florian Herr @Experteer GmbH