Provided by: libdata-tablereader-perl_0.021-1_all

NAME

       Data::TableReader::Decoder::IdiotCSV - Access rows of a badly formatted comma-delimited text file

VERSION

       version 0.021

DESCRIPTION

       This decoder is like ::Decoder::CSV, but can additionally parse the garbage resulting from those special
       people who write "CSV Export" code that looks like

         print join(",", map qq{"$_"}, @record)."\n";

       (or rather, the equivalent code in Visual Basic or PHP, which is what they're probably using) regardless
       of whether their data contains quote characters or newlines, resulting in garbage like:

         "First Name","Last Name","email"
         "Joseph "Joe","Smith",""Smith, Joe" <jsmith@example.com>"
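       To make the failure mode concrete, the following is a minimal sketch contrasting the naive export with
       one that doubles embedded quotes, which is the usual CSV convention. The record values and the helper
       names naive_csv_line and escaped_csv_line are hypothetical, chosen only for illustration.

```perl
use strict;
use warnings;

# Hypothetical record; the values are illustrative only.
my @record = ('Joseph "Joe"', 'Smith', '"Smith, Joe" <jsmith@example.com>');

# The naive export: wrap each value in quotes without escaping anything.
sub naive_csv_line {
    my @fields = @_;
    return join(",", map qq{"$_"}, @fields) . "\n";
}

# What a conforming writer would do instead: double any embedded quotes
# before wrapping the value.
sub escaped_csv_line {
    my @fields = @_;
    return join(",", map { my $v = $_; $v =~ s/"/""/g; qq{"$v"} } @fields) . "\n";
}

print naive_csv_line(@record);     # embedded quotes are left ambiguous
print escaped_csv_line(@record);   # embedded quotes are doubled
```

       In the naive output, a reader cannot tell from the quote characters alone where a field ends, which is
       why a decoder for such files has to rely on looser rules.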

       This can actually be processed by (recent versions of) the Text::CSV module with the following
       configuration:

         {
           binary => 1,
           allow_loose_quotes => 1,
           allow_whitespace => 1,
           escape_char => undef,
         }

       And so this module is simply a subclass of Data::TableReader::Decoder::CSV which provides those defaults
       to the parser.
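
       As a rough sketch of what those defaults amount to, the options can be handed to Text::CSV directly.
       This assumes a reasonably recent Text::CSV is installed; the variable names are hypothetical, and the
       exact field values produced for a given malformed line are not guaranteed, so the result should be
       inspected rather than assumed.

```perl
use strict;
use warnings;
use Text::CSV;

# The options quoted above, passed straight to Text::CSV.
my %idiot_csv_options = (
    binary             => 1,
    allow_loose_quotes => 1,
    allow_whitespace   => 1,
    escape_char        => undef,
);

my $csv = Text::CSV->new({ %idiot_csv_options })
    or die "Cannot construct Text::CSV: " . Text::CSV->error_diag;

# Sample line taken from the description above.
my $line = q{"Joseph "Joe","Smith",""Smith, Joe" <jsmith@example.com>"};

if ($csv->parse($line)) {
    # Print each decoded field in brackets so stray quotes are visible.
    print "[$_]\n" for $csv->fields;
} else {
    # In list context, error_diag returns the error code and message first.
    my ($code, $message) = $csv->error_diag;
    warn "parse failed: $code $message\n";
}
```

       Printing the fields in brackets makes it easy to check by eye whether the parser's guess matches what
       the data's author presumably meant.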

       How does the parsing work though?  Well, some guesswork and patterns.  It's not super reliable, and you
       should always complain loudly to whoever generated that data, unless they're a much larger company than
       you and would never listen, or went out of business a while back, in which case you can justify using
       this module in production.

AUTHOR

       Michael Conrad <mike@nrdvana.net>

       This software is copyright (c) 2024 by Michael Conrad.

       This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5
       programming language system itself.