Re: Parse CVS in awk


From: Wolfgang Laun
Subject: Re: Parse CVS in awk
Date: Thu, 9 Apr 2020 16:54:20 +0200

I remember having read this in the RFC (RFC 4180), and I went and dug it
out for you. It's quite clear:
"While there are various specifications and implementations for the
CSV format, there is no formal specification in existence, which
allows for a wide variety of interpretations of CSV files."
Or, as the poet has put it,
*Lasciate ogni speranza voi ch'entrate.*

No programming language or its bug list is going to save you from your
predicament.
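
As a rough illustration of why (a sketch in Python rather than awk, using
only the standard csv module): the two quote-escaping conventions from the
Wikipedia excerpt quoted below need different reader settings, and a reader
set up for one convention does not recover the same field from the other.
The helper name parse and the sample records are made up for this example.

```python
import csv
import io


def parse(text, **fmt):
    """Parse CSV text using the given csv.reader formatting parameters."""
    return list(csv.reader(io.StringIO(text), **fmt))


# The same logical record (1, say "hi", 2) written in two dialects.
doubled = '1,"say ""hi""",2\n'        # embedded quote written as ""
backslash = '1,"say \\"hi\\"",2\n'    # embedded quote written as \"

# Default settings assume the doubled-quote convention.
rows_doubled = parse(doubled)

# The backslash convention has to be requested explicitly.
rows_backslash = parse(backslash, doublequote=False, escapechar="\\")

# Reading backslash-escaped data with the defaults raises no error,
# but the middle field no longer matches the intended value.
rows_mismatch = parse(backslash)

print(rows_doubled)
print(rows_backslash)
print(rows_mismatch)
```

Nothing in the data itself says which convention was used, so the choice has
to come from outside the file, whatever the language.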

-W



On Thu, 9 Apr 2020 at 04:52, Peng Yu <address@hidden> wrote:

> I'm wondering if the solution mentioned here is robust against all CSV
> format variations.
>
> https://www.gnu.org/software/gawk/manual/gawk.html#Splitting-By-Content
>
> https://en.wikipedia.org/wiki/Comma-separated_values
>
> "Embedded double quote characters may then be represented by a pair of
> consecutive double quotes,[13] or by prefixing a double quote with an
> escape character such as a backslash (for example in Sybase Central)."
>
> It seems that there are variations in the CSV format. Is there a way to
> robustly handle all possible cases?
>
> --
> Regards,
> Peng
>
>

