CSV file containing empty line isn't parsed #13

mb21 · 2015-07-11T18:40:01Z

$ cat a.md 

```{.table caption="capt" source="b.csv"}
```

$ cat b.csv 
foo,bar
,
foo,bar

$ pandoc --filter pandoc-csv2table a.md 
<p>+-------+-------+ | foo | bar | +=======+=======+ +-------+-------+ | foo | bar | +-------+-------+</p>
<p>Table: capt</p>

The text was updated successfully, but these errors were encountered:

baig · 2015-07-11T18:59:11Z

Your CSV is invalid. It should be like this:

foo,bar
foo,bar

mb21 · 2015-07-11T19:15:38Z

Well, CSV isn't a particularly well-defined format. But every spreadsheet software I know of would parse my csv file as one containing an empty line (in fact, it was generated by google sheets). So I would expect:

| foo  | bar |
|------|-----|
|      |     |
| foo  | bar |

Actually, it's not even Text.CSV, in ghci:

Prelude Text.CSV> parseCSVFromFile "b.csv"
Right [["foo","bar"],["",""],["foo","bar"],[""]]

I wonder why the filter decides to print the rendered table as markdown wrapped in a paragraph of all things... you wrote earlier "as an intermediate step it pipes the CSV contents through Pandoc's Markdown Reader." I still don't understand that design decision: why not convert the list of lists we got from Text.CSV directly to a Text.Pandoc.Definition.Table?

baig · 2015-07-11T19:49:32Z

Then this seems like a csv parser issue. The filter uses an external csv parser which implements csv parsing as defined in RFC 4180.

mb21 · 2015-07-11T20:07:36Z

See my updated message above. Also, I was curious, so I checked out the RFC's BNF grammar. It defines a record as one or more comma-separated fields, and a field as escaped or non-escaped where non-escaped is zero or more TEXTDATA, so the file is valid...

baig · 2015-07-11T20:37:05Z

I still don't understand that design decision: why not convert the list of lists we got from Text.CSV directly to a Text.Pandoc.Definition.Table?

Because pandoc tables allow markdown inside their cells.

I'll have to see where the filter is going wrong but don't hold your breath in the meantime. I am finalizing my dissertation and it might be a while before I look into it.

Also, pull requests are welcomed and appreciated.

mb21 · 2015-07-12T09:56:12Z

okay :)

mb21 mentioned this issue Jul 11, 2015

[Bug] type="pipe" does not work at all; type="multiline" does not honor aligns="L" #11

Open

baig added the bug label Jul 11, 2015

mb21 mentioned this issue Jul 13, 2015

Isn't this filter a duplicate effort to pandoc-csv2table ? mb21/pandoc-placetable#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CSV file containing empty line isn't parsed #13

CSV file containing empty line isn't parsed #13

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 12, 2015

CSV file containing empty line isn't parsed #13

CSV file containing empty line isn't parsed #13

Comments

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 11, 2015

baig commented Jul 11, 2015

mb21 commented Jul 12, 2015