
Platform specific split string? #10

Closed · jeroen opened this issue Apr 9, 2018 · 4 comments

jeroen commented Apr 9, 2018

If I run the example from the readme I see different output:

[Screenshot from Apr 9, 2018 showing the differing output]

Is this expected? I noticed you split by `\r\n`, which I think is Windows-specific. Did you test the package on non-Windows machines?
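
A minimal reprex of the failure mode I mean (the input string here is made up, not taken from the package):

```r
# On Unix-like systems, text usually has "\n" line endings.
x <- "line one\nline two\nline three"

# Splitting on the Windows-only "\r\n" leaves the string intact:
strsplit(x, "\r\n")[[1]]
#> [1] "line one\nline two\nline three"

# A pattern covering all three endings works on every platform:
strsplit(x, "\r\n|\r|\n")[[1]]
#> [1] "line one" "line two" "line three"
```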

lebebr01 (Owner) commented Apr 9, 2018

This is not expected and not an issue I was aware of. I don't currently have a non-Windows machine to run local tests like this on.

When I wrote this portion of the code I was not aware of the tokenizers package. It would likely be more robust to use its tokenize_lines() function for this; a sketch follows below.

I will also brainstorm a unit test that would catch this behavior. I'm open to ideas for a good way to test it.
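
For instance, a rough sketch of what that swap might look like (the input string is made up):

```r
library(tokenizers)

# Mixed line endings, as text copied between platforms might have:
x <- "line one\r\nline two\nline three"

# tokenize_lines() splits on "\n", "\r\n", and "\r" alike,
# so the output is identical across platforms:
tokenize_lines(x)[[1]]
#> [1] "line one" "line two" "line three"
```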

lebebr01 added the bug label Apr 9, 2018
lebebr01 self-assigned this Apr 9, 2018
jeroen (Author) commented Apr 9, 2018

Yes, you should add a unit test for this. You can automatically run checks on Linux and macOS using Travis CI.

lebebr01 (Owner) commented Apr 9, 2018

e8d8e90 should fix the issue.

Added a unit test that checks for literal `"\n"` characters in the result text. Open to other ways to test this behavior.
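
Roughly along these lines (a sketch; `result_text` stands in for the package's actual split output):

```r
library(testthat)

test_that("split text contains no literal line-ending characters", {
  # Placeholder for the package's actual output after splitting:
  result_text <- c("line one", "line two", "line three")

  # No element should still contain a carriage return or newline:
  expect_false(any(grepl("[\r\n]", result_text)))
})
```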

lebebr01 closed this as completed Apr 9, 2018
jeroen (Author) commented Apr 10, 2018

OK, thanks. BTW, you could reduce your dependency weight by calling stringi::stri_split_lines() and stringi::stri_split_boundaries(x, type = "word") directly rather than via tokenizers.
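
A quick sketch of the direct calls (throwaway input string):

```r
library(stringi)

x <- "first line\r\nsecond line\nthird line"

# Splits on any Unicode line boundary, "\r\n" included:
stri_split_lines(x)[[1]]
#> [1] "first line"  "second line" "third line"

# ICU word boundaries; skip_word_none = TRUE drops the whitespace segments:
stri_split_boundaries("first line", type = "word", skip_word_none = TRUE)[[1]]
#> [1] "first" "line"
```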
