Implementing filters for capitalize, downcase, first, last, prepend, append and pluralize #18

gnunicorn · 2016-03-18T19:57:22Z

This implements a few more filters.

Specifically it adds:

capitalize - capitalize words in the input sentence
downcase - convert an input string to lowercase
first - get the first element of the passed in array
last - get the last element of the passed in array
prepend - prepend a string e.g. {{ 'bar' | prepend:'foo' }} #=> 'foobar'
pluralize - return the second word if the input is not 1, otherwise return the first word e.g. {{ 3 | pluralize: 'item', 'items' }} #=> 'items'
append - append a string e.g. {{ 'foo' | append:'bar' }} #=> 'foobar'
slice - slice a string. Takes an offset and length, e.g. {{ "hello" | slice: -3, 3 }} #=> llo
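To make the two less obvious filters concrete, here is a minimal sketch of the `slice` and `pluralize` semantics described above. The function names and signatures are hypothetical and operate on plain Rust types rather than the crate's value type; the clamping of out-of-range offsets is an assumption, not something the description specifies.

```rust
/// Sketch of `slice`: a negative `offset` counts back from the end of the
/// string, and `length` is the maximum number of characters returned.
/// Works on chars, not bytes, so multi-byte input is not cut in half.
fn slice(input: &str, offset: isize, length: usize) -> String {
    let chars: Vec<char> = input.chars().collect();
    let len = chars.len() as isize;
    // Assumption: out-of-range offsets are clamped instead of being an error.
    let start: isize = if offset < 0 {
        (len + offset).max(0)
    } else {
        offset.min(len)
    };
    chars.iter().skip(start as usize).take(length).collect()
}

/// Sketch of `pluralize`: the first word only when the count is exactly 1.
fn pluralize<'a>(count: i64, singular: &'a str, plural: &'a str) -> &'a str {
    if count == 1 {
        singular
    } else {
        plural
    }
}
```

With these definitions, `slice("hello", -3, 3)` gives `llo` and `pluralize(3, "item", "items")` gives `items`, matching the examples in the list.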

References #11

johannhof · 2016-04-02T06:00:46Z

@ligthyear any updates on this? Should I preliminarily review it already? :)

The travis build is failing because you mis-spelled divided_by and it can't import it. 😊

However, #28 already introduced divided_by and times so you'll also have to rebase again

gnunicorn · 2016-04-02T09:51:21Z

Sorry, @johannhof , I was hoping I had a bit more time to implement a few others, but the last two weeks went crazy instead.

I rebased against master and updated everything to make all tests pass. I guess it is best to review/merge this batch now and do others another time. I'll update the description accordingly.

So, yes, please feel free to review and merge.

johannhof · 2016-04-05T08:00:22Z

@ligthyear don't worry about it, I can totally relate. I'll review it soon! Thanks for contributing

johannhof · 2016-04-06T13:28:00Z

src/filters.rs

+                    },
+                _ => chr.to_uppercase().next().unwrap(),
+            }.to_string();
+            word + &next_char


I would've done something like

Str(ref s) => Ok(Str(s.split(' ').map(|word| { let (h, t) = word.split_at(1); h.to_uppercase() + t }).collect::<Vec<String>>().join(" "))),

but it seems you're trying to preserve whitespace, which is fine by me. Can I just ask you to add some comments explaining your code, because I really can't wrap my head around it. Also, are the unwrap calls guaranteed to be safe?

Also, are we sure that the iterator to_uppercase returns will always contain only one uppercase letter? afaik ß maps to SS, for instance.
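For reference, a small sketch of the multi-char case. `char::to_uppercase` returns an iterator precisely because one char can uppercase to several, and `'ß'` does yield `SS`. The helper name `capitalize_word` is hypothetical; it keeps the whole mapping instead of calling `.next().unwrap()`. It also avoids `split_at(1)`, which panics on an empty word and on a word whose first character is more than one byte long.

```rust
/// Uppercase the first character of `word`, keeping every char that the
/// uppercase mapping produces. An empty word is returned unchanged.
fn capitalize_word(word: &str) -> String {
    let mut chars = word.chars();
    match chars.next() {
        // Chain the (possibly multi-char) uppercase form with the rest.
        Some(first) => first.to_uppercase().chain(chars).collect(),
        None => String::new(),
    }
}
```

Whether `SS` or a single `S` is the desired result for a leading ß is a design decision; the sketch only shows that no unwrap or byte-index arithmetic is needed either way.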

Some random suggestions:

Converting every char to a String and then appending that to the other string isn't the best way to do this. You should probably use String::push instead. Btw. word as the variable name for the accumulator string is a bit misleading, I think.

You're also iterating over the accumulated string to get the last char in every iteration, which means the whole function is O(n^2) in the length of the input string. That's not necessarily a problem, but it would be easy to avoid by just having a boolean that keeps track of whether the last char was whitespace. That would make the function O(n) and also make it easier to read.

Finally, you're iterating over char_indices, but never actually look at the indices. Is there a reason you couldn't just use chars instead?
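Put together, the three suggestions above could look roughly like the following. This is only a sketch, not the code under review: the function name and the plain `&str` signature are assumptions, and it keeps the full uppercase mapping of each leading character.

```rust
/// Capitalize the first character of every whitespace-separated word,
/// leaving all whitespace (tabs, line breaks, runs of spaces) untouched.
/// Single pass over `chars`, so O(n) in the length of the input.
fn capitalize(input: &str) -> String {
    let mut out = String::with_capacity(input.len());
    // True at the start of the input and right after any whitespace char.
    let mut at_word_start = true;
    for chr in input.chars() {
        if at_word_start && !chr.is_whitespace() {
            // `to_uppercase` may yield more than one char; append them all.
            out.extend(chr.to_uppercase());
        } else {
            out.push(chr);
        }
        at_word_start = chr.is_whitespace();
    }
    out
}
```

The boolean replaces the look-back into the accumulated string, and `String::push` / `extend` avoid allocating a temporary String per character.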

Regarding the ß issue: I don't know enough about unicode to know how much of a problem this is. ß probably won't be a problem because it never occurs at the beginning of a word anyway, but there might be other characters that have the same issue.

Re ß:
ß is the only letter that doesn't have any uppercase form at all. The technical workaround is often to replace it with "SS", which is two characters. However, that never affects us, as ß also can't be used at the beginning of any word: it should only be used to extend the previous vowel. So, while this would be relevant when uppercasing a whole text (and would actually mean your lowercased text is shorter than the uppercased one), in a title-case scenario (the one we are looking at) this isn't an issue in practice. Granted, someone could still feed in a ß at the beginning of a word, but as far as I understand this code, that would just lead to "S" being used instead of "SS".

Re complexity/ not using split(' '):
This is complex because unicode is complex. I had a split version first, then added the "silent whitespace" test and everything failed – as it also would for a line break. Same for the join. We'd replace the whitespaces improperly.
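To illustrate the whitespace problem, here is a sketch of a naive split-and-join variant. It is not the earlier implementation mentioned above (that code is not shown in this thread); the function name is hypothetical.

```rust
/// Naive capitalize: split on any whitespace, capitalize each word,
/// re-join with single spaces. The original separators are lost.
fn capitalize_split_join(input: &str) -> String {
    input
        .split_whitespace()
        .map(|word| {
            let mut chars = word.chars();
            match chars.next() {
                Some(first) => first.to_uppercase().chain(chars).collect::<String>(),
                None => String::new(),
            }
        })
        .collect::<Vec<String>>()
        .join(" ")
}
```

Given `"hello\nbig  world"` this returns `"Hello Big World"`: the line break and the double space are both replaced by a single space. Splitting on `' '` alone has the opposite problem, since `"hello\nworld".split(' ')` yields a single item and the word after the line break is never capitalized.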

Re char_indices: this is a relic of an earlier implementation. Yes, this should use chars.

Re String::push: good point. Though that means I need to get a mutable handle. I'm sure I can do that in the fold without breaking the borrow checking. I'll try it.

Re Boolean for tracking: indeed, that could be done. That's how I used to do it in Python, but I never liked it, in this case in particular, as my otherwise very functional closure would get an ugly side effect it constantly depends on. I'm also not sure the borrow checker will allow that one. I'll see if that is something I can make better.

The unwrap()s here are safe. We always have an actual character ;) .

Re Boolean for tracking: indeed, that could be done. That's how I used to do it in Python, but I never liked it, in this case in particular, as my otherwise very functional closure would get an ugly side effect it constantly depends on.

That doesn't require any side effects, the accumulator variable of the fold can just be (String, bool) instead of String. Although I have to say that even as a Haskell programmer, I'm not sure that a fold is really better than a loop here.

Re String::push: good point. Though that means I need to get a mutable handle. I'm sure I can do that in the fold without breaking the borrow checking. I'll try it.

This shouldn't be a problem, you just need to add mut to the binding of word. I would be surprised if that caused issues with the borrow checker.
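As a sketch of both points, assuming a plain `&str` input and a hypothetical function name: the fold below threads a `(String, bool)` accumulator, so the closure has no side effects, and the accumulator is bound as `mut` in the closure pattern so that `String::push` can be used.

```rust
/// Fold-based capitalize: the accumulator carries the output string and a
/// flag saying whether the previous char was whitespace.
fn capitalize_fold(input: &str) -> String {
    let (out, _) = input.chars().fold(
        (String::with_capacity(input.len()), true),
        // `mut acc` takes ownership of the String, so no borrow is held
        // across iterations and the borrow checker has nothing to reject.
        |(mut acc, at_word_start), chr| {
            if at_word_start && !chr.is_whitespace() {
                acc.extend(chr.to_uppercase());
            } else {
                acc.push(chr);
            }
            (acc, chr.is_whitespace())
        },
    );
    out
}
```

The String is moved into and out of the closure on every step, which costs nothing beyond a pointer-sized move. Whether this reads better than a plain `for` loop is a matter of taste.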

johannhof · 2016-04-06T19:39:07Z

I basically only have that comment, the rest looks good. We'd also need test coverage for the error cases (passing invalid values to the filters). If you don't have the time for that we can just open a follow-up bug. :)
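A test for an error case could look roughly like the sketch below. The `Value` and `FilterError` types here are simplified stand-ins invented for the example, not the crate's actual definitions, and the error message is likewise made up.

```rust
// Hypothetical, simplified stand-ins for the crate's value and error types.
#[derive(Debug, PartialEq)]
enum Value {
    Str(String),
    Num(f32),
}

#[derive(Debug, PartialEq)]
enum FilterError {
    InvalidType(String),
}

/// Sketch of a filter that accepts strings only and rejects everything else.
fn downcase(input: &Value) -> Result<Value, FilterError> {
    match input {
        Value::Str(s) => Ok(Value::Str(s.to_lowercase())),
        _ => Err(FilterError::InvalidType("String expected".to_owned())),
    }
}
```

An error-case test then asserts on the `Err` variant, for example that `downcase(&Value::Num(1.0))` returns `Err(FilterError::InvalidType(..))`, alongside the usual happy-path assertion.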

gnunicorn · 2016-04-08T09:32:58Z

@johannhof See comments in the thread. I can/will take a look this weekend, is that soon enough? also adding some invalid values should be in there ;) .

johannhof · 2016-04-10T06:30:46Z

Take your time, this PR is not going away :)

also adding some invalid values should be in there ;)

huh, where? I can't seem to find it

Thanks!

johannhof · 2016-05-15T21:16:11Z

Going through this again I think it's ok to merge, thank you very much @ligthyear! Feel free to make another PR if you still want to pick up the suggestions in this thread.

Benjamin Kampmann added 10 commits April 2, 2016 11:34

Add Unicode aware Capitalize filter

e6a2f7f

Implement Downcase

2d2ea12

Add Array test for size

2439349

Implement filter 'first'

d845003

Implement filter 'last'

14a0255

Implement filter 'prepend'

e8c4cc9

Implement filter 'append'

83db32b

Implement filter 'pluralize'

2bba018

fixing typo

04029d8

Switch to mew API after rebase

f369dea

gnunicorn force-pushed the more-filters branch from 6145176 to f369dea Compare April 2, 2016 09:43

Style to make nightly linter happy, too

dbc1fe4

gnunicorn changed the title ~~Implementing remaining filters~~ Implementing filters for capitalize, downcase, first, last, prepend, append and pluralize Apr 2, 2016

johannhof reviewed Apr 6, 2016

johannhof merged commit 9e8bc53 into cobalt-org:master May 15, 2016


Review thread comments: johannhof Apr 6, 2016; fhartwig Apr 7, 2016; gnunicorn Apr 8, 2016; fhartwig Apr 8, 2016
