Functional style #2908

jgonggrijp · 2021-02-07T00:29:55Z

This is a substantial refactoring of the Underscore source code, which I've been working on since shortly after the modularization (#2849). Before merging, I would like this to be reviewed extensively, by multiple people, in multiple ways and in multiple stages. Please see the final third of this post for review details.

Edit: I need help from people who can contribute benchmarks based on real-world applications. Please leave a comment if you think you might be able to help!

Goals

Over the years, the source code has lost a substantial portion of its former elegance. This is partly due to bugfixes and feature additions, but also substantially due to micro-optimizations of which the added value is often questionable. I wanted to bring back some of that former elegance, prioritizing a clean, readable, DRY code style over absolute maximum performance.
Making the code more DRY and more functional would also contribute to a smaller bundle size (Go on a Diet #2060), so I made this an explicit goal as well.
Preparing the source code for future code sharing with Underscore-contrib and Underscore Fusion. I theorized that some of Underscore's internal logic could be factored out. Once factored out, it could potentially be made public, or at least be made stable enough to enable reuse in Underscore's sister projects.
Preparing the source code for adding new core collection types in the future, in particular Map, Set and iterators in Underscore 2.0. This can only be done in a maintainable way if there is a single source of truth about how to iterate any given collection. More about this below.

Principles

No interface changes (yet), in order to keep distractions at a minimum. While the code changes contain some seeds for possible future interface changes (mostly additions), I wanted this branch to focus entirely on the implementation of the existing interface.
No assumptions. I measured the effect on minified+gzipped UMD bundle weight after every change and did extensive microbenchmarking in order to assess impact on performance.
Only meaningful changes. If, after measurement, it turns out that a change benefits neither readability nor bundle size nor performance, don't make the change. There are quite a few changes that I reverted for this reason. I have interactively rebased this branch several times in order to clean up history.
Mild performance degradations are acceptable. 99% of application code is not performance-critical; programmer productivity and code size are usually much more important. Neither of those interests is served by trying to squeeze every drop of performance out of the hardware. In the remaining 1% of cases, if a slight slowdown is really a problem (i.e., fairly trivial code in a hot loop), users can replace their call to an Underscore function by a simple hand-written loop and easily achieve much better performance than they ever could with a library function.

Approach

I started by asking myself which existing Underscore function should be the single source of truth about collection iteration. Initially, I considered _.reduce. It is a very general function and many other collection functions can be cleanly expressed in terms of it, for example _.map, _.filter and _.min/_.max. I also considered _.each, because it doesn't use an accumulator by default. _.each and _.reduce are equally powerful, in the sense that either can be cleanly expressed in terms of the other.

I soon realized, however, that there is a function that can't be cleanly expressed in terms of _.each or _.reduce: _.find. Like the procedural for-in loop with a break option, _.find may stop iteration early. Otherwise, it basically does the same thing as _.each. All Underscore collection functions can be cleanly expressed using _.find, including not only _.each, _.reduce, _.map and _.min but also _.some, _.every and _.contains. For this reason, I have chosen _.find to be the one function that defines how to iterate a collection.

Conveniently, _.find was already implemented by branching over the collection types: it calls _.findIndex on arrays and _.findKey on objects. These latter functions, in turn, are the single sources of truth about iterating arrays and objects, respectively (although the situation is a bit more nuanced with regard to arrays; more on this shortly). In Underscore 2.0, I plan to add _.findEntry for Map/Set and _.findIteration for iterators. By including these in _.find and implementing all other collection functions in terms of _.find, all collection functions would automatically support all five collection types in a consistent way.

Besides using _.find in all collection functions, one of the first things I did was factoring out very general linearSearch and binarySearch functions. These are the actual single sources of truth on how to iterate/search arrays; _.findIndex, _.findLastIndex, _.indexOf, _.lastIndexOf and _.sortedIndex are all implemented using linearSearch and/or binarySearch under the hood.

linearSearch is so general that I was able to substitute it for nearly all hand-written for/while loops in the source code. This proved very effective in both producing cleaner code and reducing the bundle size. However, I have reverted this in many places because function call overhead turned out to be costlier than I expected. It seems that even modern JS engines rarely perform inlining, if ever. After discovering this, I adopted the following "safe" rule to choose between linearSearch and a hand-written loop: if the loop body didn't previously involve any function call, keep the hand-written loop; otherwise, replace it by linearSearch or another Underscore function. In this way, the extra function call introduced by using an Underscore function can never slow down the loop by more than a factor two. I expect the slowdown to be less in real-world scenarios, because a loop body that purely consists of two function calls (i.e., with functions that don't do anything) is trivial. Also, loops that already involved a function call often already involved multiple function calls.

I wrote an extensive microbenchmark suite to measure performance, in which each operation is repeated at least 3.6 million times and during at least a whole second. Please follow that link for details. All performance claims in the current post are informed by repeated measurements in Node.js 10, mobile Safari 14 and desktop Firefox 84 (on macOS) with those microbenchmarks. However, I believe that a final performance impact assessment should be made by comparing execution time in real-world applications and across a wider range of engines. More on this in the review section at the bottom of this post.

With this overall approach in mind, I made several passes over all the source modules, looking for opportunities to make the code cleaner, DRYer and more functional. The code I'm presenting today is the result of this effort and the subject of the first round of reviews. Once I have received and processed the first round of reviews, I plan to make one more pass over the source code, in order to make function parameter names more consistent and to fix the linear order of the bundles. After that, I'll take one smaller, final review before finally merging the branch.

Results (stage 1)

All collection functions except for _.find are now written in a collection-agnostic way, i.e., they use _.find or another Underscore function to abstract over the collection type.
New generic functions: linearSearch, binarySearch, extremum, greater, less, lessEqual. I believe that linearSearch and binarySearch should remain internal, in favor of the more user-friendly _.*index* family of functions (which can however expand parameters to support more options from the underlying primitives); the others could potentially be made public in a future update.
It is trivial to add _.sortedLastIndex in the future (in fact it's implemented but not added to the index.js yet) as well as an isSorted option for _.lastIndexOf (which is manually disabled for now).
Hand-written loops have in many cases been replaced by a call to linearSearch or another Underscore function. Besides linearSearch, the main loop primitives are _.findKey and _.times. Besides those primitives, 20 functions (out of the original 38) still contain hand-written loops for performance reasons.
While factoring out extremum, I fixed _.max and _.min may behave incorrectly if iteratee returns infinite number #2688 and opened the possibility of making _.min and _.max work with non-numbers in the future. This is also how I uncovered the "crazy comparison semantic" that I discuss in the next bullet.
I was able to deduplicate the slow path of _.min and _.max, which transforms the elements through an iteratee before comparing, through the outfactoring of extremum. I couldn't deduplicate the fast path (without iteratee) yet, because, for historical reasons, they currently have a rather crazy comparison semantic; they do something that is wildly different from just the builtin </> operators. I could abstract this semantic in a function and pass it to extremum, and in fact, this is exactly what I do in the slow path, but the fast path would suffer a performance degradation of roughly a factor 25 if I were to do the same there. Alternatively, I could inline the comparison logic in extremum, but I don't want to do that because it is really crazy. I want to change it back to regular </> in Underscore 2.0, in which case I'll be able to move the fast path to extremum as well and finish the deduplication. Please see my pre-review comments after this post for more details.
The "switch pyramids of doom" (as @jashkenas once aptly called them) in _.restArguments and optimizeCb are gone. As far as my benchmarks could tell, they had no performance benefit whatsoever over the simpler code that I put in their place.
The infrastructure around _.iteratee and the internal cb has been simplified substantially.
Some functions, including _.map and _.each, take a slight performance hit due to the changes. In my microbenchmarks, with very trivial loop bodies, this could be up to a factor two. I consider this unavoidable, due to the need for a single source of truth about collection iteration, though I have tried hard to keep the sacrifices at a minimum. I'd like to see real-world performance comparisons before drawing any conclusions.
In cases where really tiny collections with only one or two elements are passed to an Underscore function, performance hits might be worse than a factor two due to the increased stacking of function calls. Again, I would prefer to benchmark real-world applications before drawing any conclusions.
In contrast, some functions actually seem to have sped up a bit due to the simplifications, for example the anonymous functions returned by _.restArguments, which are used internally throughout Underscore.
The minified+gzipped UMD bundle size shrank by 225 bytes. I think this is as close to resolving Go on a Diet #2060 as we will ever get (edit: other than some breaking changes that can be made in version 2.0, such as removing IE<11 support).

Review

This PR is unusual because it may lead to changes that sacrifice performance in favor of other considerations. For the sake of accountability, I'd like the review to be very thorough and entirely public. This is going to be a lot of work and I don't want all that work to land on the shoulders of only one or two people. Instead, I hope to involve many people, each doing only light work. As a token of appreciation, everyone making a substantial contribution to this review will get a permanent honorable mention in the changelog.

I propose to divide the review work as follows. I'll be inviting specific people for specific parts, but please rest assured that anyone is welcome to participate in any part of the review. I'll keep track of review progress below.

Benchmark preparation

If you have a real-world JavaScript application (or library function) that meets all of the following criteria:

already in production, i.e., deployed/released and in use by real people in order to accomplish a real task;
has a performance-critical section, i.e., a part that required some attention in the past in order to ensure that it would be fast enough, or of which you could easily imagine that it might become problematically slow;
the performance-critical section involves calling at least one Underscore function;
the application can be adjusted to focus on testing the speed of that performance-critical section (so that testers don't need to wait for more than a few seconds before they can start measuring);

and you are willing to do the following things, with help as needed:

to create a version of your application that is suitable for quickly measuring the speed of the performance-critical section (without changing anything about the internal logic of that section itself);
to publish the source code of the benchmark program thus created and to keep the source code open for an extended period of time (at least one year but preferably indefinitely);
to invest some additional effort to make it as easy as possible for testers to run the program, to measure the speed and to switch between both versions of Underscore;

then please let us know by leaving a comment below!

Performance measurements

You can contribute performance measurements by doing the following.

Pick an engine that you can run benchmarks on (such as Firefox) and a benchmark program that can run on this engine.
Run the benchmark on the chosen engine, both with Underscore at commit c9b4b63 (master) and Underscore at commit eaba5b5 (functional), and compare the speed.
If you find a small speed difference, please repeat your measurements as often as is required in order to determine whether the difference is significant.
If you do find a significant difference, please profile the benchmark with both versions of Underscore to determine what change in Underscore is causing the difference.
Report your findings by leaving a comment below. Please include details such as hardware specifications, exact versions of the operating system and engine used, number of repeated measurements, exact measured speed on each repetition, and exact numbers obtained from profiling.

Final words

I make major contributions to Underscore, like the current one, because I believe it is an excellent library and because I want to help lift it to the next level. If you like what I'm doing, please consider supporting my open source work on Patreon. Shoutout to my existing backers, who encouraged me to do this work, and to @jashkenas, who created this awesome library.

This cost 5 bytes, but it's both more readable and more efficient.

coveralls · 2021-02-07T00:32:00Z

Coverage increased (+0.04%) to 95.238% when pulling eaba5b5 on jgonggrijp:functional into 745e9b7 on jashkenas:master.

jgonggrijp · 2021-02-07T00:31:14Z

modules/_baseIteratee.js

-  if (isFunction(value)) return optimizeCb(value, context, argCount);
-  if (isObject(value) && !isArray(value)) return matcher(value);
-  return property(value);
-}


For clarity: this code was merged into the public-facing modules/iteratee.js.

jgonggrijp · 2021-02-07T00:34:17Z

modules/_binarySearch.js

+
+// Iteratively cut `array` in half to figure out the index at which `obj` should
+// be inserted so as to maintain the order defined by `compare`.
+export default function binarySearch(array, obj, iteratee, compare) {


This is a more general version of _.sortedIndex, using compare instead of builtin operator <. _.sortedIndex now calls binarySearch internally.

jgonggrijp · 2021-02-07T00:41:44Z

modules/_extremum.js

+  var result, iterResult;
+  iteratee = cb(iteratee, context);
+  var first = true;
+  find(collection, function(value, key) {
+    var iterValue = iteratee(value, key, collection);
+    if (first || compare(iterValue, iterResult)) {
+      result = value;
+      iterResult = iterValue;
+      first = false;
+    }
+  });


This loop represents the "slow path" that I mentioned in the opening post. The fast path is currently still duplicated in _.min and _.max, for reasons explained under _.max.

jgonggrijp · 2021-02-07T00:47:27Z

modules/_optimizeCb.js

-  switch (argCount == null ? 3 : argCount) {
-    case 1: return function(value) {
-      return func.call(context, value);
-    };
-    // The 2-argument case is omitted because we’re not using it.
-    case 3: return function(value, index, collection) {
-      return func.call(context, value, index, collection);
-    };
-    case 4: return function(accumulator, value, index, collection) {
-      return func.call(context, accumulator, value, index, collection);
-    };
-  }


This part has been replaced by bindCb4.

jgonggrijp · 2021-02-07T00:47:53Z

modules/_optimizeCb.js

-  return function() {
-    return func.apply(context, arguments);
-  };


This part has been replaced by bindCb.

jgonggrijp · 2021-02-07T02:15:06Z

modules/max.js

+    for (var l = getLength(collection), i = 1; i < l; i++) {
+      val = collection[i];
+      if (
+        res == null || val != null && +val > +res || +res !== +res


I removed the -Infinity initializer, but at the same time committed myself to preserving the aforementioned crazy comparison semantic (until Underscore 2.0). This shifted all the crazyness into the if condition here. Note that this would just be val > res if we were following standard semantics.

jgonggrijp · 2021-02-07T02:36:42Z

modules/max.js

-    iteratee = cb(iteratee, context);
-    each(obj, function(v, index, list) {
-      computed = iteratee(v, index, list);
-      if (computed > lastComputed || computed === -Infinity && result === -Infinity) {


In the original slow path, computed and result were compared to -Infinity on every iteration as a way to ensure that an infinity produced by the iteratee would be preferred over the infinity initializer. This was the second factor that caused #2688. In the new slow path (in extremum), this special condition is no longer present; it is no longer required because the -Infinity initializer is gone.

Note that the special case for null, which I discussed in the fast branch, is not present here. This is due to an oversight of the person who originally added that special case. In effect, this created an additional internal inconsistency in the comparison semantics of _.max and _.min.

jgonggrijp · 2021-02-07T02:40:34Z

modules/max.js

  }
-  return result;
+  return extremum(collection, function(val, res) {
+    return res == null || val != null && +val > +res || +res !== +res;


The new slow path wraps the comparison condition in a function that is passed to extremum. It is the exact same expression as in the fast path; this is meant to ensure that, despite the code duplication, gzip can efficiently compress both copies together.

jgonggrijp · 2021-02-07T02:49:26Z

modules/reduceRight.js

+// A **less general** backward variant of `_.each`, specifically catered to
+// implementing `_.reduceRight`.
+function eachRight(obj, func) {
+  if (isArrayLike(obj)) {
+    findLastIndex(obj, func);
+  } else {
+    findLastIndex(keys(obj), function(key) {
+      func(obj[key], key, obj);
+    });
+  }
+}
+


This is the one exception to _.find being the single source of truth about how to iterate a collection. The problem is that there is no findRight, and I think there never should be; "rightward" iteration is only meaningful for arrays. Objects are unordered and Map, Set and iterators can be iterated in one direction only. In my opinion, reduceRight should disappear in Underscore 2.0. People who want to do right-to-left iteration can use _.findLastIndex directly.

jgonggrijp · 2021-02-07T03:11:35Z

modules/_createIndexFinder.js

+    if (getLength(array) < 1) return -1;
+    if (typeof controlArg == 'number') {
+      start = controlArg;
+    } else if (controlArg && forward) {


The && forward condition is only here to prevent the isSorted option from creeping into _.lastIndexOf as a side effect, purely because I vowed not to make any interface changes on this branch. It can be removed in a future update.

jgonggrijp · 2021-02-09T20:30:23Z

High-level review by @jashkenas, quoted from private email with his permission:

I gave your PR/plan a quick skim, and am 100% in agreement with your plan and proposal. I really love what you're trying to do here! I agree that large performance considerations need to trump pure elegance, for the sake of pragmatism.

jgonggrijp · 2021-03-04T11:29:05Z

First benchmark is in, thanks to awesome work by @m90!

jgonggrijp · 2023-02-27T13:47:22Z

Update: I will be writing an additional real-world benchmark in week 46, which is November 13–17. I hope to finish and merge this branch soon afterwards.

jgonggrijp added 30 commits December 30, 2020 01:21

Simplify _.iteratee

c22e1e9

Make _.contains generic over the collection type

ce15d38

Use _.find in _.each

f0e9bc3

Use _.find in _.every and _.some

0b51eae

Use _.each in createReduce

f4e0f71

Use _.chain in _chainResult

11bdeab

Slim down use of _.each in _.filter

3bc4c50

Use _.each in _.map

ea55aeb

Import modules/iteratee.js for its side effect in modules/_cb.js

5092fb5

Use isArrayLike in _.isEmpty

008990f

Remove an unnecessary check in _.omit

bd14ef1

Remove an unnecessary check from _.pick

38129be

Deduplicate subexpressions in _.sample

be5b8a2

Reorder and document _.toArray

adfb2f8

Simplify and deduplicate _.uniq

e842fd5

Use getLength in collectNonEnumProps

5d73fb3

Use getLength in _.bindAll

3753fe7

Refactor _.first and _.last, using getLength

d7393b4

This cost 5 bytes, but it's both more readable and more efficient.

Use getLength in _.invoke

18fd848

Remove switch pyramid of doom from _.restArguments

296be9e

Simplify _.restArguments using array.slice

60319af

Use _.last in _.invoke

645eafe

Use _.initial in _.invoke

aac1042

Simplify internal flatten using array.push

431b84e

Use array.push in _.partial

5781d89

Factor out linearSearch and binarySearch base algorithms

81a7a77

Use linearSearch in _.has

b4ab797

Use linearSearch in _.intersection

5bc6d47

Use linearSearch in the internal flatten

eef39cd

Use linearSearch in _methodFingerprint

b63e425

jgonggrijp added 16 commits February 4, 2021 14:36

Reuse the start variable in createIndexFinder

01f324e

Use _.find instead of _.each in _.map

6a0ebd8

Use _.find instead of _.each in internal group

7fee0d5

Use _.find instead of _.each in _.filter

f60a3ef

Use _.times instead of linearSearch in _.chunk

ba7df7a

Use _.findKey instead of linearSearch in _.mapObject

1cb9003

Elide createPredicateIndexFinder

674ed39

Slim down predicate elision in linearSearch

b7637b6

Move the identity optimization back to _.min and _.max

d5b8d74

Inline decideNumeric in _.min and _.max

b99a079

Use forward iteration in _.bindAll

9de1ceb

Use forward iteration in _.isEqual

20e88e5

Move the decide parameter out of the way in extremum

d82eadc

Generalize _.sample over the collection types

640332b

Restore only the essence of former optimizeCb

341ad30

Replace bindCb by bindCb4 where appropriate

eaba5b5

jgonggrijp commented Feb 7, 2021

View reviewed changes

jgonggrijp marked this pull request as ready for review February 7, 2021 03:14

jgonggrijp mentioned this pull request Mar 4, 2021

Discussion: benchmark package from NPM offen/underscore-benchmark#3

Closed

jgonggrijp mentioned this pull request Mar 8, 2021

[Tests]: System time tampering effects on _.throttle and _.debounce methods #2913

Merged

2 tasks

jgonggrijp mentioned this pull request Apr 7, 2021

Statistical Functions #2920

Closed

jgonggrijp mentioned this pull request Apr 28, 2021

Clarification on "-x" releases #2926

Closed

jgonggrijp mentioned this pull request Mar 24, 2023

isTrueish documentcloud/underscore-contrib#255

Open

jgonggrijp mentioned this pull request Jul 21, 2023

#3391 - Match trailing slash on root to how root was configured. jashkenas/backbone#4240

Merged

jgonggrijp mentioned this pull request Apr 4, 2024

Are there any plans for the next version release? #2994

Open

jgonggrijp mentioned this pull request Oct 20, 2024

refactoring _.some #2999

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Functional style #2908

Functional style #2908

jgonggrijp commented Feb 7, 2021 •

edited

Loading

coveralls commented Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp Feb 7, 2021

jgonggrijp commented Feb 9, 2021

jgonggrijp commented Mar 4, 2021

jgonggrijp commented Feb 27, 2023

Functional style #2908

Are you sure you want to change the base?

Functional style #2908

Conversation

jgonggrijp commented Feb 7, 2021 • edited Loading

Goals

Principles

Approach

Results (stage 1)

Review

Benchmark preparation

Performance measurements

Final words

coveralls commented Feb 7, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jgonggrijp commented Feb 9, 2021

jgonggrijp commented Mar 4, 2021

jgonggrijp commented Feb 27, 2023

jgonggrijp commented Feb 7, 2021 •

edited

Loading