Reduce micromatch overhead in jest-haste-map HasteFS #10132

lencioni · 2020-06-05T14:00:56Z

While profiling some Jest runs at Airbnb, I noticed that on my
MacBook Pro we can spend over 30 seconds at the end of a run with code
coverage while the coverage reporter adds all of the untested files. I
expect this time to grow as the size of the codebase increases.

The call stacks show this code calling micromatch repeatedly, which
calls picomatch, which builds a regex out of the globs. The parsing
and regex building also seem to trigger the garbage collector
frequently.

Since this is in a tight loop and the globs won't change between
checks, we can greatly improve the performance here by using
micromatch.matcher to build the matcher function once and reuse it
for every check.
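
To illustrate the idea, here is a minimal, self-contained sketch of hoisting glob compilation out of the loop. It is not the code from this PR: `compileGlob` is a hypothetical stand-in that only understands `*` and `**`, and `matchFilesNaive` and `matchFilesPrecompiled` are made-up names. With micromatch, the compile-once step would be a call to `micromatch.matcher(glob)`, which returns a reusable matcher function.

```javascript
// Counts how many times a glob gets compiled, to make the cost visible.
let compileCount = 0;

// Hypothetical stand-in for a real glob compiler (assumption: only `*` and
// `**` are supported). It plays the role of the parse + regex-building work.
function compileGlob(glob) {
  compileCount += 1;
  const source = glob
    // Escape regex metacharacters, leaving `*` and `/` alone.
    .replace(/[.+?^${}()|[\]\\]/g, '\\$&')
    // Translate glob wildcards into regex fragments.
    .replace(/\*\*\/|\*\*|\*/g, token => {
      if (token === '**/') return '(?:.*/)?'; // zero or more directories
      if (token === '**') return '.*'; // anything, including slashes
      return '[^/]*'; // anything within one path segment
    });
  return new RegExp(`^${source}$`);
}

// Slow shape: every check recompiles the glob it is about to test.
function matchFilesNaive(files, globs) {
  return files.filter(file =>
    globs.some(glob => compileGlob(glob).test(file)),
  );
}

// Faster shape: compile each glob once, then reuse the matchers in the loop.
function matchFilesPrecompiled(files, globs) {
  const matchers = globs.map(compileGlob);
  return files.filter(file => matchers.some(matcher => matcher.test(file)));
}

const exampleFiles = ['src/a.js', 'src/x/b.js', 'src/c.ts', 'lib/d.js'];
const exampleGlobs = ['src/**/*.js', 'lib/*.js'];
console.log(matchFilesPrecompiled(exampleFiles, exampleGlobs));
```

In the naive shape the number of compilations scales with the number of files, while the precompiled shape pays for each glob exactly once no matter how many files are checked.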

This optimization reduces this block of time in the profile from about
30s to about 10s. The aggregated total time of the coverage reporter's
onRunComplete goes from 23s to 600ms.

Before:

After:

Summary

Motivation: Improve Jest performance when collecting coverage

Test plan

I ran jest in the Airbnb frontend monorepo with and without coverage options, with a path argument.

thymikee

👍

lencioni · 2020-06-05T14:51:54Z

I think this actually suffers from the same bug I addressed in #10131.

Once that PR lands, I'll extract the code to a shared module and use it in both places.

lencioni · 2020-06-08T15:56:32Z

I'm going to fold this PR into #10131

github-actions · 2021-05-11T06:06:43Z

This pull request has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Please note this issue tracker is not a help forum. We recommend using StackOverflow or our discord channel for questions.

facebook-github-bot added the cla signed label Jun 5, 2020

thymikee approved these changes Jun 5, 2020

lencioni mentioned this pull request Jun 5, 2020

Improve Jest startup time and test runtime, particularly when running with coverage, by caching micromatch and avoiding recreating RegExp instances #10131

Merged

lencioni closed this Jun 8, 2020

github-actions bot locked as resolved and limited conversation to collaborators May 11, 2021
