Switch to split markdown parser #3048

MDeiml · 2022-06-22T13:10:52Z

My new split markdown parser works quite well now (better than the non-split one). This PR changes the config to point at the new branch because I want to keep the old branch for now as other projects are using it.

It also adds some other changes that are needed for the two parsers to work:

- An exclude_children! directive, which might also be very useful for something like "Injected markdown incorrectly highlights indented docstrings" (#2212)
- Split the markdown queries into block and inline ones, and add the injection of the inline grammar into the block grammar
- Add an include_dir option to parser configs (needed because the two grammars don't live in the repo's root directory)
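
To make the idea behind exclude_children! concrete, here is a minimal Python sketch of the range arithmetic it implies: take a node's byte range and cut out the ranges of its children, so that only the remaining pieces are handed to the injected parser. This is not the directive's actual implementation. The function name, the half-open byte ranges, and the offsets in the usage example are assumptions made for illustration.

```python
from typing import List, Tuple

Range = Tuple[int, int]  # half-open byte range [start, end); an assumed convention


def exclude_children(parent: Range, children: List[Range]) -> List[Range]:
    """Return the parts of `parent` that are not covered by any child range."""
    start, end = parent
    result: List[Range] = []
    cursor = start  # first byte of the parent not yet accounted for
    for c_start, c_end in sorted(children):
        # Clamp the child to the parent so stray ranges cannot widen the result.
        c_start = max(c_start, start)
        c_end = min(c_end, end)
        if c_end <= c_start:
            continue  # child is empty or lies entirely outside the parent
        if c_start > cursor:
            result.append((cursor, c_start))  # gap before this child
        cursor = max(cursor, c_end)
    if cursor < end:
        result.append((cursor, end))  # tail after the last child
    return result


# Hypothetical offsets for the text "> *Foo\n> Bar*": the inline content
# spans bytes 2..13 and the second "> " marker (bytes 7..9) is a child.
print(exclude_children((2, 13), [(7, 9)]))  # -> [(2, 7), (9, 13)]
```

The injected parser would then only see the two remaining pieces, which is why the block quote marker on the second line never reaches the inline grammar.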

clason · 2022-06-22T16:16:04Z

Do you want to add yourself as a maintainer so people know whom to ping when things go wrong?

MDeiml · 2022-06-23T14:52:50Z

Sure, good idea.

clason · 2022-06-23T16:41:45Z

Also, please don't forget to mark this as a breaking change (feat(markdown)!: switch to split parser) as this will invalidate downstream queries.

MDeiml · 2022-06-24T07:30:40Z

I also noticed a bug (which should probably be solved separately from this PR): a node from an injected language that is used for highlighting can have a range that includes bytes lying outside of the included ranges. For example, with this input:

> *Foo
> Bar*

The emphasis spans over the second >, but that one is not part of the included range for the inline content. It is still highlighted as emphasis.

We would need to calculate the intersection between included range and node range for every node used in an injected language and use that instead.
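
The intersection described above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not code from nvim-treesitter or Neovim. The function name, the half-open byte ranges, and the concrete offsets (taken from the text "> *Foo\n> Bar*") are assumptions.

```python
from typing import List, Tuple

Range = Tuple[int, int]  # half-open byte range [start, end); an assumed convention


def clip_to_included(node: Range, included: List[Range]) -> List[Range]:
    """Intersect a node's range with every included range of the injection."""
    n_start, n_end = node
    pieces: List[Range] = []
    for i_start, i_end in included:
        lo = max(n_start, i_start)
        hi = min(n_end, i_end)
        if lo < hi:  # keep only non-empty overlaps
            pieces.append((lo, hi))
    return pieces


# Assumed layout: the inline content covers bytes 2..7 and 9..13, while the
# emphasis node reported by the inline parser spans bytes 2..13.
included = [(2, 7), (9, 13)]
emphasis = (2, 13)
print(clip_to_included(emphasis, included))  # -> [(2, 7), (9, 13)]
```

Highlighting the clipped pieces instead of the raw node range would leave the second > (bytes 7..9 in this layout) unstyled.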

theHamsta · 2022-06-24T10:37:25Z

@MDeiml wow, awesome that we now have an exclude_children! directive. We can of course merge it. But I'm wondering whether we should just make it the default, as this is the way upstream tree-sitter behaves (unless "injection.combined" is set, see https://tree-sitter.github.io/tree-sitter/syntax-highlighting#language-injection).

MDeiml · 2022-06-24T12:38:13Z

I also think excluding children is the better default, but making this the default would probably need changes (and some discussion) in neovim itself. Maybe it's better to first test this directive here and later make a PR to neovim?

kyazdani42 · 2022-06-25 (review)

looks good to me otherwise :)

theHamsta · 2022-06-25T17:26:14Z

~~I'm having similar performance issues as with the old parser. Was the intention of the split to improve performance?~~

False alarm. I accidentally used the old markdown parser (it was left installed as an artifact of testing a different PR at an alternative install location). It still seems to be slow with markdown_inline, but that might also be due to our injection implementation, which does no incremental parsing.

MDeiml · 2022-06-26T14:35:34Z

It's a bit faster, but still slow. I think parsing time after edits is still linear in the file size (it should be linear in the size of the edit), because all inline ranges with more complicated elements like emphasis or code spans still get reparsed. This could be fixed by detecting which inline ranges actually got edited and then just reusing the old trees for all the other ones. But that is something that tree-sitter itself should be able to detect, so maybe I'm going to create a PR for that in tree-sitter.
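
A rough Python sketch of the "detect which inline ranges actually got edited" step follows. It is purely hypothetical and uses no tree-sitter API: the function name, the half-open byte ranges, and the edit triple (start, old end, new end) are assumptions. It also ignores that an edit may change the block structure itself, which would move or create inline ranges.

```python
from typing import List, Tuple

Range = Tuple[int, int]  # half-open byte range [start, end); an assumed convention


def split_by_edit(
    ranges: List[Range], edit_start: int, old_end: int, new_end: int
) -> Tuple[List[Tuple[int, Range]], List[int]]:
    """Classify inline ranges as reusable (with shifted offsets) or dirty.

    Bytes edit_start..old_end of the old text were replaced, and the
    replacement ends at new_end in the new text.
    """
    delta = new_end - old_end  # how far everything after the edit moves
    reusable: List[Tuple[int, Range]] = []  # (index, range in new coordinates)
    dirty: List[int] = []  # indices of ranges that need to be reparsed
    for i, (start, end) in enumerate(ranges):
        if end < edit_start:
            reusable.append((i, (start, end)))  # entirely before the edit
        elif start > old_end:
            reusable.append((i, (start + delta, end + delta)))  # after: shift
        else:
            dirty.append(i)  # overlaps or touches the edit, be conservative
    return reusable, dirty


# Three inline ranges; bytes 22..25 are replaced by five new bytes.
reusable, dirty = split_by_edit([(0, 10), (20, 30), (40, 50)], 22, 25, 27)
print(reusable)  # -> [(0, (0, 10)), (2, (42, 52))]
print(dirty)     # -> [1]
```

With a classification like this, only the dirty ranges would be reparsed, so the cost would depend on the size of the edit rather than on the number of inline ranges in the file.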

clason · 2022-06-26T14:51:27Z

Just on the off-chance: will tree-sitter/tree-sitter#1783 be of any help?

MDeiml · 2022-06-26T15:48:50Z

I don't think so. There is no "syntactically wrong markdown", so error recovery is not relevant.

- adding back nvim-markdown for syntax highlighting while queries downstream get updated to support the markdown split for markdown_inline - REF: nvim-treesitter/nvim-treesitter#3048

clason requested review from theHamsta and kyazdani42 June 22, 2022 16:15

ghishadow mentioned this pull request Jun 23, 2022

Markdown Highlighting Broken lapce/lapce#605

Closed

theHamsta reviewed Jun 24, 2022: lua/nvim-treesitter/parsers.lua (outdated)

clason reviewed Jun 24, 2022: lockfile.json (outdated)

MDeiml force-pushed the use_split_markdown_grammar branch from a462a34 to b35b32c, June 24, 2022 15:48

theHamsta reviewed Jun 24, 2022: lua/nvim-treesitter/parsers.lua

theHamsta reviewed Jun 24, 2022: lua/nvim-treesitter/parsers.lua

MDeiml force-pushed the use_split_markdown_grammar branch from b35b32c to bf1ebcf, June 24, 2022 20:04

kyazdani42 approved these changes Jun 25, 2022: queries/markdown_inline/highlights.scm (outdated)

theHamsta reviewed Jun 25, 2022: lua/nvim-treesitter/parsers.lua (outdated)

clason closed this Jun 25, 2022

clason reopened this Jun 25, 2022

MDeiml force-pushed the use_split_markdown_grammar branch from 3d4058a to 34ae4ac, June 25, 2022 16:53

theHamsta reviewed Jun 25, 2022: lockfile.json (outdated)

MDeiml added 3 commits June 26, 2022 16:29

- Switch to split markdown parser (5e6e008)
- Fix luastyle lint (6c2c806)
- Add myself as maintainer for markdown (e795da2)

MDeiml force-pushed the use_split_markdown_grammar branch from 34ae4ac to e795da2, June 26, 2022 14:29

clason merged commit 002084b into nvim-treesitter:master Jun 26, 2022

clason mentioned this pull request Jun 26, 2022

Notice of Breaking Changes #2293

Open

numToStr mentioned this pull request Jun 27, 2022

[Markdown] E13: File exists (add ! to override) #3074

Closed

ranebrown mentioned this pull request Jun 29, 2022

Update queries for split markdown parser lewis6991/spellsitter.nvim#75

Closed

lewis6991 mentioned this pull request Sep 26, 2022

spelllang lewis6991/nvim-treesitter#4

Closed

David-Else mentioned this pull request Oct 7, 2022

Slow Markdown Highlighting helix-editor/helix#4139

Closed

lewis6991 mentioned this pull request Oct 18, 2022

allow non-registered langs to use modules lewis6991/nvim-treesitter#5

Closed
