NBConvert 6.0 Planning #1045

MSeal · 2019-06-09T20:38:01Z

In light of several conversations around larger changes, I wanted to start the thread on 6.0 targets and outline / discuss what changes should or shouldn't be in the next major release. We should make issues for each of the following that we want addressed. Any of the topics below can be questioned and challenged or pushed until later releases. Champions for each one would be appreciated to help move key subjects forward. I'll keep this top ticket updated with links and additional requested items from the thread below.

Move execute preprocessor code into separate thin project
@mpacer first brought up the idea, I think it would greatly help the separation of concerns and let nbconvert focus on conversion rather than execution. Other projects (i.g. papermill) that just want execution ability could import this new module rather than the much larger dependency chain of nbconvert.
Python 3 only
Remove, or at a minimum not support, python 2 moving forward. Seems straight forward enough for the major version upgrade.
Directory based template loading, following viola and other jupyter project patterns.
First part for adding additional paths will be in make a default global location for custom user templates #1028 . It's intended to make templates a directory with resources and multiple asset files rather than a single template file with the jump to 6.0.
Eliminating the majority of usability concerns with latex, pdf, and html conversion.
This is a tall order, and we've been trying to get improvements made for the major latex / pdf issues, but there's a lot of these and I'm not sure we'll have them closed for the last 5.x release. Will probably require some significant improvements to how templates are loaded, executed, and tested to get improvements to more production quality levels.
- [ ] Deprecate / remove unnecessary or complicated configuration options
We have a lot of configuration options, some of them are old or could be unnecessary now. I'd like to prune options and figure out how templating improvements could help where flags were used as crutches in the past. Not sure what the full scope could be here for this.
- [ ] Consider lightening or removing traitlets
Not going to be a popular topic, but traitlets causes a lot of support burden and debugging issues. It's also been cited as the part that's kept many strong developers I know from contributing because it's complicated and difficult to understand how it operates without a lot of digging. I don't expect this topic to make it in for 6.0 but I thought I might at least spark a conversation on the topic to see how we could make the repo more contributor friendly.

There's probably some more I'm not thinking of atm. I'll add items from the thread that seem relevant.

t-makaro · 2019-06-10T03:08:39Z

Move execute preprocessor code into separate thin project

I agree. It's weird that a conversion utility deals with execution, but we should keep the --execute flag in some fashion even if deprecated. We could reference the new package. It would be nice to have them developed independently.

Python 3 only

Yes please. Note: Python 2 notebooks will still be able to be converted. Nbconvert itself will just need python 3 to run.

Eliminating the majority of usability concerns with latex, pdf, and html conversion.

This will have a more clear path forward after I do issue triaging.

Deprecate / remove unnecessary or complicated configuration options

I'd be willing to help audit some of the configuration options. We could definitely do with some pruning. I'd also like to re-write some of the docs to make the python API and configuration options more clear. This would also make the repo more contributor friendly.

Consider lightening or removing traitlets

As an example of bugs: #774. That said. Traitlets and python based config files really do allow for a lot awesome config. Refactoring and lightening would be appreciated. I feel like a big problem with traitlets is the documentation. I had to look through the code to find traitlets.config.loaders.PyFileConfigLoader in order to load the nbconvert config file whilst using the python API recently since I wasn't using NbConvertApp.

Should we do 5.6 release for some of the fixes that aren't API breaking? It would also be nice to have milestones for "must be in >6.0" and "could be released before 6.0". It would help with triaging issues.

t-makaro · 2019-06-10T04:19:17Z

Note on traitlets. It actually helped me get into contributing to nbconvert when I was making #889 once I realised that all I had to do was add 1 line to make something easily configurable. I really think the limitation is in documentation more than anything.

MSeal · 2019-06-12T06:06:56Z

Don't get me wrong that the ability to configure is really nice -- it's the debugging it when it goes wrong (which I just spent a good 5 hours doing so last week) is where I find myself questioning if we need something so complicated.

MSeal · 2019-06-12T06:08:17Z

For the tagging I say we add anything that should be > 6.0 as Milestone 6.0 for now. Anything not milestoned could be before or after.

t-makaro · 2019-08-02T17:03:02Z

Can we consider making notebook the default export format for nbconvert in 6.0? This allows

jupyter nbconvert notebook.ipynb --SomePreprocessor.enabled=True

to be the command for notebook -> notebook operations. This way if people just want to apply preprocessor operations without changing to a different format it doesn't need an explicit --to notebook. This seems like a less arbitrary default than html.

MSeal · 2019-08-06T20:33:13Z

Can we consider making notebook the default export format for nbconvert in 6.0?

I'm +1 for this

takluyver · 2019-11-02T17:55:58Z

Python 3 only

👍

Consider lightening or removing traitlets

I certainly think traitlets are an attractive nuisance. They're a whole pile of complexity which is, I believe, largely unnecessary for this kind of application (I believe traits were originally designed to support making GUIs quickly). And they tie your config to class names, making refactoring a pain.

The one thing in their favour is that it's really easy to add a new config option. But, I'd argue, it's kind of too easy. You wind up making everything configurable because why wouldn't you add config=True, but then two config options don't play well together, or some future change becomes 5x more complex because you have to accommodate a config option just in case someone is relying on it.

OK, enough ranting.

Can we consider making notebook the default export format for nbconvert in 6.0?

Maybe with a warning, in particular if there are no preprocessors configured? I suspect there are a lot of people who are used to not needing --to html to get HTML output.

If I can suggest one further idea: drop postprocessors as a general category? There's only one (serve), and that only really makes sense for one specific format (slides). Maybe there should be a separate little tool that converts a notebook to reveal slides and serves them?

takluyver · 2019-11-03T21:49:47Z

Move execute preprocessor code into separate thin project

I think this could make sense for several of the generic preprocessors which manipulate the notebook data, e.g. removing cells, clearing metadata. Like with execution, there's no real reason that they need to depend on the conversion machinery.

Maybe they should live together in a package called something like nbdo, to avoid creating N separate tiny packages to do a single function each. Then again, @kynan's nbstripout is already a standalone thing - though that also includes functionality for integrating it with git.

MSeal · 2019-11-04T18:18:25Z

If I can suggest one further idea: drop postprocessors as a general category? There's only one (serve), and that only really makes sense for one specific format (slides). Maybe there should be a separate little tool that converts a notebook to reveal slides and serves them?

I'd be down for dropping the postprocessors and just changing preprocessors to processors. I found the naming and convention therein super confusing when I first joined and I believe the confusion is partially keeping people from contributing more by not understanding the model. Let's make a new issue for this idea and copy the notes from here.

I think this could make sense for several of the generic preprocessors which manipulate the notebook data, e.g. removing cells, clearing metadata. Like with execution, there's no real reason that they need to depend on the conversion machinery.

We were thinking the execute preprocessor should have it's own isolated package because papermill, nbconvert, and possibly a future async client library would all make use of just that dependency. Currently papermill pulls a lot of extra dependencies it doesn't need to get nbconvert installed. Would other preprocessors you've read have other direct importers that wouldn't want the rest of nbconvert? I'd be inclined to leave processors that are only used by nbconvert in the library, as we already have a lot of problems with users having mismatched dependency versions that don't work together. Would like to iterate on the ideas more here.

takluyver · 2019-11-05T07:57:40Z

Would other preprocessors you've read have other direct importers that wouldn't want the rest of nbconvert?

nbstripout is doing something a lot like the clearoutput and clearmetadata preprocessors, although they're trivial enough that it reimplements the functionality. I think it's easy to imagine preprocessors like regexremove and tagremove being used to modify notebooks without converting them, e.g. #1132.

This is really going one step further with @t-makaro's idea to make notebook the default output format. Any time you want to nbconvert --to notebook, you're only using the preprocessors, not the conversion machinery. And it's the conversion machinery that has all the dependencies.

takluyver · 2019-11-05T07:59:58Z

Brainstorming: would it make sense to specify a kind of in-memory pipeline, something like this:

nbdo clearoutput -- removecells --tag debug -- execute --timeout 60 -- convert --to html

MSeal · 2019-11-06T03:59:33Z

I think chaining processors would be a good move. It would make the ordering of conversions clear and allow for one call to do all conversions. It also makes pre vs post processor not matter anymore.

SylvainCorlay · 2019-11-20T14:15:55Z

I would be in favor of tagging alpha releases before we check all the boxes here.

As soon as we have it, we will start targetting the alpha in a development branch of Voilà and other places where we consume the new template system.

MSeal · 2019-11-20T18:04:11Z

I can look this weekend at any remaining python 3-only items and kick off an alpha release to that effort. @takluyver anything you're aware of that we should close before doing so?

takluyver · 2019-11-20T20:18:36Z

I haven't been following closely, but I think you can make alpha releases whenever you think it's useful. Just be aware that anyone running pip install --pre might pick it up - they don't need to specify the precise version.

MSeal · 2019-11-24T19:13:01Z

@SylvainCorlay pre-release 6.0.0a0 is now on PyPI. I tested that it's only getting picked up by pip install --pre nbconvert. Normal nbconvert installs will still pick up 5.6.1. I also made sure python 2 isn't picking up the new pre-release.

takluyver · 2019-11-24T21:58:21Z

Unfortunately notebook tests have picked up the pre-release, and are failing with an error about not finding templates:

Traceback (most recent call last):
  File "/home/travis/build/jupyter/notebook/notebook/nbconvert/handlers.py", line 132, in get
    resources=resource_dict
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/html.py", line 99, in from_notebook_node
    self.register_filter('highlight_code', highlight_code)
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 418, in register_filter
    return self._register_filter(self.environment, name, jinja_filter)
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 154, in environment
    self._environment_cached = self._create_environment()
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 436, in _create_environment
    paths = self.get_template_paths()
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 465, in get_template_paths
    template_names = self.get_template_names()
  File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 512, in get_template_names
    raise ValueError('No template sub-directory with name %r found in the following paths:\n\t%s' % (template_name, paths))
ValueError: No template sub-directory with name 'classic' found in the following paths:
	/tmp/tmpjcjzbt1f/data
	/tmp/tmpjcjzbt1f/env/share/jupyter
	/tmp/tmpjcjzbt1f/share/jupyter

https://travis-ci.org/jupyter/notebook/jobs/616385738

MSeal · 2019-11-24T22:07:42Z

I'm not sure that's unfortunate or not. Is the build purposefully pulling any (not released) dependency versions? It does tells us where we need to prep fixes for 6.0 branch. I am guessing that code needs to be 5.x/6.0 compatible.

…

On Sun, 24 Nov 2019, 1:58 pm Thomas Kluyver, ***@***.***> wrote: Unfortunately notebook tests have picked up the pre-release, and are failing with an error about not finding templates: Traceback (most recent call last): File "/home/travis/build/jupyter/notebook/notebook/nbconvert/handlers.py", line 132, in get resources=resource_dict File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/html.py", line 99, in from_notebook_node self.register_filter('highlight_code', highlight_code) File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 418, in register_filter return self._register_filter(self.environment, name, jinja_filter) File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 154, in environment self._environment_cached = self._create_environment() File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 436, in _create_environment paths = self.get_template_paths() File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 465, in get_template_paths template_names = self.get_template_names() File "/home/travis/virtualenv/python3.7.1/lib/python3.7/site-packages/nbconvert/exporters/templateexporter.py", line 512, in get_template_names raise ValueError('No template sub-directory with name %r found in the following paths:\n\t%s' % (template_name, paths)) ValueError: No template sub-directory with name 'classic' found in the following paths: /tmp/tmpjcjzbt1f/data /tmp/tmpjcjzbt1f/env/share/jupyter /tmp/tmpjcjzbt1f/share/jupyter https://travis-ci.org/jupyter/notebook/jobs/616385738 — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <#1045?email_source=notifications&email_token=AAOHBVA5EQQ77VDU7LPECATQVL2H5A5CNFSM4HWOEWPKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEFAV5KQ#issuecomment-557932202>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAOHBVFKAQKVLLAJDE5LLTTQVL2H5ANCNFSM4HWOEWPA> .

takluyver · 2019-11-25T10:14:57Z

It deliberately uses pip install --pre, presumably to get advance warning of issues with dependencies. So as you say, it's fortunate that it could get picked up quickly.

I'm guessing the issue is that the template files are now found using the 'jupyter path' search path, but the notebook tests monkeypatch this to isolate themselves from the user's local setup.

Certainly the notebook code and test suite needs to remain compatible with nbconvert 5.x at least until nbconvert 6 is released (and really for a while afterwards too).

maartenbreddels · 2019-11-29T15:18:40Z

The notebook issue only seemed to be test run issue: jupyter/notebook#5092 jupyter/notebook#5086.
So far it seems solid.

I'm now working on voila and many of the templates, so far so good.

MSeal · 2020-07-02T18:54:31Z

Alright, that works

SylvainCorlay · 2020-07-07T13:53:33Z

FYI, I pushed another quiet alpha (6.0.0a4), so that we can start testing the latest changes in voila-reveal.

MSeal · 2020-07-07T16:34:40Z

Sounds good, I think some rapid alpha releases to support testing the last changes is a good idea. Btw in helping with #1292 I think I got most of the "backwards compatability / warnings for old template patterns" done if you wanted to take a look later.

MSeal · 2020-08-24T19:14:02Z

I am thinking of kicking off a beta release and setting an explicit date for 6.0 official release. I'd prefer to do the full release in a week or two (at most).

The only things I have that I am queuing to complete are:

Changed clearmetadata preprocessor to remove all metadata by default #1314
Add flag to convert Config options warnings as sphinx warnings #1350 (if possible before a release, not going to block on it)
Update README to reflect Also drop 3.6 ? #1348 (comment)

There's a number of bugs still left in https://github.com/jupyter/nbconvert/issues?q=is%3Aopen+is%3Aissue+milestone%3A6.0 but nothing I'd consider blocking a release. I am going to post to discord about seeing if anyone that's not a common contributor is open to helping wrap those up before a release goes out.

What are folks thoughts on kicking off a beta release now and a full release next ~weekend? Is there anything else that needs to be actively addressed @t-makaro @maartenbreddels @SylvainCorlay ?

MSeal · 2020-08-24T19:33:12Z

Oh and #582 perhaps if we want to take an action around PostProcessors or not.

MSeal · 2020-08-24T22:18:14Z

I kicked off a 6.0.0b7 release for the beta and am going to announce it on the discourse so people are aware / ask for interested parties to contribute any of the open bug fixes they want in before the final release.

SylvainCorlay · 2020-08-26T08:08:14Z

6.0.0b7

why not b0?

SylvainCorlay · 2020-08-26T13:03:01Z

I think it is really weird to release the first beta with b7. I think that we should delete that release and make a b0 available before communicating about it. @MSeal are you ok with me going ahead with this?

MSeal · 2020-08-26T13:40:11Z

It was the natural progression of pre-release numbering from a build number perspective after a6. It also showed that we had several prior builds, and the number doesn't really matter as it's not semver anyway. I'd prefer we just leave it to avoid confusing people who have already installed the latest pre-release because pip tooling doesn't do great with installing when you go backwards and not forwards with release progression alphanumerically.

SylvainCorlay · 2020-08-26T14:03:33Z

OK, sounds good.

SylvainCorlay · 2020-09-07T16:59:52Z

Nbconvert 6.0.0rc0 is out with the latest changelog. I sent another round of emails to downstream maintainers about it.

SylvainCorlay · 2020-09-07T17:02:08Z

The only things I have that I am queuing to complete are:

#1314
#1350 (if possible before a release, not going to block on it)

If you don't have time to tackle this, I think it would be great to table it to the next release. We are blocking on nbconvert 6.0 for quite some time, and the more we wait, the more difficult it becomes...

Update README to reflect #1348 (comment)

I opened a PR to the readme to add the comment

MSeal · 2020-09-07T17:25:03Z

@SylvainCorlay That's fair, I was just going to take some time today to see if I could get it resolved. I agree and would like to release 6.0 this week. Give me a couple hours and I should have that PR updated, but I agree that we don't need to block on it.

SylvainCorlay · 2020-09-07T18:04:24Z

Great! let's aim for tomorrow (Tuesday)!

joelostblom · 2020-09-08T21:57:28Z

Excited for 6.0, thanks for working on this release!

I don't want to delay, I just want to raise awareness about #1358. If you can reproduce that issue, it might be something to consider fixing before 6.0 because it blocks HTML export, if you cannot reproduce I guess it is something local although I have tried it on two different laptops and had it reported by several of my students running a6 (on Windows).

I just tried with the rc0 zip and the same problem is there. In essence, I create a new conda env with only jupyterlab (and deps) installed, HTML export works fine until I do pip install https://github.com/jupyter/nbconvert/archive/6.0.0rc0.zip, then I encounter the issue reported in #1358.

It seems like this is only an issue with installing nbconvert zip or tar.gz files from GitHub, not directly from a commit, so likely won't be a problem for the release. Sorry for the noise.

MSeal · 2020-09-08T22:22:25Z

Np, thanks for reporting. I'll follow up on that oddity in #1358 but I agree I'm not hitting a blocking issue for release.

MSeal · 2020-09-08T22:58:41Z

@SylvainCorlay Since we're in fairly different timezones how do you want to coordinate the release itself? I did a PR using the release automation tooling to capture the remaining authors for this release: #1363

I'm happy with either of us running the release after that merges.

mgeier · 2020-09-09T07:45:28Z

What about #1347?
This still seems to be broken, right?

MSeal · 2020-09-09T07:53:27Z

Ahh we should check that and see if it's an issue or not (I won't have any time tomorrow to do that)

mgeier · 2020-09-09T07:59:23Z

There is still an error in the example notebook mentioned in #1339 (see also #1292 (comment)).

This doesn't yet show up as a CI failure (see #1350), but I think the notebook itself (or whatever is causing the error) should be fixed before 6.0 because the failure will be visible in the docs, where the example output is much too long: https://nbconvert.readthedocs.io/en/latest/nbconvert_library.html#Example

mgeier · 2020-09-09T08:17:17Z

@MSeal

we should check that and see if it's an issue or not

I've tried running the notebook docs/source/customizing.ipynb (which has been removed in #1354) and it still produces the errors mentioned in #1339.

The usage in this notebook might be wrong, though, I don't know.

SylvainCorlay · 2020-09-09T18:44:47Z

There is still an error in the example notebook mentioned in #1339 (see also #1292 (comment)).

This is now fixed!

The usage in this notebook might be wrong, though, I don't know.

Yes, I think this was outdated.

SylvainCorlay · 2020-09-09T19:02:16Z

@MSeal @maartenbreddels I fixed the last issue reported by @mgeier.

There is the question of #1347, which is about supporting the cwd as a template path. I am not sure we should do this though.

SylvainCorlay · 2020-09-09T21:29:52Z

OK, I added a comment about #1347.

If nobody objects by then, I will tag 6.0.0 tomorrow morning (CET)!

MSeal · 2020-09-10T01:01:27Z

I think we're good for a release, thanks for offering to handle shipping @SylvainCorlay. If minor things come up after let's just be more proactive with patch releases with fixes than we were with 5.x.

SylvainCorlay · 2020-09-10T08:19:14Z

Nbconvert 6.0 is out! 🎉

The conda-forge PR should follow shortly.

Closing this issue!

MSeal · 2020-09-10T16:34:34Z

I can post on discourse later today or tomorrow about the event. Awesome work everyone and thanks for getting 6.0 across the finish line

MSeal added this to the 6.0 milestone Jun 9, 2019

MSeal mentioned this issue Aug 23, 2019

Async / asynchronous execution #1092

Open

MSeal mentioned this issue Oct 17, 2019

5.6.1 Release #1117

Closed

MSeal mentioned this issue Oct 25, 2019

NBConvert 6.0 Changes sphinx-doc/sphinx#6773

Closed

takluyver mentioned this issue Nov 4, 2019

Require Python 3 #1135

Merged

takluyver mentioned this issue Nov 8, 2019

Per NB nbconvert configuration in metadata? #1139

Open

chrisjsewell mentioned this issue Nov 9, 2019

support nbconvert version 6 when released chrisjsewell/ipypublish#108

Open

MSeal mentioned this issue Nov 19, 2019

nbconvert 6.0 alpha release #1144

Closed

maartenbreddels mentioned this issue Nov 29, 2019

test(fix): make tests run with nbconvert >=6,<7 jupyter/notebook#5092

Closed

MSeal mentioned this issue Dec 30, 2019

Use new subset of nbconvert (new package) nteract/papermill#462

Closed

MSeal self-assigned this Sep 9, 2020

SylvainCorlay closed this as completed Sep 10, 2020

chrisjsewell mentioned this issue Sep 27, 2020

Improve handling of stderr formatting executablebooks/MyST-NB#264

Open

NBConvert 6.0 Planning #1045

NBConvert 6.0 Planning #1045

Comments

MSeal commented Jun 9, 2019 • edited Loading

t-makaro commented Jun 10, 2019

t-makaro commented Jun 10, 2019

MSeal commented Jun 12, 2019

MSeal commented Jun 12, 2019

t-makaro commented Aug 2, 2019

MSeal commented Aug 6, 2019

takluyver commented Nov 2, 2019

takluyver commented Nov 3, 2019

MSeal commented Nov 4, 2019

takluyver commented Nov 5, 2019

takluyver commented Nov 5, 2019

MSeal commented Nov 6, 2019

SylvainCorlay commented Nov 20, 2019

MSeal commented Nov 20, 2019

takluyver commented Nov 20, 2019

MSeal commented Nov 24, 2019

takluyver commented Nov 24, 2019

MSeal commented Nov 24, 2019 via email

takluyver commented Nov 25, 2019

maartenbreddels commented Nov 29, 2019

MSeal commented Jul 2, 2020

SylvainCorlay commented Jul 7, 2020

MSeal commented Jul 7, 2020

MSeal commented Aug 24, 2020

MSeal commented Aug 24, 2020

MSeal commented Aug 24, 2020

SylvainCorlay commented Aug 26, 2020

SylvainCorlay commented Aug 26, 2020

MSeal commented Aug 26, 2020

SylvainCorlay commented Aug 26, 2020

SylvainCorlay commented Sep 7, 2020

SylvainCorlay commented Sep 7, 2020

MSeal commented Sep 7, 2020

SylvainCorlay commented Sep 7, 2020

joelostblom commented Sep 8, 2020 • edited Loading

MSeal commented Sep 8, 2020

MSeal commented Sep 8, 2020

mgeier commented Sep 9, 2020

MSeal commented Sep 9, 2020

mgeier commented Sep 9, 2020

mgeier commented Sep 9, 2020

SylvainCorlay commented Sep 9, 2020

SylvainCorlay commented Sep 9, 2020

SylvainCorlay commented Sep 9, 2020

MSeal commented Sep 10, 2020

SylvainCorlay commented Sep 10, 2020

MSeal commented Sep 10, 2020

MSeal commented Jun 9, 2019 •

edited

Loading

joelostblom commented Sep 8, 2020 •

edited

Loading