Support type annotations (PEP 484) #196

rth · 2019-01-13T17:27:23Z

With support for Python 2.7 being dropped in a number of projects, some might consider using type annotations from PEP 484.

In particular, see discussions in https://github.com/numpy/numpy-stubs, pandas-dev/pandas#14468 scipy/scipy#9038 scikit-learn/scikit-learn#11170

A recurrent issue when using type annotations is that one has then to keep consistency between types in the docstring and type annotations which is extra work and can be tedious.

It would be great if numpydoc could take the type information from annotations. For instance,

def func(x):
    """Some function

    Parameters
    ----------
    x : int
        Description of parameter `x`.
    """

could optionally be written as,

def func(x: int):
    """Some function

    Parameters
    ----------
    x
        Description of parameter `x`.
    """

and generate the same HTML.

To keep backward compatibility this mechanism could be enabled with an optional config flag, and only be used if the type is not provided in the docstring.

For docstring formatting without numpydoc this functionality is implemented in sphinx-autodoc-typehints (that also works with sphinx.napoleon). So maybe integration with that extension could be a possibility.

This would also have the added benefit that type annotations, unlike docstrings types, can be programmatically checked for consistency (e.g. with mypy)..

jorisvandenbossche · 2019-01-14T16:02:41Z

Interesting topic! Two remarks:

When using such an "inject by sphinx" approach, that would mean that the docstring is not updated with the typing information in interactive sessions?
For me, typing annotations are more oriented to developers (and their tools) / advanced users, and not that newbie friendly (eg I find a type description of str or float more friendly than Union[str, float])

(It might be something each project needs to discuss and decide for themselves if they find those items important or not, but still, interested to see what others think)

jnothman · 2019-01-14T23:42:08Z

I'm sure if we wanted, we could find a way to inject types into docstrings at module load time, making sure the types are available in pydoc etc. Apart from being magic, this would entail a runtime dependency on numpydoc (but not necessarily sphinx)

rth · 2019-01-15T10:46:53Z

When using such an "inject by sphinx" approach, that would mean that the docstring is not updated with the typing information in interactive sessions?

As Joel said it's possible, but a broader question is whether we want to repeat types in the docstring if they are already shown in the function signature. For instance, if we take modern typed languages, Rust docstrings don't repeat that information neither does say C++ xtensor project (which seem to use sphinx BTW).

Having this issue closed, with "it's a better practice not to repeat type information in the docstring" is also a possibility. I was just interested in some feedback about this in general.

For me, typing annotations are more oriented to developers (and their tools) / advanced users, and not that newbie friendly (eg I find a type description of str or float more friendly than Union[str, float])

I agree. Transforming simple types to a more human-readable version in the docstring is a possibility, though the questions is what to do with more complex types (it could be possible to limit the deph say show Dict[str, Tuple[int]] as just dict.

rth · 2019-10-10T14:45:33Z

This actually seems to be already implemented in napoleon https://github.com/sphinx-doc/sphinx/blob/master/doc/usage/extensions/napoleon.rst#type-annotations

Will try to investigate how to add something similar to numpydoc..

MartinThoma · 2020-01-30T07:17:03Z

Instead of

def func(x: int):
    """Some function

    Parameters
    ----------
    x
        Description of parameter `x`.
    """

I would love

def func(x: int):
    """Some function

    Parameters
    ----------
    x : Description of parameter `x`.
    """

rth · 2020-03-30T13:47:54Z

An alternative approach mentioned by @ogrisel today was to have some migration tool that would take code with type annotations and update the docstring types in the corresponding python files .

If it happens I also think that numpydoc could be a good place for it. Somewhat related to docsting validation, but with inline modification of the code.

MartinThoma · 2020-03-30T14:26:15Z

@rth Do you happen to know a tool which can do that?

I had a look at https://github.com/dadadel/pyment in February, but it had a couple of issues (e.g. dadadel/pyment#84 ) which made it unusable for me.

jnothman · 2020-03-31T20:06:40Z

I had understood @ogrisel to be suggesting a tool which would check compatibility between the docstring type and the annotation, rather than risk automatic generation.

darcamo · 2020-04-01T21:59:32Z

A tool that checks compatibility would be great. Then one could easily use it as a pre-commit hook, for instance.

wwuck · 2020-06-12T04:34:43Z

A tool that checks compatibility would be great. Then one could easily use it as a pre-commit hook, for instance.

Have you looked at darglint? It looks like it will do that.

darcamo · 2020-06-12T20:07:57Z

I just checked darglint. It is exactly what we need for compatibility checking. The only problem is that it is a bit slow, at least for now.

cbrnr · 2021-08-11T08:47:23Z

Instead of a tool that checks if type annotations and types in docstrings are compatible, direct support for type annotations in numpydoc would be much better IMO. I'd love to be able to write a function as mentioned in the initial post. I think type annotations can also be useful for users (and not only developers), and if user-friendliness is an issue I like the suggestion to convert certain annotations to something more readable.

TitaniumHocker · 2022-12-19T12:07:57Z

Any updates on this?

HealthyPear · 2023-03-13T17:22:42Z

I think I also hit this roadblock.

I have something like,

def load_config_file(self, path: Union[str, pathlib.Path]) -> None:
        """
        Load a configuration file in one of the supported formats, and merge it with
        the current config if it exists.

        Parameters
        ----------
        path: Union[str, pathlib.Path]
            config file to load. [yaml, toml, json, py] formats are supported
        """

then I use the numpydoc validator via sphinx-build and I get

pyswgo.core.tool.Tool.load_config_file:PR01:Parameters {'path'} not documented
pyswgo.core.tool.Tool.load_config_file:PR02:Unknown parameters {'pathlib.Path]', 'path: Union[str'}
pyswgo.core.tool.Tool.load_config_file:PR10:Parameter "path" requires a space before the colon separating the parameter name and type
pyswgo.core.tool.Tool.load_config_file:PR08:Parameter "path: Union[str" description should start with a capital letter
pyswgo.core.tool.Tool.load_config_file:PR09:Parameter "path: Union[str" description should finish with "."
pyswgo.core.tool.Tool.load_config_file:PR04:Parameter "pathlib.Path]" has no type
pyswgo.core.tool.Tool.load_config_file:PR08:Parameter "pathlib.Path]" description should start with a capital letter
pyswgo.core.tool.Tool.load_config_file:PR09:Parameter "pathlib.Path]" description should finish with "."

I am not sure what would be the best approach for you, but as a user I would at least like that this worked in a way that doesn't force me to exclude the associated validation checks codes as they are relevant for non-type-annotated functions.

I feel that a temporary solution numpydoc should be to at least recognize type annotations and have specific codes so you as developers can continue to think about how to properly address this and the user can safely disable any warning and still be sure that the rest of the API can be validated in a stable way.

yarikoptic · 2023-03-29T14:28:16Z

It is unfortunate that no decision was made on this yet, as typing annotations become more and more popular. I would really prefer form which would minimize duplication of information, thus making code/docstrings less cluttered and possibly inconsistent. A few extra aspects worth considering (were not mentioned) while working it all out

Returns/Yields

How to annotate then Returns (and Yields) in particular in case of simple data types -- should it become just the description, e.g. instead of

Returns
-------
int
    Description of anonymous integer return value.

(taken from https://numpydoc.readthedocs.io/en/latest/format.html#returns) to become just

Returns
-------
Description of anonymous integer return value.

if we are to avoid duplicating type annotation? or may be some "ad-hoc"

Returns
-------
_
    Description of anonymous integer return value.

to signal unnamed entity, or even giving it some specific name thus aligning to the next example where names given in case of a tuple return.

Overloads

See https://docs.python.org/3/library/typing.html#typing.overload describing support for them. ATM AFAIK mypy has various issues with testing overloads (over 280 mention overload) and thus pragmatically the actual implementation might need to specify union of types to have some checking done (ref: as discovered in our case), so it might be even trickier but still -- an aspect to keep in mind.

rth · 2023-03-29T14:43:18Z

In retrospect, IMO the best way is to go with sphinx-autodoc-typehints . I mean it already works with napoleon for numpy style docstrings, and it mentions numpydoc occasionally so it might just work with this package as well (though I haven't tested it).

I think it doesn't really make sense to start separate work on this topic, if there is already a popular package in the ecosystem that solves this problem well (and could work with numpydoc, unless there are indications that it's not the case).

jrast · 2023-04-17T15:32:58Z

@rth I just tried to use sphinx-autodoc-typehints with numpydoc and was not able to get a working configuration. I tried different configuration settings but the typehints did not show. While I agree that duplicate work should be prevented, currently there is no way which leads to the desired results.

rth · 2023-04-17T15:35:48Z

Would you mind opening an issue about it at https://github.com/tox-dev/sphinx-autodoc-typehints to see what they think about it?

OriolAbril · 2024-06-18T15:34:04Z

Hoping this helps get the ball rolling on this.

I personally think manual type info on the docstring is valuable, especially in Python and especially for non-experts, as type hints can be quite obfuscated. But it is true it means significant duplication (maybe there could be a "translator" to make type hints more readable in the docstring) which sometimes even leads to removing the type information from the docstring altogether as it is already in the type hints. I'd rather get some type hint style info injected into the docstring that nothing at all.

It should be possible to extend the current mangle_docstring function to add the type information (using sphinx_autodoc_typehints), or to do that from sphinx_autodoc_typehints or to build a lightweight numpydoc specific extension on top of sphinx_autodoc_typehints. https://github.com/OriolAbril/numpydoc/blob/type_hints/numpydoc/numpydoc.py#L238-L267 is a proof of concept of doing that after numpydoc has processed the docstring for example. With that the docstring for

def plot_dist(
    dt: DataTree,
    var_names: Optional[Sequence[Hashable]]=None,
    filter_vars: Optional[str]=None,
    group: str="posterior",
    coords: Mapping[str, Any]=None,
    sample_dims: Optional[Sequence[Hashable]]=None,
    kind: Optional[str]=None,
    point_estimate=None,
    ...
):
    """Plot 1D marginal densities in the style of John K. Kruschke’s book.

    Generate :term:`facetted` :term:`plots` with: a graphical representation of 1D marginal
    densities (as KDE, histogram, ECDF or dotplot), a credible interval and a point estimate.

    Parameters
    ----------
    dt
        Input data. In case of dictionary input, the keys are taken to be model names.
        In such cases, a dimension "model" is generated and can be used to map to aesthetics.
    var_names
        One or more variables to be plotted.
        Prefix the variables by ~ when you want to exclude them from the plot.
    filter_vars
        If None, interpret var_names as the real variables names.
        If “like”, interpret var_names as substrings of the real variables names.
        If “regex”, interpret var_names as regular expressions on the real variables names.
    group
        Group to be plotted.
    coords
    sample_dims
        Dimensions to reduce unless mapped to an aesthetic.
        Defaults to ``rcParams["data.sample_dims"]``
    kind : {"kde", "hist", "dot", "ecdf"}, optional
        How to represent the marginal density.

became:

(note only the parameters without type information had type hints injected, kind kept the manual type description)

jarrodmillman · 2024-06-18T15:42:04Z

https://github.com/scientific-python/docstub

OriolAbril · 2024-06-18T15:44:33Z

Thanks! I didn't know it existed and this is much closer to what I'd like to have! ❤️

rth changed the title ~~Support for type annotations (PEP 484)~~ Support type annotations (PEP 484) Jan 13, 2019

rth mentioned this issue Jul 30, 2019

add example of a good docstring for defaults and examples scikit-learn/scikit-learn#12356

Merged

rth mentioned this issue Oct 10, 2019

Python 2 / 3.4 end of support timeline #235

Closed

rth mentioned this issue Mar 20, 2020

Support typing scikit-learn/scikit-learn#16705

Open

pwinston mentioned this issue Jun 15, 2020

ImageSlice docstring needs updating napari/napari#1355

Closed

rth mentioned this issue Jun 26, 2020

Indicating None as an option in docstrings of n_jobs and random_state scikit-learn/scikit-learn#17748

Open

rth mentioned this issue Jul 4, 2020

ENH Adds typing support to LogisticRegression scikit-learn/scikit-learn#17799

Open

sappelhoff mentioned this issue Aug 29, 2020

Are type annotations worth having? mne-tools/mne-bids#533

Open

simonjayhawkins mentioned this issue Sep 17, 2020

TYP: alias IndexLabel without Optional pandas-dev/pandas#36401

Merged

5 tasks

JustAnotherArchivist mentioned this issue Dec 3, 2020

Document the individual modules better JustAnotherArchivist/snscrape#6

Open

This was referenced Dec 31, 2020

ENH: add stats.qmc module with quasi Monte Carlo functionality scipy/scipy#10844

Merged

Get some kind of typing for scipy.special scipy/scipy#11749

Closed

rth mentioned this issue Mar 21, 2021

DOC Fix formatting in some docstrings pyodide/pyodide#1353

Merged

vyasr mentioned this issue Apr 22, 2021

Add support for pydocstyle and test on abc.py rapidsai/cudf#7985

Merged

hamogu mentioned this issue May 4, 2022

Consider allowing type annotations in newly written code sherpa/sherpa#1501

Closed

This was referenced Jul 1, 2022

Typing overhaul dask/dask#9233

Closed

Annotations guidelines dask/community#255

Open

jrsmith3 mentioned this issue Aug 7, 2022

Update documentation jrsmith3/ibei#59

Closed

NicolasGensollen mentioned this issue Aug 30, 2022

Python implementation for StatisticsSurface aramis-lab/clinica#669

Merged

jorgepiloto mentioned this issue Sep 29, 2022

MAINT: add typehint example ansys/pydpf-post#169

Merged

HealthyPear mentioned this issue Mar 13, 2023

Napoleon docstring validation sphinx-doc/sphinx#11237

Open

yarikoptic mentioned this issue Mar 29, 2023

ENH: Support extracting DICOMs from ZIP files (and possibly other archives) by switching to use shutil.unpack_archive instead of tarfile module functionality nipy/heudiconv#471

Merged

eriknw mentioned this issue Mar 30, 2023

Add type annotations to functions? python-graphblas/python-graphblas#179

Open

jrast mentioned this issue Apr 17, 2023

sphinx-autodoc-typehints & numpydoc tox-dev/sphinx-autodoc-typehints#347

Open

yarikoptic mentioned this issue Jun 3, 2023

Investigate/use sphinx-autodoc-typehints datalad/datalad#7404

Closed

kburns mentioned this issue Jun 25, 2023

Improve type annotations DedalusProject/dedalus#255

Open

lwasser mentioned this issue Jun 28, 2023

Working towards better type annotation stravalib/stravalib#373

Closed

HealthyPear mentioned this issue Jul 24, 2023

How to exclude an API member from being processed by numpydoc #479

Open

fillipe-gsm mentioned this issue Jul 28, 2023

Feat: solve TSP using branch and bound method fillipe-gsm/python-tsp#34

Merged

connorjward mentioned this issue Oct 12, 2023

FML firedrakeproject/firedrake#3041

Merged

aqw mentioned this issue Oct 20, 2023

numpy-ify doctstrings psyinfra/onyo#339

Closed

4 tasks

larsoner mentioned this issue Nov 28, 2023

Add type hints mne-tools/mne-python#12243

Open

This was referenced Oct 10, 2024

Inclusing static types in docs breaks references glass-dev/glass#349

Closed

gh-346: enforce NumPy docstrings + use static typing in docs glass-dev/glass#347

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support type annotations (PEP 484) #196

Support type annotations (PEP 484) #196

rth commented Jan 13, 2019 •

edited

Loading

jorisvandenbossche commented Jan 14, 2019

jnothman commented Jan 14, 2019 via email

rth commented Jan 15, 2019

rth commented Oct 10, 2019

MartinThoma commented Jan 30, 2020

rth commented Mar 30, 2020 •

edited

Loading

MartinThoma commented Mar 30, 2020

jnothman commented Mar 31, 2020 via email

darcamo commented Apr 1, 2020

wwuck commented Jun 12, 2020

darcamo commented Jun 12, 2020

cbrnr commented Aug 11, 2021

TitaniumHocker commented Dec 19, 2022

HealthyPear commented Mar 13, 2023

yarikoptic commented Mar 29, 2023

rth commented Mar 29, 2023

jrast commented Apr 17, 2023

rth commented Apr 17, 2023

OriolAbril commented Jun 18, 2024

jarrodmillman commented Jun 18, 2024

OriolAbril commented Jun 18, 2024

Support type annotations (PEP 484) #196

Support type annotations (PEP 484) #196

Comments

rth commented Jan 13, 2019 • edited Loading

jorisvandenbossche commented Jan 14, 2019

jnothman commented Jan 14, 2019 via email

rth commented Jan 15, 2019

rth commented Oct 10, 2019

MartinThoma commented Jan 30, 2020

rth commented Mar 30, 2020 • edited Loading

MartinThoma commented Mar 30, 2020

jnothman commented Mar 31, 2020 via email

darcamo commented Apr 1, 2020

wwuck commented Jun 12, 2020

darcamo commented Jun 12, 2020

cbrnr commented Aug 11, 2021

TitaniumHocker commented Dec 19, 2022

HealthyPear commented Mar 13, 2023

yarikoptic commented Mar 29, 2023

Returns/Yields

Overloads

rth commented Mar 29, 2023

jrast commented Apr 17, 2023

rth commented Apr 17, 2023

OriolAbril commented Jun 18, 2024

jarrodmillman commented Jun 18, 2024

OriolAbril commented Jun 18, 2024

rth commented Jan 13, 2019 •

edited

Loading

rth commented Mar 30, 2020 •

edited

Loading