add extension best practices #338
base: dev
Conversation
ReadTheDocs does not like the ``|``
* Extend ``TimeSeries`` for storing timeseries data. NWB provides many types of ``TimeSeries``
  and you should identify the most specific type of ``TimeSeries`` relevant for your use case
  (e.g., extend ``ElectricalSeries`` to define a new kind of electrical recording).
Should these ``TimeSeries``/``ElectricalSeries`` be :py:class: intersphinx references?
Ditto for below on ``TimeIntervals``/``DynamicTable``.
I would suggest intersphinx links to the nwb-schema docs, since this is for extensions.
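For context, a minimal sketch of what extending a specific ``TimeSeries`` subtype can look like in an extension spec, using the ``pynwb.spec`` API from the ndx-template workflow (the type name and attribute below are made up for illustration):

from pynwb.spec import NWBAttributeSpec, NWBGroupSpec

# Illustrative only: a new kind of electrical recording that builds on
# ElectricalSeries rather than the generic TimeSeries.
laser_evoked_series = NWBGroupSpec(
    doc="Electrical recordings acquired during laser stimulation (illustrative).",
    neurodata_type_def="LaserEvokedSeries",  # new type defined by this extension
    neurodata_type_inc="ElectricalSeries",   # most specific existing type to extend
    attributes=[
        NWBAttributeSpec(
            name="laser_wavelength_in_nm",
            doc="Wavelength of the stimulation laser.",
            dtype="float",
        ),
    ],
)

The spec would then be registered with an ``NWBNamespaceBuilder`` and exported, roughly as the spec script generated by the template does.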
Attributes should be used mainly to store small metadata (usually less than 64 Kbytes) that
is associated with a particular Group or Dataset. Typical uses of attributes are, e.g., to
Just reading this, it's hard to grasp 'how big' something is to be less than 64 KB
Since most metadata are strings, maybe it would be useful to mention a relative scale to that?
A standard Python string under 64 KB would correspond to ~1300 characters, from:

>>> import sys
>>> sys.getsizeof("a")
50
>>> 64_000 / 50
1280.0

Google also says the average number of characters per word in English is 4.7, so that's ~272 words per string.
Mostly relevant to fields like ``description``, or sometimes ``reference_frame``.
I like the idea of relating this to characters. Do you know how big characters are within HDF5? Is it the same as Python strings in memory?
Hmm I'll have to investigate that a bit deeper, but most likely not the same as Python... I seem to remember us having some issues related to their byte string types here on the Inspector, but maybe that was for fields that are technically set as datasets, not attributes
Even in Python, if you apply a specific encoding it can change the size quite a bit
>>> sys.getsizeof("a".encode("utf-8"))
34

which would put the max closer to ~400 words.
Well, from https://docs.h5py.org/en/stable/strings.html#storing-strings and https://docs.h5py.org/en/stable/special.html#h5py.string_dtype, and from manually inspecting some of these attributes with HDFView,
it seems HDF5 stores strings with UTF-8 encoding, so about ~400 words on average from above.
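If the UTF-8 encoded length is what ends up on disk, a rough sanity check against the 64 KB guideline could look like the sketch below (illustrative only; it ignores HDF5's own storage overhead):

def fits_attribute_guideline(text: str, limit_bytes: int = 64_000) -> bool:
    # Compare the UTF-8 encoded size of the string to the ~64 KB guideline
    # for attribute values; HDF5 bookkeeping overhead is not counted here.
    return len(text.encode("utf-8")) < limit_bytes

description = "Extracellular recordings from a hypothetical area during a task. " * 20
print(len(description.encode("utf-8")), fits_attribute_guideline(description))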
> ~400 words

words or characters?
> words or characters?

words, using Google's estimate of average characters per word in English
Or just shy of ~1900 characters if that's easier to communicate
Co-authored-by: Cody Baker <[email protected]>
* **Use the postfix ``Table`` when extending a ``DynamicTable`` type.** E.g.,
  ``neurodata_type_def: LaserSettingsTable``
* **Explicit**. E.g., avoid the use of ambiguous abbreviations in names.
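As an illustration of the naming convention above, a hypothetical ``LaserSettingsTable`` spec extending ``DynamicTable`` might look like this (again using the ``pynwb.spec`` API; the column is invented for the example):

from pynwb.spec import NWBDatasetSpec, NWBGroupSpec

# Hypothetical spec: the new type extends DynamicTable, so its name ends in "Table".
laser_settings_table = NWBGroupSpec(
    doc="Settings of the stimulation laser, one row per configuration (illustrative).",
    neurodata_type_def="LaserSettingsTable",
    neurodata_type_inc="DynamicTable",
    datasets=[
        NWBDatasetSpec(
            name="power_in_mW",
            doc="Laser power used for each configuration.",
            neurodata_type_inc="VectorData",
            dtype="float",
        ),
    ],
)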
Suggested change:

Limit flexibility: Consider data reuse and tool developers
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
One of the aims of NWB is to make reusing data easier. This means that when proposing an extension you need to put yourself in the shoes of someone who will receive an NWB dataset and attempt to analyze it. Additionally, consider developers who will try to write tools that take NWB datasets as inputs. It's worth assessing how much additional code different approaches to your extension will require.
  and you should identify the most specific type of ``TimeSeries`` relevant for your use case
  (e.g., extend ``ElectricalSeries`` to define a new kind of electrical recording).
* Extend ``DynamicTable`` to store tabular data.
* Extend ``TimeIntervals`` to store specific annotations of intervals in time.
Suggested change:

* Extend ``TimeIntervals`` to store specific annotations of intervals in time.

Strive for backward compatible changes
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NWB is already incorporated in many tools - proposing a change that will make already released NWB datasets non-compliant will cause a lot of confusion and lead to significant costs to update code.
Using the :nwb_extension_git:`ndx-template` to create new extensions helps ensure
that extensions can be easily shared, reused, and published via the :ndx-catalog:`NDX Catalog <>`.
Suggested change:

that extensions can be easily shared, reused, and published via the :ndx-catalog:`NDX Catalog <>`.

Get the community involved
~~~~~~~~~~~~~~~~~~~~~~~~~~~
Try to reach out to colleagues working with the type of data you are trying to add support for. The more eyes you get on your extension, the better it will be.
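For reference, the ndx-template mentioned above is normally instantiated with cookiecutter; a sketch of invoking it from Python, assuming the template is the one published under the nwb-extensions GitHub organization:

from cookiecutter.main import cookiecutter

# Assumption: ndx-template is the cookiecutter template in the nwb-extensions
# organization; running this prompts for the extension name, author, etc.
cookiecutter("gh:nwb-extensions/ndx-template")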
Motivation
Integrate best practices from this PR: NeurodataWithoutBorders/nwb-overview#72