[ENH, MRG] Add EpochsSpectrumArray and SpectrumArray classes #11803

alexrockhill · 2023-07-13T22:52:49Z

Makes the API a lot nicer and more consistent with the rest of MNE. Related to the discussion here: https://mne.discourse.group/t/instantiating-epochsspectrum-and-spectrum-class-objects/7127.

Without the docstrings of the array classes, you actually have to look in __setstate__ to figure out what is required in the state argument, which is pretty confusing for me. The use case of ragged/unequal epochs is something I ran into too.

Okay with you @drammock ?


mne/time_frequency/spectrum.py

alexrockhill · 2023-07-14T04:18:42Z

Looks like the failures are unrelated, good to go by me

drammock

I pushed a commit with a couple changes too far outside the diff to do as web-interface suggestions. See my remaining comments below.

doc/changes/latest.inc

mne/time_frequency/spectrum.py

drammock · 2023-07-14T20:31:32Z

mne/time_frequency/spectrum.py

+                inst_type_str="Raw",
+                data_type="Average Power Spectrum",


If the data comes from an array we don't know if it was computed on Raw or averaged (Evoked) data. Also we don't know if it's power/amplitude, real/complex, etc.

Suggested change

-                inst_type_str="Raw",
-                data_type="Average Power Spectrum",
+                inst_type_str="Array",
+                data_type="Unknown",

This will probably break some tests (assuming we're testing thoroughly) so hopefully those test failures can guide what needs to change elsewhere

So real/complex can be inferred from the data type. We ask specifically for power in the constructor so maybe we should require that and let the amplitude kwarg in Spectrum.plot handle that? That'd be my 2 cents but not highly opinionated, whatever is simplest
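
The point that real versus complex can be read off the array itself could be sketched as below. `describe_spectrum_data` and the labels it returns are hypothetical, made up for illustration, and are not part of MNE. Power versus amplitude cannot be recovered this way, which is why it has to be a documented requirement or a keyword argument.

```python
import numpy as np


def describe_spectrum_data(data):
    """Return a display label based only on the array dtype (hypothetical helper)."""
    data = np.asarray(data)
    # np.iscomplexobj inspects the dtype, not the values, so a complex array
    # whose imaginary parts are all zero still counts as complex.
    if np.iscomplexobj(data):
        return "Complex Spectrum"
    return "Real Spectrum"


# Made-up data: 3 channels x 5 frequency bins.
real_psd = np.abs(np.random.default_rng(0).normal(size=(3, 5)))
complex_coefs = real_psd.astype(complex)

print(describe_spectrum_data(real_psd))
print(describe_spectrum_data(complex_coefs))
```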

mne/time_frequency/spectrum.py

mne/time_frequency/tests/test_spectrum.py

tutorials/simulation/10_array_objs.py


alexrockhill · 2023-07-14T21:59:23Z

Okay, I think the outstanding things here are:

Decide on fmin/fmax or freqs constructor
Think about if we want to support passing amplitude with a kwarg (psd_array_welch/multitaper only supports output power or complex so I would vote just supporting those which can be inferred from data type)
Decide about what Epochs-level data to put in the EpochsSpectrumArray API (I think it's a good compromise now but happy to hear from all the opinions @drammock requested to weigh in)

Otherwise looks good, thanks for the review @drammock.

Oh one more thing, I set Spectrum._inst_type to np.ndarray for the arrays, it's not used anywhere yet so this won't break anything but I'm not sure what the plan for that attribute was so just wanted to make sure that's the right move for future compatibility.

drammock

OK looking pretty good, thanks for tackling this! Left a couple more comments/questions, let's wait and see what other devs say now.

mne/time_frequency/spectrum.py

drammock · 2023-07-14T21:57:57Z

mne/time_frequency/spectrum.py

-    %(drop_log)s
+    %(events_epochs)s
+    %(event_id)s
+    %(metadata_epochs)s


did metadata get missed here or did you intentionally keep it? To me that one is the clearest case of "can / should be set afterward, not in constructor"

I left it in, I could see a use case for it here but that sounds reasonable to modularize if we prefer they set metadata later, I'll take it out

mne/time_frequency/tests/test_spectrum.py

tutorials/simulation/10_array_objs.py

alexrockhill · 2023-07-17T16:47:04Z

Okay to merge? The EpochsSpectrumArray constructor is minimal so we can always follow up with a PR adding more things to the constructor if necessary. This seems like a good start.

mne/time_frequency/tests/test_spectrum.py

agramfort · 2023-07-18T14:40:24Z

Ok for me

drammock · 2023-07-18T16:26:52Z

Okay to merge?

Your checklist above has only 1 of 3 items checked off (plus there's a fourth item that doesn't have a checkbox --- the question about _inst_type --- which is also not resolved yet). So no: not okay to merge. Is there some rush on your end?

alexrockhill · 2023-07-18T16:59:56Z

Not a huge rush but I'm sharing a script that uses it with someone else so it's a bit of a pain to describe how to install a feature branch so it would be nice not to take too long.

I was going to let whoever merged the PR check those boxes to make sure they're okay with them, it's fine by me how it is right now (after I implement the latest review).

alexrockhill · 2023-07-18T17:10:25Z

I just went ahead and added the fixtures so everyone wins 😄

Oh man that's a creepy smile, maybe 🙂 that's better

alexrockhill · 2023-07-18T19:01:34Z

Looks like the failure is unrelated.

drammock · 2023-08-25T14:24:51Z

sorry, I got captured by a grant deadline, it was on my list to come back to this yesterday but I ran out of time. Will prioritize for today.

alexrockhill · 2023-08-25T15:37:17Z

No worries, next week is great too, just didn't want it to get forgotten.


drammock · 2023-08-25T16:39:36Z

we chatted about this PR in the dev meeting today, to gauge opinions on whether supporting complex data as input to SpectrumArray and EpochsSpectrumArray is important or YAGNI. General feeling was:

PRO: it would be nice to include it for symmetry reasons, since in some cases our .compute_psd() methods return complex data.
CON 1: it makes it hard to validate the input data (specifically the dimensionality). Currently we output:
- 2D real-valued Spectrum (welch, multitaper on Raw/Evoked)
- 3D real-valued Spectrum (unaggregated welch on Raw/Evoked)
- 3D complex-valued Spectrum (unaggregated multitaper on Raw/Evoked)
- 3D real-valued EpochsSpectrum (welch, multitaper on Epochs)
- 4D real-valued EpochsSpectrum (unaggregated welch on Epochs)
- 4D complex-valued EpochsSpectrum (unaggregated multitaper on Epochs)
If we only accept real-valued input, we can safely only support 2 use cases: 2D real data -> Spectrum, 3D real data -> EpochsSpectrum. In contrast, if we allow complex input, for example how do we decide if a 3D complex input should be treated as unaggregated estimates from continuous data versus normal estimates from epoched data?
CON 2: supporting complex data means thinking a lot about downstream processing effects (i.e., does it make sense to plot? if so, is abs() a sufficient approach for converting to real-valued numbers? etc) which means significant maintenance burden. Now, we already kinda have to do this for the unaggregated multitaper output, but importantly that is the only case where we allow complex data so (1) we know what is the sensible way to convert to real-valued when needed (i.e. aggregating with precomputed multitaper weights) and (2) we can rely on knowing the dimension of the data (because we created the object).
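
To make CON 1 concrete, here is a small sketch that transcribes the six output cases listed above into a lookup keyed on (number of dimensions, complex or not). `CURRENT_OUTPUTS` and `candidate_containers` are hypothetical names for illustration only, not MNE code.

```python
import numpy as np

# (ndim, is_complex) -> what .compute_psd() can currently produce,
# transcribed from the list in the comment above.
CURRENT_OUTPUTS = {
    (2, False): ["Spectrum (welch or multitaper)"],
    (3, False): [
        "Spectrum (unaggregated welch)",
        "EpochsSpectrum (welch or multitaper)",
    ],
    (3, True): ["Spectrum (unaggregated multitaper)"],
    (4, False): ["EpochsSpectrum (unaggregated welch)"],
    (4, True): ["EpochsSpectrum (unaggregated multitaper)"],
}


def candidate_containers(data):
    """List the outputs that share this array's ndim and realness (hypothetical)."""
    data = np.asarray(data)
    key = (data.ndim, bool(np.iscomplexobj(data)))
    return CURRENT_OUTPUTS.get(key, [])


real_2d = np.ones((4, 10))
real_3d = np.ones((4, 4, 10))
print(candidate_containers(real_2d))
print(candidate_containers(real_3d))
```

A bare 3D real array already matches two rows, so an array constructor has to pick one reading by convention. Accepting user-supplied complex arrays would add the same kind of second reading for 3D complex input.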

All this has made me realize something that I previously overlooked / didn't discuss in the meeting, which is that when we originally added complex support, we were outputting numpy arrays so the ramifications didn't need to be traced through the rest of our API to make sure the complexness of the data didn't break things. When I created the Spectrum classes, support for complex unaggregated multitaper output was included for legacy reasons, but it probably would have been better to not support it in the classes (and to keep open a separate array-yielding code path for users who want the unaggregated multitaper output).

Which leads me to ask @mmagnuski (who originally requested the complex unaggregated multitaper output) a few questions:

what was your original use case that led to that request?
do you know others who use MNE's unaggregated multitaper estimates in their work?
do you foresee a use case for computing complex-valued spectral estimates outside of MNE and then putting them into a Spectrum or EpochsSpectrum object?

alexrockhill · 2023-08-25T20:59:29Z

Those are pretty reasonable concerns. You have to pass the method if the data is complex, and I added checks that the dimensions match the size of the frequencies passed, so I think the checks for wrong data make it very safe (i.e., even if the frequencies and n_segments/n_tapers are the same size, the method is required, so that would disambiguate the dimensions). I really can't think of a way you could pass the wrong data.

mmagnuski · 2023-08-26T07:58:58Z

@drammock
Hi, I am fine with NOT supporting complex per-taper output in Spectrum classes. The reason for returning complex per-taper output is mostly for calculating multitaper coherence (and other similar measures) - this is done per taper and then averaged. This is an example paper that showcases something similar.
But in general I don't think it would be used frequently and it does not make much sense to support it in the Spectrum object - that might be more time spent implementing and testing than actual user-time spent using the code. :)
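
For readers wondering why the per-taper coefficients have to stay complex for this use case, here is a generic NumPy-only sketch of a multitaper coherence estimate. It is not MNE's implementation and not the method of the paper mentioned above. It swaps in sine tapers for the DPSS tapers that multitaper estimators usually use, and it averages tapers with equal weights. `sine_tapers` and `multitaper_coherence` are hypothetical names.

```python
import numpy as np


def sine_tapers(n_times, n_tapers):
    """Orthonormal sine tapers, shape (n_tapers, n_times); stand-in for DPSS."""
    n = np.arange(1, n_times + 1)
    k = np.arange(1, n_tapers + 1)[:, np.newaxis]
    return np.sqrt(2.0 / (n_times + 1)) * np.sin(np.pi * k * n / (n_times + 1))


def multitaper_coherence(x, y, n_tapers=5):
    """Magnitude-squared coherence from per-taper complex coefficients."""
    tapers = sine_tapers(len(x), n_tapers)
    # Complex Fourier coefficients, one row per taper.
    x_coefs = np.fft.rfft(tapers * x, axis=-1)
    y_coefs = np.fft.rfft(tapers * y, axis=-1)
    # The cross-spectrum is formed per taper and then averaged. It needs the
    # complex coefficients: power alone discards the relative phase.
    s_xy = np.mean(x_coefs * y_coefs.conj(), axis=0)
    s_xx = np.mean(np.abs(x_coefs) ** 2, axis=0)
    s_yy = np.mean(np.abs(y_coefs) ** 2, axis=0)
    return np.abs(s_xy) ** 2 / (s_xx * s_yy)


# Made-up signals: two independent noise traces.
rng = np.random.default_rng(0)
x = rng.normal(size=512)
y = rng.normal(size=512)
coh_xy = multitaper_coherence(x, y)
coh_xx = multitaper_coherence(x, x)
print(coh_xy.shape, float(coh_xx.min()))
```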

agramfort · 2023-08-27T09:16:10Z

what's the status here? I see some remaining comments by @drammock. It's a matter of agreeing if we support complex or not?

alexrockhill · 2023-08-29T18:12:33Z

what's the status here? I see some remaining comments by @drammock. It's a matter of agreeing if we support complex or not?

Yes, that's my understanding: we should agree on whether to support complex PSDs.

One last point: I think from a maintenance perspective having complex SpectrumArrays was actually very helpful, because testing the complex cases revealed several bugs in how plotting was handled.

alexrockhill · 2023-08-29T18:14:00Z

mne/time_frequency/spectrum.py

+        # handle unaggregated multitaper
+        if hasattr(self, "_mt_weights"):
+            logger.info("Aggregating multitaper estimates before plotting...")
+            psds = _psd_from_mt(psds, weights=self._mt_weights)
+        # handle unaggregated Welch
+        elif "segment" in self._dims:
+            logger.info("Aggregating Welch estimates (median) before plotting...")
+            seg_axis = self._dims.index("segment")
+            psds = np.nanmedian(psds, axis=seg_axis)
+        if np.iscomplexobj(psds):  # convert to power for plotting
+            psds = (psds * psds.conj()).real
+        if "epoch" in self._dims:
+            psds = np.mean(psds, axis=self._dims.index("epoch"))
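
The complex-to-power step in the hunk above multiplies each coefficient by its conjugate. A quick check on made-up data that this equals the squared magnitude, so it yields power, whereas a plain abs() would yield amplitude:

```python
import numpy as np

# Made-up complex coefficients with an arbitrary 3D shape.
rng = np.random.default_rng(0)
coefs = rng.normal(size=(2, 3, 5)) + 1j * rng.normal(size=(2, 3, 5))

# Same expression as in the diff: the imaginary part of z * conj(z) is zero,
# so taking .real only changes the dtype from complex to float.
power = (coefs * coefs.conj()).real
amplitude = np.abs(coefs)

print(power.dtype, np.allclose(power, amplitude**2))
```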


drammock · 2023-08-29T19:14:42Z

I'll restate here the pros and cons from my comment above:

PRO: it would be nice to include it for symmetry reasons, since in some cases our .compute_psd() methods return complex data.
CON 1: it makes it hard to validate the input data (specifically the dimensionality).
CON 2: supporting complex data means thinking a lot about downstream processing effects.

@mmagnuski's comment suggests to me that we should probably deprecate complex output support in the (Epochs)Spectrum classes, and provide an alternate code path that outputs NumPy arrays, for folks like @mmagnuski who really want access to the individual taper estimates. If we do that, then the main "pro" of allowing complex input to the (Epochs)SpectrumArray classes goes away. I'll gladly acknowledge that some issues with plotting were revealed by the effort of adding that support, but IMO that does not constitute a "maintenance win" --- we can easily keep the plotting bugfixes while eliminating the complex support, and keeping complex support is still more lines of code and more branching code paths and more downstream effects to keep track of, all for the sake of (as far as I've heard) no known real-world use case for why it's preferable to have the complex data in a Spectrum-like container.

My vote here is:

a PR that deprecates complex output in the Spectrum classes
a PR that adds a spectrum pytest fixture and adds it to all the existing tests that use Spectrum objects (which was not done here, and is why I initially requested a separate PR for that in the first place)
a PR that adds the new (Epochs)SpectrumArray classes with only real-valued input allowed (and no support for unaggregated-welch-style input, i.e., 2D is always SpectrumArray, 3D is always EpochsSpectrumArray), and associated tests

alexrockhill · 2023-08-29T19:28:12Z

That's fine with me, I'll remove complex support. You spent a lot of time and effort making the spectrum classes @drammock, so I'm happy to defer to your judgment, and I agree there are not a lot of apparent use cases that come to mind: phase doesn't make any sense, it's not time-frequency. Thanks for the direct communication, that helps move forward.


alexrockhill · 2023-08-29T19:41:13Z

Ok I removed complex support and this is ready for review. I changed the data_type to Power Spectrum from Real/Complex Spectrum because of the change not to support complex input. It's just for the display, I really don't care that much so just let me know if anyone wants it back to Real or whatever else.

drammock

here's a self-review as commentary on the changes I've just pushed.

@larsoner @agramfort or @mmagnuski maybe best for one of you to push the green button on this one, it's got a fair amount of code from both me and @alexrockhill

drammock · 2023-09-01T21:36:22Z

mne/conftest.py

    """Get raw with power spectral density computed from mne.io.tests.data."""
-    return [raw.compute_psd(method=method) for method in ("welch", "multitaper")]
+    return raw.compute_psd()


I don't think we have a good reason to test both Welch and multitaper in the fixture; the differences aren't relevant to most tests (and should be covered adequately by the tests of the welch/multitaper array methods used under the hood)

drammock · 2023-09-01T21:36:43Z

mne/conftest.py

@@ -298,9 +298,9 @@ def raw_ctf():


 @pytest.fixture(scope="function")
-def raw_psds(raw):
+def raw_spectrum(raw):


for naming consistency

drammock · 2023-09-01T21:37:14Z

mne/time_frequency/__init__.py

-from .ar import fit_iir_model_raw
-from .multitaper import dpss_windows, psd_array_multitaper, tfr_array_multitaper
-from .spectrum import (
-    EpochsSpectrum,
-    EpochsSpectrumArray,
-    Spectrum,
-    SpectrumArray,
-    read_spectrum,
-)
-from ._stft import stft, istft, stftfreq
-from ._stockwell import tfr_stockwell, tfr_array_stockwell


when you merged in main, this wasn't properly integrated into the new lazy loading scheme

drammock · 2023-09-01T21:37:54Z

mne/time_frequency/spectrum.py

@@ -429,7 +429,7 @@ def _check_values(self):
        assert len(self._dims) == self._data.ndim, (self._dims, self._data.ndim)
        assert self._data.shape == self._shape
        # negative values OK if the spectrum is really fourier coefficients
-        if np.iscomplexobj(self._data):
+        if "taper" in self._dims:


reverting to reduce churn, esp. given that we now plan to deprecate support for complex data anyway.

drammock · 2023-09-01T21:38:16Z

mne/time_frequency/spectrum.py

-        # handle unaggregated multitaper
-        if hasattr(self, "_mt_weights"):
-            logger.info("Aggregating multitaper estimates before plotting...")
-            psds = _psd_from_mt(psds, weights=self._mt_weights)
-        # handle unaggregated Welch
-        elif "segment" in self._dims:
-            logger.info("Aggregating Welch estimates (median) before plotting...")
-            seg_axis = self._dims.index("segment")
-            psds = np.nanmedian(psds, axis=seg_axis)
-        if np.iscomplexobj(psds):  # convert to power for plotting
-            psds = (psds * psds.conj()).real


reverting; will not be needed.

drammock · 2023-09-01T21:38:31Z

mne/time_frequency/spectrum.py

-        # handle unaggregated multitaper
-        if hasattr(self, "_mt_weights"):
-            logger.info("Aggregating multitaper estimates before plotting...")
-            psds = _psd_from_mt(psds, weights=self._mt_weights)
-        # handle unaggregated Welch
-        elif "segment" in self._dims:
-            logger.info("Aggregating Welch estimates (median) before plotting...")
-            seg_axis = self._dims.index("segment")
-            psds = np.nanmedian(psds, axis=seg_axis)
-        if np.iscomplexobj(psds):  # convert to power for plotting
-            psds = (psds * psds.conj()).real


reverting, will not be needed.

drammock · 2023-09-01T21:39:37Z

mne/time_frequency/spectrum.py

@@ -1226,11 +1197,21 @@ def __getitem__(self, item):
        return BaseRaw._getitem(self, item, return_times=False)


-def _check_data_shape(param, dim, data, expected):
-    if data.shape[dim] != expected:
+def _check_data_shape(data, freqs, info, ndim):


I reworked this to check everything at once rather than needing multiple calls to the check function. Seemed simpler this way once we knew we only had to handle 2D and 3D cases.
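
A rough idea of what "check everything at once" can look like once only 2D and 3D real-valued input is allowed. This is a hypothetical sketch, not the function from the diff: it takes a plain `n_channels` integer instead of `info`, and it assumes the last two axes are (channel, freq).

```python
import numpy as np


def check_data_shape_sketch(data, freqs, n_channels, ndim):
    """Validate a real-valued spectrum array in one pass (hypothetical helper).

    Assumes ndim=2 means (n_channels, n_freqs) and ndim=3 means
    (n_epochs, n_channels, n_freqs).
    """
    data = np.asarray(data)
    if np.iscomplexobj(data):
        raise ValueError("complex-valued data is not accepted in this sketch")
    if data.ndim != ndim:
        raise ValueError(f"expected {ndim}D data, got {data.ndim}D")
    if data.shape[-2] != n_channels:
        raise ValueError(
            f"channel axis has length {data.shape[-2]}, expected {n_channels}"
        )
    if data.shape[-1] != len(freqs):
        raise ValueError(
            f"freq axis has length {data.shape[-1]}, expected {len(freqs)}"
        )


freqs = np.arange(5)
# Both calls pass silently: 2D for continuous data, 3D for epoched data.
check_data_shape_sketch(np.ones((3, 5)), freqs, n_channels=3, ndim=2)
check_data_shape_sketch(np.ones((7, 3, 5)), freqs, n_channels=3, ndim=3)
```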

drammock · 2023-09-01T21:40:52Z

mne/time_frequency/tests/test_spectrum.py

@@ -141,18 +142,21 @@ def test_spectrum_io(inst, tmp_path, request, evoked):
    assert orig == loaded


-def test_spectrum_copy(raw):
+def test_spectrum_copy(raw_spectrum):


Although I'd have preferred adding the fixture in its own PR, I decided it was expedient to just go with it, so I've used it in the other spectrum tests too.

tutorials/simulation/10_array_objs.py

drammock · 2023-09-02T12:18:17Z

All green, this one is ready for review/merge

larsoner

Pretty short diff in the end, thanks @alexrockhill @drammock !


alexrockhill added 4 commits July 13, 2023 15:50

[ENH, MRG] Add EpochsSpectrumArray and SpectrumArray classes [skip ci]

395a758

update latest

710cef4

fix refs

e34bc38

wrong versionadded

3fcb557

alexrockhill commented Jul 14, 2023


mne/time_frequency/spectrum.py

mne/time_frequency/spectrum.py

alexrockhill added 2 commits July 13, 2023 19:57

Update mne/time_frequency/spectrum.py

597c48c

Update mne/time_frequency/spectrum.py

6a54f68

alexrockhill and others added 2 commits July 13, 2023 21:35

epoch not epochs

125903e

edit seealso [ci skip]

e0fb07f

drammock reviewed Jul 14, 2023


alexrockhill added 4 commits July 14, 2023 14:43

Dan review

8cee941

Merge branch 'defaults' of https://github.com/alexrockhill/mne-python into defaults

738c8d3

style

0cd6ec8

Merge branch 'main' into defaults

5d18ff3

drammock reviewed Jul 14, 2023


alexrockhill added 3 commits July 14, 2023 15:22

cruft, test plot

27be8d8

style

145b6bb

very picky style

71016d6

agramfort reviewed Jul 18, 2023


drammock mentioned this pull request Jul 18, 2023

add spectrum fixture to pytest config #11811

Closed

alexrockhill added 2 commits July 18, 2023 10:09

add fixtures

1ccf8a4

Merge branch 'main' into defaults

ece2b39

alexrockhill added 2 commits August 25, 2023 08:42

try skip h5io

54488ed

Merge branch 'defaults' of https://github.com/alexrockhill/mne-python into defaults

1a1e57f

Merge branch 'main' into defaults

9f45402

alexrockhill commented Aug 29, 2023


alexrockhill added 2 commits August 29, 2023 12:39

remove complex support

8f49d05

Merge branch 'defaults' of https://github.com/alexrockhill/mne-python into defaults

831c1ee

alexrockhill and others added 4 commits August 29, 2023 16:45

Merge branch 'main' into defaults

b6707f3

simplifications and fixes

e178231

Merge remote-tracking branch 'upstream/main' into defaults

88c790e

oops missed this

f735e12

drammock approved these changes Sep 1, 2023


drammock reviewed Sep 1, 2023


tutorials/simulation/10_array_objs.py

Update tutorials/simulation/10_array_objs.py

40627ac

larsoner approved these changes Sep 2, 2023


larsoner merged commit db07d55 into mne-tools:main Sep 2, 2023
26 checks passed

alexrockhill deleted the defaults branch September 5, 2023 16:19

drammock mentioned this pull request Sep 10, 2023

Deprecate complex spectrum obj #11978

Merged

snwnde pushed a commit to snwnde/mne-python that referenced this pull request Mar 20, 2024

[ENH, MRG] Add EpochsSpectrumArray and SpectrumArray classes (mne-too…

01892d7

…ls#11803) Co-authored-by: Daniel McCloy <[email protected]>

tsbinns mentioned this pull request Jul 24, 2024

[ENH] (Re)implement complex data support for Spectrum and SpectrumArray classes #12747

Merged
