Update SpectrumList loader for JWST data #579

jdavies-st · 2020-01-31T16:40:29Z

Allow the reader to read either flux or surf_bright
Generalize the extraction of units to track column name, not column number
Read in the uncertainty array
Fix up some docstrings elsewhere
Write tests, tests, tests

Note that extract1d spectral products in the pipeline currently do not have populated uncertainty arrays. They are all zeros. That will be changing in the next month or so.

nmearl

This looks good @jdavies-st! I'd recommend adding a final test that is expected to fail to handle the case where the srctype is malformed (e.g. not point or extended), but otherwise, this is great.

jdavies-st · 2020-02-05T14:51:42Z

Good point. I believe srctype can sometimes be "UNKNOWN" as well, so this would be a good thing to check.

eteq

One minor (but potentially important) suggestion, but otherwise I think this looks good.

An additional suggestion, which might be better answered as "this is better done as a follow-on PR" - there's not really any part of the documentation that explains how to use this from the user perspective. There's https://specutils.readthedocs.io/en/stable/spectrum1d.html#reading-from-a-file but it doesn't have enough information for the user to really grasp anything about the subtelties of Spectrum1D vs SpectrumList and so on (or even how to select the appropriate reader if they want a particular one), while https://specutils.readthedocs.io/en/stable/custom_loading.html is geared towards making a loader rather than using one. Perhaps this loader should be mentioned in the docs as an example of when you might use SpectrumList instead of Spectrum1D?

eteq · 2020-02-07T22:37:45Z

specutils/io/default_loaders/jwst_reader.py

        return True
    # This probably means we didn't have a FITS file
    except Exception:
        return False


-@data_loader("JWST", identifier=identify_jwst_fits, dtype=SpectrumList,
+@data_loader("JWST", identifier=identify_jwst_x1d_fits, dtype=SpectrumList,


Suggested change

@data_loader("JWST", identifier=identify_jwst_x1d_fits, dtype=SpectrumList,

@data_loader("JWST_x1d", identifier=identify_jwst_x1d_fits, dtype=SpectrumList,

I think this is necessary if we want to have a whole family of JWST loaders, right? I.e., if we don't do this there's no way to use the name to specify which of the JWST loaders to use?

Yes, I think this is necessary as well. Would this be a backwards incompatible change though? Does that matter?

Perhaps this is where my limited knowledge of the infrastructure of the io registry and specifically in how it has been implemented in specutils works.

eteq · 2020-02-07T22:43:58Z

A related thought to the above (but somewhat independent): should there also be a Spectrum1D that can be used for JWST data sets that have just one spectrum in them? I don't know if this is actually an at all common use case, but it seems like it would be a trivial addition to this if there are any?

(This could also be a separate issue to not delay this particular PR)

- Retain SpectrumList.read() on multi-object x1d functionality - Add Spectrum1D.read() on single spectrum x1d files - Return RuntimeError if trying to use Spectrum1D.read on multi-object x1d data - Update tabular-fits reader to ignore JWST data

jdavies-st · 2020-02-10T19:59:40Z

I've added a reader for single spectrum x1d files for the Spectrum1D class. If passed multiple spectra, it raises RuntimeError. I've also expanded the testing. Hopefully I've responded to all review comments now.

specutils/io/default_loaders/tests/test_jwst_loader.py

dhomeier · 2020-02-11T18:25:23Z

specutils/io/default_loaders/jwst_reader.py

+
+                error_units = u.Unit(hdu.columns["error"].unit)
+                uncertainty = StdDevUncertainty(hdu.data["error"] * error_units)
+


Having just sneaked into a discussion on the pros and cons of using the Table API for loading hdu data, I am wondering if this could be streamlined to

data = Table.read(hdu) if srctype == "POINT": flux = Quantity(data["flux"]) uncertainty = StdDevUncertainty(data["error"]) ...

and so on for the other srctypes – see e.g. for comparison muscles_sed.py.

That looks good! I'll give it a try. Thanks for the tip.

That worked very well. Thanks. Added in a new commit. And I added some units checking to the tests, something I had overlooked before.

Just remembered that parsing_utils has a function spectrum_from_column_mapping that in principle would even allow to simply specify this as dictionaries like

if srctype == "POINT": column_mapping = {'wavelength': ('spectral_axis', None), 'flux': ('flux', None), 'error': ('uncertainty', None)} elif srctype == "EXTENDED": column_mapping = {'wavelength': ('spectral_axis', None), 'surf_bright': ('flux', None), 'sb_error': ('uncertainty', None)}

but this would require #573 to be merged to work without explicitly specifying the units - just for reference.

I can add that simplification in a future PR. 👍

- The 2 identifiers can distinguish between single and multi spec x1d files and load it into the appropriate Spectrum1D or SpectrumList - Add tests for this and for warning when error array is zeros

jdavies-st · 2020-02-13T20:11:40Z

And finally I added some population of the Spectrum1D.meta attribute. Basically concat together the PRIMARY FITS header and the per-EXTRACT1D extension header into the meta dict as key:value pairs. This doesn't retain the header object comments, but perhaps this is preferable to having the whole FITS header as a value in a header dict key?

nmearl · 2020-02-17T13:57:34Z

@jdavies-st I think the method of concatenating the header is fine, I'm not see any reason why it should cause issues now. But we should remind ourselves that we did it if we do run into programs. This looks great!

Update SpectrumList loader for JWST data

eb0c0c9

jdavies-st force-pushed the loaders-jwst branch from 916dbe1 to eb0c0c9 Compare January 31, 2020 18:52

nmearl self-requested a review January 31, 2020 19:34

jdavies-st requested a review from eteq February 4, 2020 15:32

nmearl reviewed Feb 5, 2020

View reviewed changes

jdavies-st added 2 commits February 6, 2020 10:11

Captilize SRCTYPEs and add test for failure mode

a0e7b78

Add docstrings and better names to tests

bedd18b

eteq approved these changes Feb 7, 2020

View reviewed changes

nmearl reviewed Feb 11, 2020

View reviewed changes

specutils/io/default_loaders/tests/test_jwst_loader.py Outdated Show resolved Hide resolved

dhomeier reviewed Feb 11, 2020

View reviewed changes

jdavies-st added 3 commits February 11, 2020 14:54

Add x1d multi-slit identifier to JWST loader

99c2a26

- The 2 identifiers can distinguish between single and multi spec x1d files and load it into the appropriate Spectrum1D or SpectrumList - Add tests for this and for warning when error array is zeros

Use Table and Quantity to read in units for JWST loaders

f21a74d

Add primary and slit FITS headers to Spectrum1D.meta

ceff2d8

nmearl merged commit 2da78bd into astropy:master Feb 17, 2020

dhomeier mentioned this pull request Feb 18, 2020

Fix default_loaders tabular-fits format and automatic recognition #573

Merged

jdavies-st deleted the loaders-jwst branch February 21, 2020 15:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update SpectrumList loader for JWST data #579

Update SpectrumList loader for JWST data #579

jdavies-st commented Jan 31, 2020 •

edited

Loading

nmearl left a comment

jdavies-st commented Feb 5, 2020

eteq left a comment

eteq Feb 7, 2020

jdavies-st Feb 10, 2020

eteq commented Feb 7, 2020 •

edited

Loading

jdavies-st commented Feb 10, 2020

dhomeier Feb 11, 2020

jdavies-st Feb 11, 2020

jdavies-st Feb 11, 2020

dhomeier Feb 12, 2020

jdavies-st Feb 13, 2020

jdavies-st commented Feb 13, 2020

nmearl commented Feb 17, 2020

	@data_loader("JWST", identifier=identify_jwst_x1d_fits, dtype=SpectrumList,
	@data_loader("JWST_x1d", identifier=identify_jwst_x1d_fits, dtype=SpectrumList,


		error_units = u.Unit(hdu.columns["error"].unit)
		uncertainty = StdDevUncertainty(hdu.data["error"] * error_units)

Update SpectrumList loader for JWST data #579

Update SpectrumList loader for JWST data #579

Conversation

jdavies-st commented Jan 31, 2020 • edited Loading

nmearl left a comment

Choose a reason for hiding this comment

jdavies-st commented Feb 5, 2020

eteq left a comment

Choose a reason for hiding this comment

eteq Feb 7, 2020

Choose a reason for hiding this comment

jdavies-st Feb 10, 2020

Choose a reason for hiding this comment

eteq commented Feb 7, 2020 • edited Loading

jdavies-st commented Feb 10, 2020

dhomeier Feb 11, 2020

Choose a reason for hiding this comment

jdavies-st Feb 11, 2020

Choose a reason for hiding this comment

jdavies-st Feb 11, 2020

Choose a reason for hiding this comment

dhomeier Feb 12, 2020

Choose a reason for hiding this comment

jdavies-st Feb 13, 2020

Choose a reason for hiding this comment

jdavies-st commented Feb 13, 2020

nmearl commented Feb 17, 2020

jdavies-st commented Jan 31, 2020 •

edited

Loading

eteq commented Feb 7, 2020 •

edited

Loading