NLCD2016 Tree Canopy #1243

isaaccorley · 2023-04-14T02:56:49Z

This PR adds the NLCD2016 Tree Canopy dataset

See https://www.mrlc.gov/data/nlcd-2016-usfs-tree-canopy-cover-conus

torchgeo/datasets/nlcd.py

adamjstewart · 2023-04-14T03:03:44Z

torchgeo/datasets/nlcd.py

+class NLCD2016TreeCanopy(RasterDataset):
+    """National Land Cover Database 2016 (NLCD2016) - Tree Canopy dataset.
+
+    The `National Land Cover Database <https://www.mrlc.gov/>`_ provides 30m tree


I would link to the tree canopy page, not the NLCD page

Suggested change

The `National Land Cover Database <https://www.mrlc.gov/>`_ provides 30m tree

The `Multi-Resolution Land Characteristics (MRLC) Consortium <https://www.mrlc.gov/>`_ provides 30m tree

Here's the specific page for reference https://www.mrlc.gov/data/nlcd-2016-usfs-tree-canopy-cover-conus

calebrob6 · 2023-04-14T04:20:32Z

torchgeo/datasets/nlcd.py

+class NLCD2016TreeCanopy(RasterDataset):
+    """National Land Cover Database 2016 (NLCD2016) - Tree Canopy dataset.
+
+    The `National Land Cover Database <https://www.mrlc.gov/>`_ provides 30m tree


Here's the specific page for reference https://www.mrlc.gov/data/nlcd-2016-usfs-tree-canopy-cover-conus

calebrob6 · 2023-04-14T04:28:25Z

Comparison of the IMG format to COG format:

IMG is uncompressed, just the tree canopy dataset is 20GB on disk. It is 16832104560 pixels that are 1 byte each + overviews :)
IMG format consists of a .html, .ige, .img, and .img.xml file
COG is 4.2 GB total, no extra files, lossless compression
Random windowed reads on the IMG data is ~.6 seconds per 1000
Random windowed reads on the COG data is 2.1 seconds per 1000

adamjstewart · 2023-04-14T15:16:26Z

Surprised img is faster than COGs, I thought COGs were the gold standard.

adamjstewart · 2023-04-14T15:16:46Z

This will need to be rebased once #1244 is merged.

calebrob6 · 2023-04-14T16:11:32Z

Surprised img is faster than COGs, I thought COGs were the gold standard.

I'm guessing the difference is compression related (COG is 5x smaller and 3x slower to read). It is apples to oranges as if these were hosted on a remote server, you could still do windowed reading quickly with a COG.

calebrob6 · 2023-04-14T16:36:49Z

Confirmed that the difference is entirely compression related:

calebrob6 · 2023-04-14T16:47:58Z

(for completeness, because I was curious)

It is apples to oranges as if these were hosted on a remote server, you could still do windowed reading quickly with a COG.

Not quite actually, you can still do windowed reading from remote files with the Erdas Imagine format, but it is 2x slower than COGs. Also, compression vs. no compression doesn't seem to matter when reading from remote files (it looks like compressed is slightly faster, which makes sense as the time it takes to transfer the data is going to dominate).

calebrob6 · 2023-04-14T16:48:08Z

TL;DR -- use COGs

adamjstewart · 2024-08-21T08:24:51Z

We now have a generic NLCD dataset, this is likely something we should add to nlcd.py instead of making it its own unrelated dataset.

add nlcd 2016 tree canopy datasets

52f5300

isaaccorley marked this pull request as draft April 14, 2023 02:56

isaaccorley self-assigned this Apr 14, 2023

github-actions bot added the datasets Geospatial or benchmark datasets label Apr 14, 2023

adamjstewart reviewed Apr 14, 2023

View reviewed changes

torchgeo/datasets/nlcd.py Outdated Show resolved Hide resolved

adamjstewart reviewed Apr 14, 2023

View reviewed changes

adamjstewart added this to the 0.5.0 milestone Apr 14, 2023

calebrob6 previously approved these changes Apr 14, 2023

View reviewed changes

shameless copy from nils PR

e83c706

isaaccorley dismissed calebrob6’s stale review via e83c706 April 15, 2023 18:08

isaaccorley added 8 commits April 21, 2023 15:23

Merge branch 'main' into datasets/nlcd2016treecanopy

61b0c34

Merge branch 'main' into datasets/nlcd2016treecanopy

dddddb4

Merge branch 'main' into datasets/nlcd2016treecanopy

cce2123

Merge branch 'main' into datasets/nlcd2016treecanopy

b0f70d3

Merge branch 'main' into datasets/nlcd2016treecanopy

02bb4b3

Merge branch 'main' into datasets/nlcd2016treecanopy

f05f4fb

Merge branch 'main' into datasets/nlcd2016treecanopy

e5ccebd

Merge branch 'main' into datasets/nlcd2016treecanopy

2fa02d5

adamjstewart removed this from the 0.5.0 milestone Sep 28, 2023

Merge branch 'main' into datasets/nlcd2016treecanopy

8924940

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NLCD2016 Tree Canopy #1243

NLCD2016 Tree Canopy #1243

isaaccorley commented Apr 14, 2023

adamjstewart Apr 14, 2023

calebrob6 Apr 14, 2023

calebrob6 Apr 14, 2023

calebrob6 Apr 14, 2023

calebrob6 commented Apr 14, 2023

adamjstewart commented Apr 14, 2023

adamjstewart commented Apr 14, 2023

calebrob6 commented Apr 14, 2023

calebrob6 commented Apr 14, 2023

calebrob6 commented Apr 14, 2023 •

edited

Loading

calebrob6 commented Apr 14, 2023

adamjstewart commented Aug 21, 2024

	The `National Land Cover Database <https://www.mrlc.gov/>`_ provides 30m tree
	The `Multi-Resolution Land Characteristics (MRLC) Consortium <https://www.mrlc.gov/>`_ provides 30m tree

NLCD2016 Tree Canopy #1243

Are you sure you want to change the base?

NLCD2016 Tree Canopy #1243

Conversation

isaaccorley commented Apr 14, 2023

adamjstewart Apr 14, 2023

Choose a reason for hiding this comment

calebrob6 Apr 14, 2023

Choose a reason for hiding this comment

calebrob6 Apr 14, 2023

Choose a reason for hiding this comment

calebrob6 Apr 14, 2023

Choose a reason for hiding this comment

calebrob6 commented Apr 14, 2023

adamjstewart commented Apr 14, 2023

adamjstewart commented Apr 14, 2023

calebrob6 commented Apr 14, 2023

calebrob6 commented Apr 14, 2023

calebrob6 commented Apr 14, 2023 • edited Loading

calebrob6 commented Apr 14, 2023

adamjstewart commented Aug 21, 2024

calebrob6 commented Apr 14, 2023 •

edited

Loading