
Croptype #71

Open · wants to merge 193 commits into base: main
Conversation

@gabrieltseng (Collaborator) commented Jun 4, 2024

presto/eval.py Outdated
learning_rate=0.05,
early_stopping_rounds=20,
l2_leaf_reg=3,
learning_rate=0.2,
gabrieltseng (Collaborator, Author):

How are we selecting these parameters?

paper_eval.py Outdated

# argparser.add_argument("--val_samples_file", type=str, default="cropland_spatial_generalization_test_split_samples.csv")

argparser.add_argument("--presto_model_type", type=str, default="presto-ft-ct")
gabrieltseng (Collaborator, Author):

Is this argument (presto_model_type) only used to name the experiment file? If so, can it be given a different name? The current name implies it will somehow affect the model.

gabrieltseng (Collaborator, Author):

Same for other names which don't affect the functionality, e.g. compositing_window.

Also, I think the most up-to-date main hasn't been merged into this branch, since 10d compositing is supported there but doesn't seem to be here.

Reply:

Good point, addressed that here: 1601dbc

paper_eval.py Outdated
model = Presto.construct(**model_kwargs)
best_model_path = None
model.to(device)
val_samples_file = f"{task_type}_{test_type}_generalization_test_split_samples.csv"
gabrieltseng (Collaborator, Author):

This approach requires the script to be run three times to get the full results, whereas previously a single run produced all three.

Perhaps we can update this so that a single run collects all the data?
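A sketch of what a single run covering all splits could look like, reusing the filename pattern already in the script. The `test_type` values and `run_eval` here are hypothetical placeholders for the real split names and per-split evaluation logic:

```python
def run_eval(val_samples_file):
    # Placeholder for the existing per-split evaluation logic.
    return {"val_samples_file": val_samples_file}

task_type = "cropland"                          # hypothetical example value
test_types = ["spatial", "temporal", "random"]  # hypothetical split names

# One invocation produces all three results instead of three separate runs.
results = {
    t: run_eval(f"{task_type}_{t}_generalization_test_split_samples.csv")
    for t in test_types
}
```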

paper_eval.py Outdated
# check if finetuned model already exists
logger.info("Checking if the finetuned model exists")
if os.path.isfile(finetuned_model_path):
logger.info("Finetuned model found! Loading...")
gabrieltseng (Collaborator, Author):

Do we want to do this? I can imagine it introducing lots of unexpected errors: if a finetuned model exists from a previous run, it will silently affect this run whether or not we want it to.

gabrieltseng (Collaborator, Author):

Also, would this work if there is no finetuned model? I think the script would error out, since finetuned_model would never be initialized.
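One way to guarantee the model variable is always initialized is an explicit load-or-finetune branch. A sketch, where `load_fn` and `finetune_fn` are hypothetical stand-ins for the real loading and finetuning code:

```python
import os


def get_finetuned_model(finetuned_model_path, load_fn, finetune_fn):
    """Load a checkpoint if one exists, otherwise finetune from scratch,
    so the returned model is initialized on every code path."""
    if os.path.isfile(finetuned_model_path):
        return load_fn(finetuned_model_path)
    # No checkpoint found: finetune instead of erroring out. An explicit
    # opt-in flag for reusing checkpoints would avoid the stale-model
    # surprise discussed above.
    return finetune_fn()
```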

gabrieltseng (Collaborator, Author):

Also, how is the model finetuned in this case? I think that's pretty important, so it would be good to capture it in this script.

Reply:

I still feel the need to be able to upload the finetuned model, particularly if I want to run only the downstream classifiers and collect metrics. In particular, this piece only checks whether the model exists, collects metrics for the uploaded model along with spatial plots, and runs the sklearn models.
I didn't succeed in implementing the upload in presto.py, so this piece wouldn't run anyway; for now I just put a placeholder here: 532ec67

presto/eval.py Outdated
batch_size: int = 64
patience: int = 10
num_workers: int = 4
batch_size: int = 2048
gabrieltseng (Collaborator, Author):

How were these chosen? This batch size is probably too large for finetuning

Reply:

Indeed, reverted batch_size to 256 here: a350280
I was playing with the learning rate, but also ended up using your value.

(torch.zeros(x.shape[0])[:, None].to(device).int(), orig_indices + 1),
dim=1,
)
x, upd_mask, orig_indices = self.add_token(latlon_tokens, x, upd_mask, orig_indices)
gabrieltseng (Collaborator, Author):

@cbutsko a hacky way to get the model to ignore latlons is to change the mask here. A mask value of 0 tells the model to include a token, and a value of 1 tells it to ignore it.

New tokens get added to the front of the sequence. Concretely:

x has shape [batch_size, num_tokens, dim]
and mask has shape [batch_size, num_tokens].

The latlons just got added to the front of the sequence, so you can do

upd_mask[:, 0] = 1

right after line 489 (where self.add_token(latlon_tokens, x, upd_mask, orig_indices) is called) to update the mask so that the latlon token will be ignored entirely.
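A shape-level sketch of the suggestion, using NumPy arrays instead of torch tensors so it is self-contained (the real `x` and `upd_mask` are torch tensors, but the indexing is identical):

```python
import numpy as np

batch_size, num_tokens, dim = 4, 7, 16
x = np.zeros((batch_size, num_tokens, dim))    # token embeddings
upd_mask = np.zeros((batch_size, num_tokens))  # 0 = attend, 1 = ignore

# add_token prepended the latlon token at position 0, so masking
# column 0 hides it from the encoder for every sample in the batch:
upd_mask[:, 0] = 1
```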

Reply:

Added this line as you suggested, instead of filling latlons with zeros in the dataset: 0cf1e60

kvantricht and others added 30 commits October 8, 2024 18:57