feat: get rid of output and input conversion for user #732

Open
Gerhardsa0 opened this issue May 6, 2024 · 5 comments
Labels
enhancement 💡 New feature or request

Comments

@Gerhardsa0
Contributor

Is your feature request related to a problem?

Right now, the user needs to set both an input and an output conversion, but it's always the same type of conversion.

Desired solution

Create a single conversion that the user sees.


@lars-reimann
Member

Is this different from #656?

@Gerhardsa0
Contributor Author

> Is this different from #656?

@Marsmaennchen221 and I were thinking about removing the output and input conversions and generalizing them into a single Conversion.
The problem with that is that, for time series, I like having two separate interfaces for defining parameters such as `window_size`, `forecast_horizon`, and `prediction_name`.

@lars-reimann
Member

lars-reimann commented May 13, 2024

I'm wondering whether the output conversion is really needed. We should already know from the dataset we get as input what the desired shape of the output is.

Likewise, for the input conversions there is a strong overlap with datasets, as we discussed (a sketch follows the list below):

  • `InputConversionTable` needs no extra arguments.
  • `InputConversionImage` needs the image size, which is already in the dataset.
  • `InputConversionTimeSeries` needs the window size and forecast horizon. We could also move these to the dataset.
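
A minimal sketch of what moving these parameters into the datasets could look like; the class and field names below are illustrative assumptions, not the actual Safe-DS API.

```python
# Hedged sketch: if each dataset carries the parameters its input conversion
# needed, the user no longer has to repeat them. All names are illustrative.
from dataclasses import dataclass


@dataclass
class ImageDataset:
    image_size: tuple[int, int]  # formerly an argument of InputConversionImage


@dataclass
class TimeSeriesDataset:
    target_name: str
    window_size: int        # formerly passed to InputConversionTimeSeries
    forecast_horizon: int   # formerly passed to InputConversionTimeSeries


# The model could then read these values during fit instead of requiring a
# separate input conversion object from the user:
dataset = TimeSeriesDataset(target_name="price", window_size=7, forecast_horizon=1)
```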

Edit 1: The input conversion can still be helpful, though, to indicate the input type of `fit` and the input and output types of `predict`. We could keep these completely empty and just use them to set the type parameters of the model correctly.

Edit 2: However, we also get this information when we `fit`, based on the given dataset. Since `fit` returns a new NN instance, it could specify its type parameters then. For the unfitted model, we can just leave the type parameters at `Any`. This means we'd need a common superclass `Dataset` for all datasets, with at least two type parameters (input and output type).
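
A minimal sketch of the idea in Edit 2, under assumed signatures (the class names mirror the discussion, but nothing here is the actual Safe-DS API): a common `Dataset` superclass with input and output type parameters, which `fit` uses to bind the type parameters of the fitted model while the unfitted model stays at `Any`.

```python
from typing import Any, Generic, TypeVar

IN = TypeVar("IN")
OUT = TypeVar("OUT")


class Dataset(Generic[IN, OUT]):
    """Common superclass for all datasets; IN/OUT describe predict's types."""


class NeuralNetworkClassifier(Generic[IN, OUT]):
    def fit(self, data: Dataset[IN, OUT]) -> "NeuralNetworkClassifier[IN, OUT]":
        # fit returns a new, fitted instance whose type parameters are bound
        # by the given dataset, so no separate input/output conversion is needed.
        fitted: NeuralNetworkClassifier[IN, OUT] = NeuralNetworkClassifier()
        return fitted


# The unfitted model simply leaves its type parameters at Any:
unfitted: NeuralNetworkClassifier[Any, Any] = NeuralNetworkClassifier()
fitted = unfitted.fit(Dataset())  # type parameters now come from the dataset
```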

@Marsmaennchen221 @Gerhardsa0 @sibre28

@Marsmaennchen221
Contributor

> Edit 2: However, we also get this information when we `fit`, based on the given dataset. Since `fit` returns a new NN instance, it could specify its type parameters then. For the unfitted model, we can just leave the type parameters at `Any`. This means we'd need a common superclass `Dataset` for all datasets, with at least two type parameters (input and output type).

If we do this, what is the point of having an `__init__` at all? At least for image NNs, the image size is needed to build the internal PyTorch model, which we currently do in the `__init__` method. If we move this to the `fit` method, I don't see any real point in having an unfitted model, as the only thing we can do with it is fit it.

@lars-reimann

@lars-reimann
Member

lars-reimann commented May 13, 2024

  • We could compose several (untrained) NNs into an ensemble or data processing pipeline, which can then be trained.
  • We define the layers of the model in `__init__`. This is consistent with the classical models, where we define hyperparameters in `__init__` and differentiate between unfitted and fitted models.
  • Hyperparameter optimization (#264) can still be integrated into `__init__`, where we could specify several architectures to try.

> At least for image NNs, the image size is needed to build the internal PyTorch model, which we currently do in the `__init__` method.

Does this already initialize the weights of the model? If so, it would be considerably more efficient to do it in `fit`, since otherwise we must clone the weights to prevent mutation.
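
A hedged sketch of that point, with purely illustrative names: if the internal PyTorch module is only constructed inside `fit`, the unfitted model never holds weights, so there is nothing that would need a defensive clone.

```python
from torch import nn


class ImageClassifier:
    """Illustrative only: hyperparameters in __init__, weights created in fit."""

    def __init__(self, hidden_size: int) -> None:
        self._hidden_size = hidden_size  # no torch model, hence no weights, yet
        self._model: nn.Module | None = None

    def fit(self, image_size: int) -> "ImageClassifier":
        # Build (and thereby initialize) the internal torch model on a *new*
        # instance, so the unfitted model is never mutated and needs no clone.
        fitted = ImageClassifier(self._hidden_size)
        fitted._model = nn.Sequential(
            nn.Flatten(),
            nn.Linear(image_size * image_size, fitted._hidden_size),
        )
        # ... the training loop would run here ...
        return fitted
```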

lars-reimann added a commit that referenced this issue May 20, 2024
Partially closes #732

### Summary of Changes

Output conversions are the exact inverse of the input conversions, so there is no need to specify them separately. Now, a neural network only takes an input conversion and a list of layers. This also eliminates several errors that could occur if input and output conversions did not fit together.

In a later PR, the input conversions will also be removed, since they mirror the datasets.

---------

Co-authored-by: megalinter-bot <[email protected]>
lars-reimann pushed a commit that referenced this issue May 29, 2024
## [0.26.0](v0.25.0...v0.26.0) (2024-05-29)

### Features

* `Table.count_row_if` ([#788](#788)) ([4137131](4137131)), closes [#786](#786)
* added method to load pretrained models from huggingface ([#790](#790)) ([dd8394b](dd8394b))
* infer input size of forward and LSTM layers ([#808](#808)) ([098a07f](098a07f))
* outline around dots of scatterplot ([#785](#785)) ([ee8acf7](ee8acf7))
* remove output conversions ([#792](#792)) ([46f2f5d](46f2f5d)), closes [#732](#732)
* shorten some excessively long names ([#787](#787)) ([1c3ea59](1c3ea59)), closes [#772](#772)
* specify column names in constructor of table transformers ([#795](#795)) ([69a780c](69a780c))
* store window size and forecast horizon in dataset ([#794](#794)) ([f07bc5a](f07bc5a))
* string operations on cells ([#791](#791)) ([4a17f76](4a17f76))

### Bug Fixes

* handling of boolean columns in column statistics ([#778](#778)) ([f61cceb](f61cceb))
* sort x values of line plot ([#782](#782)) ([74d8649](74d8649))