
feat: Add multi-layer, bidirectional RNN with tanh activation #28804

Open · wants to merge 6 commits into main
Conversation

muzakkirhussain011
Contributor

PR Description

This pull request introduces a multi-layer bidirectional RNN with tanh activation.

Changes Made:

  • Implemented rnn_tanh function for multi-layer RNNs with optional bidirectional configuration.
  • Added rnn_tanh_update for processing RNN cell updates.
  • Included support for weight transformation and bidirectional RNN configurations.

Key Features:

  • Handles both unidirectional and bidirectional RNNs.
  • Supports multiple layers and optional biases.
  • Accepts dropout and training-mode arguments for interface compatibility, though dropout is not yet applied.

Usage:

  • Call rnn_tanh with input data, initial state, weights, and configuration parameters.
  • The function returns a tuple of the output at the final time step, the full output sequence, and the hidden states from each layer and direction.

This update extends Ivy's RNN support to multi-layer and bidirectional configurations.
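
For illustration, a minimal call sketch is shown below. The argument order, keyword names, and flat weight layout are assumptions modeled on torch's _VF.rnn_tanh; the exact ivy.rnn_tanh signature introduced in this PR may differ.

import ivy

# Hypothetical sizes for the sketch.
input_size, hidden_size, num_layers, num_directions = 8, 16, 2, 2  # bidirectional
batch, steps = 4, 10

x = ivy.random_normal(shape=(batch, steps, input_size))            # input data
h0 = ivy.zeros((num_layers * num_directions, batch, hidden_size))  # initial state

# Flat weight list: (W_ih, W_hh, b_ih, b_hh) per layer and direction,
# following the layout torch uses for its flat RNN weights (assumed here).
weights = []
for layer in range(num_layers):
    in_size = input_size if layer == 0 else hidden_size * num_directions
    for _ in range(num_directions):
        weights += [
            ivy.random_normal(shape=(hidden_size, in_size)),
            ivy.random_normal(shape=(hidden_size, hidden_size)),
            ivy.zeros((hidden_size,)),
            ivy.zeros((hidden_size,)),
        ]

final_step, output_seq, h_outs = ivy.rnn_tanh(
    x, h0, weights,
    True,          # has_biases
    num_layers,
    0.0,           # dropout (not yet applied)
    False,         # train
    True,          # bidirectional
    batch_first=True,
)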

Related Issue

#28798

Checklist

  • Did you add a function?
  • Did you add the tests?
  • Did you run your tests and are your tests passing?
  • Did pre-commit not fail on any check?
  • Did you follow the steps we provided?

func_wrapper.py is a Python module designed to streamline the integration of Hugging Face Transformers into your natural language processing (NLP) projects. It provides a set of input and output conversion wrappers to simplify the process of passing data between your custom functions and Transformers' data structures.

Input Conversion Wrappers:

inputs_to_transformers_tensors: This wrapper converts input data (text, tensors, etc.) into Transformers-compatible data structures. It is particularly useful when your custom functions expect diverse input types.
Output Conversion Wrappers:

outputs_to_pytorch_tensors: After your custom function returns data, this wrapper ensures that the output data is converted into PyTorch tensors or other appropriate formats.
Usage:

Import func_wrapper.py into your project.
Initialize a Hugging Face Transformers model and tokenizer.
Wrap your custom function with to_transformers_tensors_and_back. This wrapped function can now accept and return Transformers-compatible data.
Here's a simple example of how to use func_wrapper.py:


import torch
from transformers import BertForSequenceClassification, BertTokenizer
from ivy.functional.frontends.transformers.func_wrapper import to_transformers_tensors_and_back

# Initialize the model and tokenizer
model_name = "bert-base-uncased"
model = BertForSequenceClassification.from_pretrained(model_name)
tokenizer = BertTokenizer.from_pretrained(model_name)

# Define (or import) the custom function to be wrapped; this body is only a
# placeholder for illustration.
def your_function(text, tensor):
    return text, tensor

# Wrap the custom function using the conversion wrappers
wrapped_function = to_transformers_tensors_and_back(your_function, model, tokenizer)

# Prepare sample input data
sample_input_text = "This is a sample input text."
sample_input_tensor = torch.rand((3, 3))

# Call your wrapped function with the sample input data
output = wrapped_function(sample_input_text, sample_input_tensor)

# The output is automatically converted to PyTorch tensors
print(output)

Please note that func_wrapper.py is still in development, and further enhancements and refinements are expected. Your feedback and contributions to improve its functionality are welcome.
@muzakkirhussain011
Contributor Author

Hi @Sam-Armstrong

I would like to request your review of this pull request. Your input and insights are greatly valued.

Thank you,

Contributor

@Sam-Armstrong Sam-Armstrong left a comment


Hey @muzakkirhussain011 thanks for the contribution!

Please could you add tests for ivy.rnn_tanh and ivy.rnn_tanh_update? The tests will need to go in ivy_tests/test_ivy/test_functional/test_nn/test_layers.py, so you can look at the tests already there to follow the precedent of how to do this.

I think it would also make sense to add the torch frontend _VF.rnn_tanh and _VF.rnn_tanh_cell in this same PR, would you be able to look into that?

Thanks again 😁
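
For reference, a rough sketch of what the torch frontend delegation could look like (the module path, decorator, and signature below are assumptions based on how other torch frontend functions in Ivy are written, mirroring torch's _VF.rnn_tanh; the final implementation may differ):

import ivy
from ivy.functional.frontends.torch.func_wrapper import to_ivy_arrays_and_back


@to_ivy_arrays_and_back
def rnn_tanh(input, hx, params, has_biases, num_layers, dropout, train,
             bidirectional, batch_first=False):
    # Delegate straight to the new compositional ivy.rnn_tanh implementation.
    return ivy.rnn_tanh(
        input, hx, params, has_biases, num_layers, dropout, train,
        bidirectional, batch_first=batch_first,
    )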

return output[:, -1], output, h_outs


def _rnn_tanh_cell(

We could make this a public rather than private method, so ivy.rnn_tanh_cell can be used in the torch frontend rnn_tanh_cell implementation?
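
As an illustration, if _rnn_tanh_cell were promoted to a public ivy.rnn_tanh_cell, the torch frontend rnn_tanh_cell could become a thin wrapper along these lines (the argument names follow torch's rnn_tanh_cell-style cell signature and are assumptions; the decorator import is the same one used elsewhere in the torch frontend):

import ivy
from ivy.functional.frontends.torch.func_wrapper import to_ivy_arrays_and_back


@to_ivy_arrays_and_back
def rnn_tanh_cell(input, hx, w_ih, w_hh, b_ih=None, b_hh=None):
    # Single time-step tanh RNN cell, delegating to the proposed public ivy API.
    return ivy.rnn_tanh_cell(input, hx, w_ih, w_hh, b_ih, b_hh)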

@muzakkirhussain011
Copy link
Contributor Author

Hi @Sam-Armstrong,
Thanks for the feedback! I'll take a look at adding the necessary tests and explore the suggested changes. I'll keep you updated on my progress. Thanks again!
