Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compiling PyTorch LSTM with Hidden State Input #1074

Open
vishal-chandra opened this issue Oct 7, 2024 · 3 comments · Fixed by #1086
Open

Compiling PyTorch LSTM with Hidden State Input #1074

vishal-chandra opened this issue Oct 7, 2024 · 3 comments · Fixed by #1086
Labels

Comments

@vishal-chandra
Copy link

Summary

Getting a C++ redefinition error on hidden state input when compiling a pytorch model with an LSTM error

Details

I have a model that contains an LSTM layer with the hidden state to that layer passed into model.forward some of the time (default None, same as the torch LSTM layer definition). When I remove this parameter of model.forward, the model compiles fine. However, with it included, the hidden state input variable is shown fine in the ModelGraph inputs, but is defined twice in the resulting C++ code, leading to a compilation error.

Relevant Code Snippet

def forward(self, ..., hidden):
...
lstm_out, hidden_out = self.lstm(lstm_input, hidden)
...

Resulting Error

Writing HLS project
Done
In file included from firmware/myproject.cpp:3:
firmware/myproject.h:14:14: error: redefinition of parameter 'hidden'
input3_t hidden[N_INPUT_1_3N_INPUT_2_3N_INPUT_3_3], result_t layer11_out[N_LAYER_11]
^
firmware/myproject.h:13:102: note: previous declaration is here

Steps to Reproduce

Working on hls4ml 0.9.0.dev65+gafed23b1 and torch 2.6.0. Create a simple model with an LSTM layer and the hidden state as a parameter to the forward function and compile the model.

Expected behavior

Hidden is listed only once in ModelGraph.inputs and is defined only once in the C++ code.

Actual behavior

Hidden is listed only once in ModelGraph.inputs but is defined twice in the C++ code, leading to a compilation error. I also notice that in the printed network topology, the LSTM layer is listed as having only one input.

@JanFSchulte
Copy link
Contributor

Hi!

Thanks for using hls4ml and reporting this issue.

We actually currently do not support passing of non-zero tensors for the initial hidden state in recurrent layers in Pytorch (see the description of the PR that introduced them #850). Unfortunately, we have not really documented this limitation so far, that needs to be improved. Of course, this feature should also be properly supported, but so far we haven't quite managed to do it.

However, the code should still compile and produce results, just that those results will be numerically incorrect. This test (https://github.com/fastmachinelearning/hls4ml/blob/main/test/pytest/test_recurrent_pytorch.py#L49-L91) for example passes tensors of zeroes and works fine. Once I use non-zero tensors, the output is incorrect, but I can't reproduce your compilation issues. Could you share a minimal model and code you use to invoke hls4ml?

@vishal-chandra
Copy link
Author

Hi Jan,

Sorry to miss your reply. I actually also can't reproduce the compilation issue with a minimal LSTM-only model, so it must be something with how my larger model is constructed. Was just converting with convert_from_pytorch_model() and compiling from there.

I see that #1086 mentions a one-line fix to this issue in the pytorch parser -- are hidden states now supported with pytorch?

@JanFSchulte
Copy link
Contributor

Hi! Sorry, I tagged this issue into that PR by mistake.

I just made a PR that should address this issue and implements the passing of initial values for the hidden and cell states in LSTMs (and GRUs): #1120

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants