
fix for token_type_ids #149

Open
wants to merge 3 commits into base: release/v1.18

Conversation

quic-mamta
Contributor

@quic-mamta quic-mamta commented Oct 10, 2024

Fix token type ids in pytorch inputs for falcon 180B

Signed-off-by: Mamta Singh <[email protected]>
@@ -53,6 +53,7 @@ def prepare_pytorch_inputs(self):
     input_ids = inputs["input_ids"]
     batch_size, input_len = input_ids.shape
     inputs.pop("attention_mask")
+    inputs.pop("token_type_ids", None)
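The one-line change drops `token_type_ids` from the tokenizer output before it is handed to the model, since models such as Falcon do not accept that argument in their forward pass. A minimal sketch of the idea, using a plain dict to stand in for the tokenizer output (`prepare_inputs` and the sample values are illustrative, not the repo's actual code):

```python
def prepare_inputs(inputs):
    """Strip keys the target model's forward() does not accept.

    `inputs` mimics a tokenizer's output dict; the key names follow
    the diff above, everything else is a hypothetical example.
    """
    inputs.pop("attention_mask", None)
    # The default of None makes pop() a no-op when the tokenizer did
    # not emit token_type_ids (not all tokenizers produce this key),
    # instead of raising KeyError.
    inputs.pop("token_type_ids", None)
    return inputs


tokenized = {
    "input_ids": [[101, 2023, 102]],
    "attention_mask": [[1, 1, 1]],
    "token_type_ids": [[0, 0, 0]],
}
print(sorted(prepare_inputs(tokenized)))  # ['input_ids']
```

Using the two-argument form of `dict.pop` is what makes the fix safe across model families: tokenizers that never produce `token_type_ids` simply skip the removal.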
Contributor

Don't we need to do this in prepare_ort_inputs and also in the text_generation script? Did you test the model that has this issue with the infer CLI API?

Contributor Author

@quic-mamta commented Oct 10, 2024

I had tested with the export command provided in the Jira, but the 180B model export started and then failed due to a RAM issue. I added the changes for the ORT and AIC runs as well, and verified the infer command with a single-layer model.
