Update infer and execute API to take prompts from txt file for BS>=1 #11

Merged
quic-mamta merged 29 commits into quic:main on May 23, 2024

Conversation

quic-mamta
Contributor

@quic-mamta quic-mamta commented May 16, 2024

Update the infer and execute APIs to take prompts from a txt file for BS>=1.
If the user passes BS=1 with multiple prompts and wants to execute all of them, that is also supported.

  1. The user can pass either --prompt or --prompts_txt_file_path.
  2. With --prompt, multiple prompts are passed using the "|" delimiter.
  3. At least one argument, either --prompt or --prompts_txt_file_path, is required.
  4. If only one argument is given, inputs are taken from that argument.
  5. If both options are given, inputs are taken from the txt file and a warning is issued.
  6. For BS>1, if the number of prompts provided in the txt file or in --prompt doesn't match BS, an error reporting the mismatch is raised.
  7. Running in a loop is currently not supported for BS>1.
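
The rules above can be read as a small prompt-resolution step. The following is only a minimal sketch of that reading, not the code from this PR: the helper name `resolve_prompts` is hypothetical, and the assumption that the txt file holds one prompt per non-empty line is mine, since the description does not specify the file layout.

```python
import warnings
from pathlib import Path
from typing import List, Optional


def resolve_prompts(
    prompt: Optional[str],
    prompts_txt_file_path: Optional[str],
    batch_size: int,
) -> List[str]:
    """Return the prompts to run, following rules 1-7 of the description.

    Hypothetical helper for illustration only.
    """
    # Rule 3: at least one of the two options is required.
    if prompt is None and prompts_txt_file_path is None:
        raise ValueError("Pass either --prompt or --prompts_txt_file_path.")

    if prompts_txt_file_path is not None:
        # Rule 5: when both are given, the txt file wins and a warning is issued.
        if prompt is not None:
            warnings.warn("Both options were given; reading prompts from the txt file.")
        # Assumption: one prompt per non-empty line.
        text = Path(prompts_txt_file_path).read_text(encoding="utf-8")
        prompts = [line.strip() for line in text.splitlines() if line.strip()]
    else:
        # Rule 2: multiple prompts are separated by "|".
        prompts = [part.strip() for part in prompt.split("|")]

    # Rules 6 and 7: no looping for BS>1, so the counts must match exactly.
    if batch_size > 1 and len(prompts) != batch_size:
        raise ValueError(
            f"Mismatch: got {len(prompts)} prompts for batch size {batch_size}."
        )
    return prompts
```

With BS=1 the function returns every prompt it finds, which matches the note that multiple prompts with BS=1 are all executed.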

Example commands:

  1. For BS=1:
    python -m QEfficient.cloud.infer --model-name gpt2 --batch-size 1 --prompt-len 32 --ctx-len 128 --num-cores 16 --device-group [0] --prompt "My name is"

  2. For BS>1:
    python -m QEfficient.cloud.infer --model-name gpt2 --batch-size 3 --prompt-len 32 --ctx-len 128 --num-cores 16 --device-group [0] --prompt "My name is | The sun rises from | The flat earth theory is the belief that"

  3. For BS>=1:
    python -m QEfficient.cloud.infer --model-name gpt2 --batch-size 3 --prompt-len 32 --ctx-len 128 --num-cores 16 --device-group [0] --prompts_txt_file_path examples/prompts.txt

@anujgupt-github
Contributor

Regarding the description:
"
for bs>1,
python -m QEfficient.cloud.infer --model-name gpt2 --batch-size 3 --prompt-len 32 --ctx-len 128 --num-cores 16 --device-group '[0]' --inputs_file_path examples/prompts.txt
"

Can a user still use prompts.txt for BS=1 if there's just a single prompt in the txt file? If not, that should be allowed too.

@quic-mamta
Contributor Author

quic-mamta commented May 17, 2024

Regarding the description: " for bs>1, python -m QEfficient.cloud.infer --model-name gpt2 --batch-size 3 --prompt-len 32 --ctx-len 128 --num-cores 16 --device-group '[0]' --inputs_file_path examples/prompts.txt "

Can a user still use prompts.txt for BS=1 if there's just a single prompt in the txt file? If not, that should be allowed too.

Yes, prompts.txt also works with BS=1. I have updated the description.

@quic-mamta quic-mamta changed the title from "Update infer and execute API to take prompts from txt file for bs>1" to "Update infer and execute API to take prompts from txt file for BS>=1" on May 17, 2024
@anujgupt-github
Contributor

LGTM

Contributor

@ochougul ochougul left a comment

Could you write a function that takes input_file_path, qpc_path, and prompt as inputs and decides how to run the QPC on the device based on these three parameters?

That should remove the code duplicated across the two files, since both can call the same function.

@ochougul ochougul added the enhancement label May 17, 2024
@ochougul
Contributor

Also, shouldn't we support the same feature via the low-level API, i.e. cloud_ai_100_exec_kv? I think it would be easier to let this API take qpc_path, device_id, and tokenizer as mandatory parameters, with prompt and input_file_path as optional parameters of which the user must pass one.
The API then decides how to execute. This way nothing changes in the infer and execute files: we just pass an extra parameter to cloud_ai_100_exec_kv, and it handles everything else.

We would then support this feature via both the low-level and the high-level API.
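
As a rough illustration of the shape being suggested here, the sketch below keeps qpc_path, device_id, and tokenizer mandatory, accepts exactly one of prompt or input_file_path, and reports timing once for the whole call. The names `exec_kv_sketch`, `load_prompts`, and `run_one` are hypothetical stand-ins; this is not the signature of cloud_ai_100_exec_kv, and `run_one` merely represents whatever actually executes a prompt on the device.

```python
import time
from typing import Any, Callable, List, Optional, Sequence


def load_prompts(prompt: Optional[str], input_file_path: Optional[str]) -> List[str]:
    """Load prompts from exactly one source (hypothetical helper)."""
    if (prompt is None) == (input_file_path is None):
        raise ValueError("Pass exactly one of prompt or input_file_path.")
    if prompt is not None:
        return [p.strip() for p in prompt.split("|")]
    with open(input_file_path, encoding="utf-8") as f:
        # Assumption: one prompt per non-empty line.
        return [line.strip() for line in f if line.strip()]


def exec_kv_sketch(
    qpc_path: str,
    device_id: Sequence[int],
    tokenizer: Any,
    run_one: Callable[[Any, str], str],
    prompt: Optional[str] = None,
    input_file_path: Optional[str] = None,
) -> List[str]:
    """Run every prompt in turn (the BS=1 case) and report timing once."""
    prompts = load_prompts(prompt, input_file_path)
    start = time.perf_counter()
    outputs = [run_one(tokenizer, p) for p in prompts]
    elapsed = time.perf_counter() - start
    # One summary for the whole call instead of one per prompt.
    print(f"{qpc_path} on devices {list(device_id)}: "
          f"{len(prompts)} prompt(s) in {elapsed:.3f}s")
    return outputs
```

Under this shape, the infer and execute scripts would only forward their arguments, so the prompt handling lives in one place.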

@ochougul
Contributor

ochougul commented May 18, 2024

  1. Add an XOR condition so that exactly one of prompt or input_file_path is passed.
  2. Figure out BS from specializations.json in the case of execute.
  3. Move the logic into the exec_kv call so it doesn't print stats multiple times when BS=1 and len(prompt)>1; also clean up the code.
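
Items 1 and 2 could look roughly like the sketch below. It uses argparse's mutually exclusive group for the XOR condition at the CLI level and reads the batch size from a specializations file. The function names are hypothetical, and the JSON layout shown in the docstring is an assumption, since the thread does not show the file's contents.

```python
import argparse
import json
from pathlib import Path


def build_parser() -> argparse.ArgumentParser:
    """CLI sketch: exactly one of the two prompt options must be passed."""
    parser = argparse.ArgumentParser(description="Sketch of the prompt options")
    # required=True plus mutual exclusion gives "exactly one of the two".
    group = parser.add_mutually_exclusive_group(required=True)
    group.add_argument("--prompt", type=str)
    group.add_argument("--prompts_txt_file_path", type=str)
    return parser


def batch_size_from_specializations(specializations_json_path: str) -> int:
    """Read the batch size from a specializations.json file.

    Assumed layout (not confirmed by the thread):
    {"specializations": [{"batch_size": "3", ...}, ...]}
    """
    text = Path(specializations_json_path).read_text(encoding="utf-8")
    data = json.loads(text)
    return int(data["specializations"][0]["batch_size"])
```

If both options are passed, argparse prints a usage error and exits, which is stricter than falling back to the txt file with a warning.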

@quic-mamta
Contributor Author

  1. Add an XOR condition so that exactly one of prompt or input_file_path is passed.
  2. Figure out BS from specializations.json in the case of execute.
  3. Move the logic into the exec_kv call so it doesn't print stats multiple times when BS=1 and len(prompt)>1; also clean up the code.

Updated.

quic-mamta and others added 3 commits May 21, 2024 12:15
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
quic-mamta and others added 2 commits May 21, 2024 17:01
quic-mamta and others added 3 commits May 21, 2024 17:56
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Contributor

@ochougul ochougul left a comment

everything else LGTM.

quic-mamta and others added 6 commits May 22, 2024 22:54
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
@quic-mamta quic-mamta merged commit 40fd985 into quic:main May 23, 2024
2 checks passed
vbaddi added a commit that referenced this pull request May 23, 2024
…(Branch 1.16) (#22)

Update infer and execute API to take prompts from txt file for BS>=1 (#11)

* [QEff]: Update infer and execute API to take prompts from txt file for bs>1
* Update infer and execute API
* Update infer and execute API
* Update README.md
* Update README.md
* Update README.md
* Update infer, execute and text generation interface
* Update execute.py
* Update execute.py
* Update text generation interface
* Update Notebooks
* Update README.md
* Update README.md
* Update text_generation_inference.py
* Update infer and execute and text generation interface
* Update infer.py
* Update README.md
* Update README.md
* Update README.md
* Update infer.py
* Update execute.py
* Update files
* Update files
* Update README.md
* Update QEfficientGPT2.ipynb
* Update QEfficientMPT.ipynb
* Update README.md
* Update README.md

---------

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Co-authored-by: Mamta Singh <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 7, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 7, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
ochougul pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 12, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
ochougul pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 12, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 25, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jun 25, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: quic-amitraj <[email protected]>
abukhoy pushed a commit to abukhoy/efficient-transformers that referenced this pull request Jun 28, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>

Signed-off-by: root <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 16, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: quic-amitraj <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 16, 2024
…uic#11)

Signed-off-by: mamtsing <[email protected]>
Signed-off-by: mamtsing <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: quic-mamta <[email protected]>
Signed-off-by: Mamta Singh <[email protected]>
Signed-off-by: Onkar Chougule <[email protected]>
Signed-off-by: quic-amitraj <[email protected]>
Signed-off-by: amitraj <[email protected]>
quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 19, 2024
…uic#11)

quic-amitraj pushed a commit to quic-amitraj/efficient-transformers that referenced this pull request Jul 24, 2024
…uic#11)

shubhagr-quic pushed a commit to shubhagr-quic/efficient-transformers that referenced this pull request Oct 4, 2024
…uic#11)

Labels
enhancement New feature or request

3 participants