Changes made to grammar check function #69

bitanb1999 · 2023-03-09T18:04:19Z

Please check the options that you have completed and strike-out the options that do not apply via this pull request:

a clear title and description to the Pull Request has been provided
you have read
the Contributing doc
the Developer Guide
the pull request passes the tests (./test-coverage "tests slow-tests") - this will also be visible via the Code coverage report and CI/CD task on the Pull Request
you have performed some kind of smoke test by running your changes in an isolated environment i.e. Docker container, Google Colab, Kaggle, etc...
~~[ ] the notebooks are updated (see notebooks folder, read the Notebooks docs)~~
CHANGELOG.md has been updated (please follow the existing format)

Goal or purpose of the PR

The grammar check function previously used the python language tool, which took significant time to process each text in the textual dataframe and return the output.

Changes implemented in the PR

I analyzed the alternatives available in NLP and came across two options: 1. Happy transformers with the hyperparameter tuning of Gramformer( check: https://github.com/PrithivirajDamodaran/Gramformer) and Gingerit package (check: https://github.com/Azd325/gingerit). Gingerit had a throughput time of 34.8 seconds whereas the language tool from python took 41secs to process each text. This seemed to be a huge upgrade.
Transformers are also a great alternative and did equivalently well but given the constraint of accessing Huggingface every time a text needs to be checked, seems like unnecessary overhead.
I have made the changes to the requirement file and to the grammar check python file.

sourcery-ai · 2023-03-09T18:04:26Z

Sourcery Code Quality Report

✅ Merging this PR will increase code quality in the affected files by 1.06%.

Quality metrics	Before	After	Change
Complexity	1.94 ⭐	1.94 ⭐	0.00
Method Length	35.25 ⭐	34.50 ⭐	-0.75 👍
Working memory	4.88 ⭐	4.56 ⭐	-0.32 👍
Quality	89.10% ⭐	90.16% ⭐	1.06% 👍

Other metrics	Before	After	Change
Lines	37	39	2

Changed files	Quality Before	Quality After	Quality Change
nlp_profiler/high_level_features/grammar_quality_check.py	89.10% ⭐	90.16% ⭐	1.06% 👍

Here are some functions in these files that still need a tune-up:

File	Function	Complexity	Length	Working Memory	Quality	Recommendation

Legend and Explanation

The emojis denote the absolute quality of the code:

⭐ excellent
🙂 good
😞 poor
⛔ very poor

The 👍 and 👎 indicate whether the quality has improved or gotten worse with this pull request.

Please see our documentation here for details on how these metrics are calculated.

We are actively working on this report - lots more documentation and extra metrics to come!

Help us improve this quality report!

…ioned in PR #69

neomatrix369

Overall LGTM - if you can pls address a few comments before we go ahead and merge it,

I'm waiting for the tests to also pass on GitHub actions.

nlp_profiler/high_level_features/grammar_quality_check.py

neomatrix369 · 2023-03-10T12:35:48Z

Well done with @sourcery-ai improvements

neomatrix369 · 2023-03-10T12:40:49Z

Overall really great work doing the analysis and verifying and checking other alternatives to replace the existing slow grammar checker with a better alternative, also shows we can plug-in and play different tooling with little or not too significant changes
.

requirements.txt

neomatrix369 · 2023-03-10T12:50:39Z

Please also do one last check in https://github.com/neomatrix369/nlp_profiler/blob/master/CONTRIBUTING.md to see if any dependent files need changing i.e. re-running notebooks etc, the Developer Guide is also something to review as a closing action.

Maybe you can enhance the existing grammar check example in the notebook(s) to illustrate the new package's features.

There are notebooks on this repo, please take a look at them and re-run them on your local machine to see if your changes have taken effect and no issues have arisen.

There are also markdown files in this repo, they may need a touch-up due to this change - can you pls check if that's the case?

neomatrix369 · 2023-03-12T10:29:25Z

@bitanb1999 Please read this comment and try to see if you can resolve it #69 (comment) - in either case respond on the PR with your findings

bitanb1999 · 2023-03-12T11:29:21Z

Please also do one last check in https://github.com/neomatrix369/nlp_profiler/blob/master/CONTRIBUTING.md to see if any dependent files need changing i.e. re-running notebooks etc, the Developer Guide is also something to review as a closing action.

Maybe you can enhance the existing grammar check example in the notebook(s) to illustrate the new package's features.

There are notebooks on this repo, please take a look at them and re-run them on your local machine to see if your changes have taken effect and no issues have arisen.

There are also markdown files in this repo, they may need a touch-up due to this change - can you pls check if that's the case?

I have gone through the contributing.md and I have abided by them all. Also, since the function is being just changed and there is no significant outer change in how the user will be calling the profiler, the notebooks remain as they are and the functions are to be called, as they were being called previously. Hence, no updates are needed in the notebooks.

neomatrix369 · 2023-03-12T15:54:45Z

If you see code format across the changes is inconsistent - linter/formatter would pick this up

neomatrix369 · 2023-03-12T16:16:09Z

One last thing to do is update the CHANGELOG.md for this change - its very easy to do, see how the previous ones are done

CHANGELOG.md

changes made to grammar check function

354f1fa

sourcery-ai bot mentioned this pull request Mar 9, 2023

changes made to grammar check function (Sourcery refactored) #70

Closed

sourcery ai changes incorporated and grammar function updated as ment…

2efde59

…ioned in PR #69

neomatrix369 self-requested a review March 10, 2023 12:27

neomatrix369 added enhancement New feature or request high-level feature(s) labels Mar 10, 2023

neomatrix369 requested changes Mar 10, 2023

View reviewed changes

nlp_profiler/high_level_features/grammar_quality_check.py Show resolved Hide resolved

nlp_profiler/high_level_features/grammar_quality_check.py Show resolved Hide resolved

nlp_profiler/high_level_features/grammar_quality_check.py Outdated Show resolved Hide resolved

neomatrix369 assigned bitanb1999 Mar 10, 2023

neomatrix369 reviewed Mar 10, 2023

View reviewed changes

requirements.txt Show resolved Hide resolved

comments added for readability

3e7c92f

neomatrix369 mentioned this pull request Mar 12, 2023

Spelling checker has been modified #71

Merged

6 tasks

code cleaned

5b6d95f

neomatrix369 changed the title ~~changes made to grammar check function~~ Changes made to grammar check function Mar 12, 2023

bitanb1999 and others added 2 commits March 12, 2023 23:40

code cleaned with black

b5a5dda

Update CHANGELOG.md

2b95049

neomatrix369 reviewed Mar 12, 2023

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Update CHANGELOG.md

c891ba3

neomatrix369 merged commit 614944d into neomatrix369:master Mar 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Changes made to grammar check function #69

Changes made to grammar check function #69

bitanb1999 commented Mar 9, 2023 •

edited by neomatrix369

Loading

sourcery-ai bot commented Mar 9, 2023 •

edited

Loading

neomatrix369 left a comment

neomatrix369 commented Mar 10, 2023

neomatrix369 commented Mar 10, 2023

neomatrix369 commented Mar 10, 2023 •

edited

Loading

neomatrix369 commented Mar 12, 2023

bitanb1999 commented Mar 12, 2023

neomatrix369 commented Mar 12, 2023

neomatrix369 commented Mar 12, 2023

Changes made to grammar check function #69

Changes made to grammar check function #69

Conversation

bitanb1999 commented Mar 9, 2023 • edited by neomatrix369 Loading

Goal or purpose of the PR

Changes implemented in the PR

sourcery-ai bot commented Mar 9, 2023 • edited Loading

Sourcery Code Quality Report

Legend and Explanation

neomatrix369 left a comment

Choose a reason for hiding this comment

neomatrix369 commented Mar 10, 2023

neomatrix369 commented Mar 10, 2023

neomatrix369 commented Mar 10, 2023 • edited Loading

neomatrix369 commented Mar 12, 2023

bitanb1999 commented Mar 12, 2023

neomatrix369 commented Mar 12, 2023

neomatrix369 commented Mar 12, 2023

bitanb1999 commented Mar 9, 2023 •

edited by neomatrix369

Loading

sourcery-ai bot commented Mar 9, 2023 •

edited

Loading

neomatrix369 commented Mar 10, 2023 •

edited

Loading