Improve complete.bash #113

Closed · wants to merge 5 commits

Conversation


@lakshayrohila commented Feb 15, 2024

What does it do?

Improves complete.bash by:

  • removing the redundant loop in _tldr_get_files(), which did nothing but re-echo the same output.
  • removing the redundant | sort | uniq calls.
  • making _tldr_get_files() return only the matching results instead of thousands of lines each time (see the sketch after this list).
  • applying a fix similar to "Fix: prevent freezing of autocomplete in Zsh" (#105) for Bash.
  • fixing completion of options such as --vers, which the earlier version could not complete; this version completes it to tldr --version.
  • adding command-line options that were previously missing from this file (such as --verbose).
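
For reference, here is a minimal sketch of the slimmed-down helper, assembled from the diff discussed below; the function name, usage comment, page path, and find invocation all come from this PR, and only the comments are added:

# usage: _tldr_get_files [architecture] [semi-completed word to search]
_tldr_get_files() {
    # Let find filter on the partially typed word, so only matching pages are
    # printed and the old echo loop and `| sort | uniq` are no longer needed.
    find "$HOME"/.tldrc/tldr/pages/"$1" -name "$2"'*.md' -exec basename -s .md {} +
}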

Why the change?

The autocomplete function freezes for almost 20 seconds on my system (I am using it in Bash), for the reasons listed above. This version makes completion nearly instantaneous.

How can this be tested?

Install the autocomplete file as described in this repo's README.md, then try to use completion (by pressing Tab) in Bash.
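
For example (complete.bash is the file name from this PR's title; the exact install location follows from the README):

$ source complete.bash      # or start a new shell after installing the file
$ tldr --vers<TAB>          # should complete to: tldr --version
$ tldr he<TAB>              # should list only pages whose names start with "he"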

Where to start code review?

I've already mentioned the major changes in the What does it do? section.

Questions?

N/A

Checklist

  • I have checked there aren't any existing PRs open to fix this issue/add this feature.
  • I have compiled the code with make and tested the change in an active installation with sudo make install.

Note for (2): since this PR does not change the code of the actual tldr program, I have not done (2); instead, I tested the change with the steps described in the How can this be tested? section.

@kbdharun requested a review from sbrl on February 16, 2024 at 04:40
Comment on lines +7 to +9
# usage: _tldr_get_files [architecture] [semi-completed word to search]
_tldr_get_files() {
    local ret
    local files="$(find $HOME/.tldrc/tldr/pages/$1 -name '*.md' -exec basename {} .md \;)"

    IFS=$'\n\t'
    for f in $files; do
        echo $f
    done
    find "$HOME"/.tldrc/tldr/pages/"$1" -name "$2"'*.md' -exec basename -s .md {} +
@lakshayrohila (Author) commented Feb 20, 2024


I would recommend that the reviewers test whether find ... -name '*.md' ... or find ... -name "$2"'*.md' ... performs better.

Comparing the performance of the _tldr_complete function using the time command does not help, since both methods take almost the same total time.

Using the following commands, I was able to determine that after applying this change, the bottleneck is the find command:

$ word="he"
$ cmpl=`find $HOME/.tldrc/tldr/pages/linux/ -name "$word"'*.md' -exec basename -s '.md' {} + && find $HOME/.tldrc/tldr/pages/common/ -name "$word"'*.md' -exec basename -s '.md' {} +`; cmpl_sorted_n_uniq=`printf "%s" "$cmpl" | sort | uniq`
$ compgen -W "$cmpl_sorted_n_uniq" -- "$word" | pv -lr > /dev/null
$ ( find $HOME/.tldrc/tldr/pages/linux/ -name "$word"'*.md' -exec basename -s '.md' {} + && find $HOME/.tldrc/tldr/pages/common/ -name "$word"'*.md' -exec basename -s '.md' {} + ) | pv -rl >/dev/null

The results will be in this form:

[compgen's speed]
[find's speed]

In the other case (using only -name '*.md' in the find command), I found that compgen was the bottleneck instead of find, using the following commands:

$ word="he"
$ cmpl=`find $HOME/.tldrc/tldr/pages/linux/ -name '*.md' -exec basename -s '.md' {} + && find $HOME/.tldrc/tldr/pages/common/ -name '*.md' -exec basename -s '.md' {} +`; cmpl_sorted_n_uniq=`printf "%s" "$cmpl" | sort | uniq`
$ compgen -W "$cmpl_sorted_n_uniq" -- "$word" | pv -lr > /dev/null
$ ( find $HOME/.tldrc/tldr/pages/linux/ -name '*.md' -exec basename -s '.md' {} + && find $HOME/.tldrc/tldr/pages/common/ -name '*.md' -exec basename -s '.md' {} + ) | pv -rl >/dev/null

The result will be in the same form as described above.


The results I got:

  1. with find ... -name '*.md' ...:
    • compgen's speed: 1.69k/s
    • find's speed: 241k/s
  2. with find ... -name "$2"'*.md' ...:
    • compgen's speed: 255k/s
    • find's speed: 1.48k/s

Both methods took about 0.05 seconds on my system (measured with time _tldr_complete).
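
For anyone wanting to reproduce the time _tldr_complete measurement, one possible invocation is sketched below; it assumes _tldr_complete reads the standard Bash completion variables COMP_WORDS, COMP_CWORD, and COMPREPLY, which is the usual convention but is not shown in this thread:

$ source complete.bash                  # load the completion function
$ COMP_WORDS=(tldr he); COMP_CWORD=1    # simulate typing "tldr he<TAB>"
$ time _tldr_complete                   # measure the completion function alone
$ printf '%s\n' "${COMPREPLY[@]}"       # inspect the generated completions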

Member


In theory find ... | xargs -P "$(nproc)" basename ... could potentially be faster. I wonder if using a pure-bash alternative to basename or an awk/sed solution there could help? The trouble is that you're spawning a new process for each line with the basename call there.
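
As a rough illustration of the kind of basename-free variant suggested here (not part of this PR; -printf is GNU-find-specific, so portability is an assumption):

_tldr_get_files() {
    # Print only the file name (%f) straight from find, then strip the .md
    # suffix with a single sed process instead of running basename at all.
    find "$HOME"/.tldrc/tldr/pages/"$1" -name "$2"'*.md' -printf '%f\n' | sed 's/\.md$//'
}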

@elliotfontaine

Thanks for the quick fix, @lakshayrohila!

@sbrl (Member) left a comment


Looks like a good improvement to me. Avoiding the basename subprocess would increase performance further - maybe by a significant margin.

So sorry it's taken until now for me to review this! Life has been hectic.

@lakshayrohila (Author)

@sbrl, @elliotfontaine thanks for all your replies! I have become a bit busy and won't be able to do anything further on this issue, so I am closing the PR. Please feel free to take it on yourselves; it should be just a quick copy-and-paste of this PR's code plus whatever further edits you want to make.
