implement `--sort-files` option #263

daxim · 2016-12-01T11:12:45Z

Ack has it. I always want to receive sorted output because I then can easily cross-check by eye-balling and compare the list of search results with the output of ls or tree, which are also sorted.

The text was updated successfully, but these errors were encountered:

BurntSushi · 2016-12-01T12:03:56Z

This is related to #152 but it subtly different. #152 is asking for deterministic output and you're asking for sorted output. What do you want to sort by? Should it be customizable? What do you hope for this to achieve that, say, rg foo | sort does not? Is it acceptable to lose parallelism? (I hope so, because doing this while retaining parallelism seems hard.)

nerdrew · 2016-12-02T00:05:56Z

@BurntSushi Hmmm. My bad. I don't care about sorting as much as I care about grouping by directory. So #152 seems more like what I'm looking for.

E.g.

dir1/file1.rs:blah
dir2/file4.rs:blah
dir1/file2.rs:blah

vs

dir1/file1.rs:blah
dir1/file2.rs:blah
dir2/file4.rs:blah

Totally understand the performance penalty for grouping results. Is it feasible to use the parallel runners to do the searching and aggregate the results afterward?

BurntSushi · 2016-12-02T00:14:37Z

@nerdrew Well, I mean, yes, that's what a hypothetical solution would have to do. But now you've introduced a cost: extra memory use. There might be extra time cost too, for having to do the aggregate, but it could be immeasurable.

Of course, that might have been feasible in 0.2.x, but 0.3.x introduced a parallel directory iterator so that actually crawling through the directories themselves is parallelized. Making that do aggregation (and importantly, knowing when an aggregation is complete) seems hard.

I would say that there's basically two options here:

Deal with -j1 if you want determinism.
Implement a new parallel searcher that's similar to the one in 0.2.x, but add aggregation. (This wouldn't be that hard.)

daxim · 2016-12-02T03:06:38Z

What do you want to sort by?

By the relative path name of the matching files; treat it as a string. Directory depth does not matter.

Should it be customizable?

No, it should simply follow LC_COLLATE.

What do you hope for this to achieve that, say, rg foo | sort does not?

The output of normal ack --sort-files on an interactive tty is human-readable with its path name headings above the matching lines, each group visually separated from each other, line numbers and colours. But when rg is piped into something, it's not human-readable any more. The path names are smushed together with the matching lines, there are no double line feeds to create paragraphs, line numbers are lost and results are not highlighted any more.

Is it acceptable to lose parallelism?

Yes.

BurntSushi · 2016-12-02T11:45:54Z

If --sort-files can imply -j1, then I think this is a relatively straight-forward thing to implement. It involves calling WalkDir::sort_by from the the single threaded ignore::Walk iterator, which in turn requires exposing a sort_by method on `ignore::WalkBuilder.

BurntSushi added the question An issue that is lacking clarity on one or more points. label Dec 1, 2016

BurntSushi mentioned this issue Dec 1, 2016

Add sort option? #260

Closed

BurntSushi closed this as completed in b65a8c3 Jan 7, 2017

BurntSushi mentioned this issue Sep 23, 2017

Add --offset option to allow for pagination of large files #608

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement `--sort-files` option #263

implement `--sort-files` option #263

daxim commented Dec 1, 2016

BurntSushi commented Dec 1, 2016

nerdrew commented Dec 2, 2016

BurntSushi commented Dec 2, 2016

daxim commented Dec 2, 2016

BurntSushi commented Dec 2, 2016

implement --sort-files option #263

implement --sort-files option #263

Comments

daxim commented Dec 1, 2016

BurntSushi commented Dec 1, 2016

nerdrew commented Dec 2, 2016

BurntSushi commented Dec 2, 2016

daxim commented Dec 2, 2016

BurntSushi commented Dec 2, 2016

implement `--sort-files` option #263

implement `--sort-files` option #263