Rust/src/string at master · fred-sheehan/Rust

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
aho_corasick.rs		aho_corasick.rs
anagram.rs		anagram.rs
autocomplete_using_trie.rs		autocomplete_using_trie.rs
boyer_moore_search.rs		boyer_moore_search.rs
burrows_wheeler_transform.rs		burrows_wheeler_transform.rs
duval_algorithm.rs		duval_algorithm.rs
hamming_distance.rs		hamming_distance.rs
jaro_winkler_distance.rs		jaro_winkler_distance.rs
knuth_morris_pratt.rs		knuth_morris_pratt.rs
levenshtein_distance.rs		levenshtein_distance.rs
manacher.rs		manacher.rs
mod.rs		mod.rs
palindrome.rs		palindrome.rs
rabin_karp.rs		rabin_karp.rs
reverse.rs		reverse.rs
run_length_encoding.rs		run_length_encoding.rs
suffix_array.rs		suffix_array.rs
suffix_tree.rs		suffix_tree.rs
z_algorithm.rs		z_algorithm.rs

String Algorithms

From Wikipedia: a string-searching algorithm invented by Alfred V. Aho and Margaret J. Corasick in 1975.[1] It is a kind of dictionary-matching algorithm that locates elements of a finite set of strings (the "dictionary") within an input text. It matches all strings simultaneously.

Burrows-Wheeler transform

From Wikipedia: The Burrows–Wheeler transform (BWT, also called block-sorting compression) rearranges a character string into runs of similar characters. This is useful for compression, since it tends to be easy to compress a string that has runs of repeated characters by techniques such as move-to-front transform and run-length encoding. More importantly, the transformation is reversible, without needing to store any additional data except the position of the first original character. The BWT is thus a "free" method of improving the efficiency of text compression algorithms, costing only some extra computation.

Properties

Worst-case performance O(n)

Knuth Morris Pratt

From Wikipedia: searches for occurrences of a "word" W within a main "text string" S by employing the observation that when a mismatch occurs, the word itself embodies sufficient information to determine where the next match could begin, thus bypassing re-examination of previously matched characters. Knuth Morris Pratt search runs in linear time in the length of W and S.

Properties

Case performance O(s + w)
Case space complexity O(w)

Manacher

From Wikipedia: find a longest palindrome in a string in linear time.

Properties

Worst-case time complexity is O(n)
Worst-case space complexity is O(n)

Rabin Karp

From Wikipedia: a string-searching algorithm created by Richard M. Karp and Michael O. Rabin that uses hashing to find an exact match of a pattern string in a text.

Hamming Distance

From Wikipedia: In information theory, the Hamming distance between two strings of equal length is the number of positions at which the corresponding symbols are different. In other words, it measures the minimum number of substitutions required to change one string into the other, or the minimum number of errors that could have transformed one string into the other. In a more general context, the Hamming distance is one of several string metrics for measuring the edit distance between two sequences. It is named after the American mathematician Richard Hamming.

Run Length Encoding

From Wikipedia: a form of lossless data compression in which runs of data (sequences in which the same data value occurs in many consecutive data elements) are stored as a single data value and count, rather than as the original run.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

string

string

README.md

String Algorithms

Aho-Corasick Algorithm

Burrows-Wheeler transform

Knuth Morris Pratt

Manacher

Rabin Karp

Hamming Distance

Run Length Encoding

Files

string

Directory actions

More options

Directory actions

More options

Latest commit

History

string

Folders and files

parent directory

String Algorithms