- Information-theoretic measures of distributional similarity
- Entropy
- Cross entropy
- Kullback-Leibler divergence
- Jensen-Shannon divergence
- Text preprocessing using Shell commands
- Naive Bayes text categorization model
- Cocke-Younger-Kasami parsing implementation
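
As a rough illustration of the information-theoretic measures listed above, the following sketch computes each one for discrete distributions given as lists of probabilities. The function names and the choice of base-2 logarithms are assumptions made for this example, not necessarily what the accompanying code uses.

```python
import math

def entropy(p):
    """Shannon entropy H(p) in bits; zero-probability terms contribute 0."""
    return -sum(x * math.log2(x) for x in p if x > 0)

def cross_entropy(p, q):
    """Cross entropy H(p, q) in bits; infinite if q is 0 where p is positive."""
    total = 0.0
    for x, y in zip(p, q):
        if x > 0:
            if y == 0:
                return math.inf
            total -= x * math.log2(y)
    return total

def kl_divergence(p, q):
    """Kullback-Leibler divergence D(p || q) = H(p, q) - H(p)."""
    return cross_entropy(p, q) - entropy(p)

def js_divergence(p, q):
    """Jensen-Shannon divergence: average KL of p and q to their midpoint m."""
    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)
```

Unlike the Kullback-Leibler divergence, the Jensen-Shannon divergence is symmetric and always finite, because the midpoint distribution is nonzero wherever either input is.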