Skip to content

mlang/wikiwordfreq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

wikiwc

A Wikipedia word frequency counter.

This project makes use of Wikipedia_Extractor to pre-process a full Mediawiki dump into basically plain text files. It then parses these files into separate words, and counts the number of occurences of each word.

Usage

As a default, wikiwc downloads the german wikipedia.

$ make WIKILANG=en

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published