The datawake is currently in alpha development.
The datawake is multi-tier software system consisting of client side applications (chrome plugin), web servers (tangelo), and distributed backend platforms (kafka,storm,etc).
The datawake captures user browsing data for future analysis while also capturing important information on web pages and supplementing that information with domain specific knowledge and tools.
see dev-env/README.md