forked from cloudera/emailarchive
Hadoop for archiving email
bucchi/emailarchive
To run the sample, take the following steps:

1. Put the sample emails from the data folder into HDFS.
2. Run the Hadoop jobs:

    hadoop jar convertsearch.jar ConvertEmailsToSequence <sample email dir> <output dir>
    hadoop jar convertsearch.jar SearchEmail <sequence file dir>

3. The sample data contains a small set of .msg files (all copies), and the results in your /tmp dir should be identical to this
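
The steps above could be scripted roughly as follows. This is a minimal sketch, not part of the original project: the HDFS paths, the variable names, and the `DRY_RUN` switch are all hypothetical, and it assumes that the <sequence file dir> passed to SearchEmail is the <output dir> written by ConvertEmailsToSequence. By default the script only prints the commands it would run; set DRY_RUN=0 on a machine with a working Hadoop client to execute them.

```shell
#!/bin/sh
# Sketch of the sample run. All paths below are assumed placeholders.
set -eu

LOCAL_DATA="${LOCAL_DATA:-data}"                  # local folder with the sample .msg files
HDFS_INPUT="${HDFS_INPUT:-emailarchive/input}"    # hypothetical HDFS dir for the raw emails
HDFS_SEQ="${HDFS_SEQ:-emailarchive/sequence}"     # hypothetical HDFS dir for the sequence files
DRY_RUN="${DRY_RUN:-1}"                           # 1 = print commands only, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

# Step 1: copy the sample emails into HDFS.
run hadoop fs -put "$LOCAL_DATA" "$HDFS_INPUT"

# Step 2: convert the emails to sequence files, then search them.
run hadoop jar convertsearch.jar ConvertEmailsToSequence "$HDFS_INPUT" "$HDFS_SEQ"
run hadoop jar convertsearch.jar SearchEmail "$HDFS_SEQ"
```

Running it once in dry-run mode is a cheap way to check the directory arguments before submitting the jobs.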