Apache Bahir provides extensions to distributed analytics platforms such as Apache Spark & Apache Flink.
The Initial Bahir source code (see issue BAHIR-1) containing the source for the Apache Spark streaming connectors for akka, mqtt, twitter, zeromq extracted from Apache Spark revision 8301fad (before the deletion of the streaming connectors akka, mqtt, twitter, zeromq).
Source code folder structure:
- streaming-akka
- examples/src/main/...
- src/main/...
- streaming-mqtt
- examples
- src
- python
- ...
Bahir is built using Apache Maven. To build Bahir and its example programs, run:
mvn -DskipTests clean install
Testing first requires building Bahir. Once Bahir is built, tests can be run using:
mvn test
Each extension currently available in Apache Bahir has an example application located under the "examples" folder.
Currently, each submodule has its own README.md, with information on example usages and API.
Furthermore, to generate scaladocs for each module:
$ mvn package
Scaladocs is generated in, MODULE_NAME/target/site/scaladocs/index.html
. __ Where MODULE_NAME
is one of, sql-streaming-mqtt
, streaming-akka
, streaming-mqtt
, streaming-zeromq
, streaming-twitter
. __
Currently, each module in Bahir is available through spark packages. Please follow linking sub section in module specific README.md for more details.