Skip to content

Latest commit

 

History

History
26 lines (18 loc) · 2 KB

README.md

File metadata and controls

26 lines (18 loc) · 2 KB

Ingest

This portion of jesterj defines a software package that can be run to effect the transport of documents into a search engine. This software is entirely back end with no UI other than log file output. (A UI will be created under /control in subseqent releases). The primary things that set the current release of Jesterj appart from tradditional ETL and many other solutions for getting documents into search engines are:

  1. It is built for search. Transformations are performed on ket to list of value datastructures, not rows (Documents), gone are the days of repeatedly splitting and joining delimited lists when manipulating data for multivalue fields!
  2. Zero infrastructure config for simple to moderate projects. No database to setup or connect, no hadoop or spark cluster to build out, no need to push configs or state into zookeeper. All you need is the data source, the search engine, a Plan for processing the data and this jar file.
  3. Ability to handle branched and joined paths (Full DAG processing)
  4. Built in fault tolerance out of the box, even for complex processing workflow

Running

  1. Checkout and build the head revision from the repository to produce a "node" jar (much better than last release now) The command to build a jar is ./gradlew packageUnoJar
  2. java -jar jesterj-node-1.0-beta2.jar

This will print usage info. This jar contains all dependencies, and thus can be copied to any machine and run without any additional setup. It will create <user_home_dir>/.jj and place logs and files needed for it's embedded cassandra database there. These directories can be relocated via symlinks if desired after they have been created.

However without an implementation of a PlanProvider, we can't do much useful, so next you'll want to check out our Documentation

System Requirements

  • Posix Operating system (Linux, BSD, Mac OS X, Windows is not supported)
  • Java 11 installed (Tested: )
  • Minimum 4 core (8 thread) cpu recommended