Turok is a timeseries database abstraction over DynamoDB.
Turok consists of:
- A library -
turok-lib
- for transforming metric data into Turok compatible data structures. These can then be pushed onto SQS. - An aggregator application -
turokd
- that consumes from SQS, applies aggregation rules as defined in the message and writes to DynamoDB. For any new metric,turokd
will also write to an SQS queue forturok-indexer
. - An indexer application -
turok-indexer
- that consumes from SQS and updates the index stored on S3. - A Graphite finder -
graphite-turok-reader
- that is able to read data from DynamoDB using the index on S3 and present it to Graphite. - A Data Pipeline template -
turok-keeper
- that applies a configured retention policy per data resolution.
Diagram: https://docs.google.com/drawings/d/1XZTNbfzcCVsesSF8TEH1g8TJVh_YX0wfQIbVlt7qFmE/
TODO
Message:
{
metric : String; `node1.messages_received`
start_time : String; `01-04-2014 14:35:00`
datapoints : JSON Encoded Datapoint List; [1, 2, 30000, 3, 40]
resolution : String; `20sec`, `1min`
aggregation_type : String; `sum`, `average`, `minimum`, `maximum`
}
TODO
Diagram: https://docs.google.com/drawings/d/1toZsDrXXuMDI15496urWTEOGPsw-4JweRf661tN6uCU/
Table name: <TABLE_PREFIX><TABLE_JOINER><DATE><TABLE_JOINER><RESOLUTION> ; `turok_01-04-2014_20sec`
{
metric : String; `node1.messages_received`
start_time : String; `01-04-2014 14:35:00`
datapoints : JSON Encoded Datapoint List; `[1, 2, 30000, 3, 40]`
}
Tables are partitioned by date:
- to make applying retention policies simpler,
- to make read and write DynamoDB provisioning optimisations possible.
DynamoDB only supports sets and not lists, so we need to serialise it for persistence.
TODO
TODO
Reference: http://graphite.readthedocs.org/en/latest/storage-backends.html
TODO
Reference: http://docs.aws.amazon.com/datapipeline/latest/DeveloperGuide/what-is-datapipeline.html
Install boto
and moto
:
pip install boto moto
Download the latest release of DynamoDB Local:
http://dynamodb-local.s3-website-us-west-2.amazonaws.com/dynamodb_local_latest
Un-tar, rename and move to the bin
directory:
tar -xf dynamodb_local_2014-04-24.tar.gz
mv dynamodb_local_2014-04-24 turok/bin/dynamodb_local
Reference: http://docs.aws.amazon.com/amazondynamodb/latest/developerguide/Tools.DynamoDBLocal.html
Start DynamoDB Local:
java -Djava.library.path=./bin/dynamodb_local/DynamoDBLocal_lib -jar bin/dynamodb_local/DynamoDBLocal.jar --inMemory
Start SQS moto_server:
moto_server -p 8001 sqs
Run tests:
./bin/run_tests.sh