stock price

Stock price is a small project generating random prices of Bovespa stocks and its goal is to guide my studies using Kafka ecosystem.

Simply put, Apache Kafka is a kind of merge between message queue and publish-subscribe message patterns, and both part of a larger message-oriented middleware system.

Basic concepts

Kafka is a distributed stream-processing platform designed to provide high-throughput, low-latency platform for handling real-time data feeds. It is heavily influenced by transaction logs (history of actions executed by RDBMS to guarantee ACID transactions).

What does that mean? It is like a log file, an append-only system ordered by time. Every incoming record is appended to the end of the "file" (log) and consumers read from left to right.

Topic

Records are published into topics and each topic can have one or more partitions. And each partition is an ordered and immutable sequence of events that is appended to. Every record in the partition is assigned a sequential id number called offset which uniquely identifies that record inside a partition.

Retention

Kafka records are durable using a configurable retention period. For example, if the retention policy is set to two days, then for the two days after a record is published, it is available for consumption, after which it will be discarded to free up space.

Performance is effectively constant with respect to data size so storing data for a long time is not a problem. Kafka will perform the same whether you have 50 KB or 50 TB of persistent data on the server.

Distributed

Partitions of a topic are distributed over the servers in a Kafka cluster and they are replicated to handle fault tolerance (configurable number of replicas).

Producers

Producers publish data to topics. The producer is responsible for choosing the partition in a topic where it wants to publish. This can be done in a round-robin fashion to balance load or according to some specific semantic, like data-affinity where some keys go to a specific partition.

Consumers

Consumers live in a consumer group and each record published on a topic is delivered to only one consumer within a consumer group.

Useful commands

After downloading Kafka and setting it up, use the following commands:

Start ZooKeeper: bin/zookeeper-server-start.sh config/zookeeper.properties
Start Kafka: bin/kafka-server-start.sh config/server.properties
Run MainAppStockPriceProducer.java
Run MainAppStockPriceConsumer.java

Improvements

Replace Jackson by Avro aiming to improve serialization speed
Too many things to list here :P

Versions

Java 10
Scala 2.12
Apache Kafka version 1.1.0

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
images		images
src/main/java/br/com/lellis/stockprice		src/main/java/br/com/lellis/stockprice
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

stock price

Basic concepts

Topic

Retention

Distributed

Producers

Consumers

Useful commands

Improvements

Versions

About

Releases

Packages

Languages

brunolellis/stock-price-kafka

Folders and files

Latest commit

History

Repository files navigation

stock price

Basic concepts

Topic

Retention

Distributed

Producers

Consumers

Useful commands

Improvements

Versions

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages