feat: Add consumer offsets to DESCRIBE EXTENDED #5476

jeqo · 2020-05-25T22:52:33Z

Description

Fix #3604

Testing done

Unit tests covering new options, with consumer offsets

Reviewer checklist

Ensure docs are updated if necessary. (eg. if a user visible feature is being added or changed).
Ensure relevant issues are linked (description should include text like "Fixes #")

Missing functionality

Get groupId from Source
Set timeout for Kafka Admin operations

ghost · 2020-05-25T22:52:35Z

It looks like @jeqo hasn't signed our Contributor License Agreement, yet.

The purpose of a CLA is to ensure that the guardian of a project's outputs has the necessary ownership or grants of rights over all contributions to allow them to distribute under the chosen licence.
Wikipedia

You can read and sign our full Contributor License Agreement here.

Once you've signed reply with [clabot:check] to prove it.

Appreciation of efforts,

clabot

jeqo · 2020-05-26T19:08:10Z

Current output:

ksql> describe extended orders;

Name                 : ORDERS
Type                 : STREAM
Timestamp field      : Not set - using <ROWTIME>
Key format           : KAFKA
Value format         : JSON
Kafka topic          : test_topic (partitions: 12, replication: 1)
Statement            : CREATE STREAM orders (
    ROWKEY INT KEY,
    ORDERUNITS double
)
WITH (
    kafka_topic='test_topic',
    partitions=12,
    replicas=1,
    value_format='JSON'
);

 Field      | Type                   

 ROWKEY     | INTEGER          (key) 
 ORDERUNITS | DOUBLE                 


Queries that read from this STREAM
-----------------------------------
CSAS_S1_0 (RUNNING) : CREATE STREAM S1 WITH (KAFKA_TOPIC='S1', PARTITIONS=12, REPLICAS=1) AS SELECT   ORDERS.ROWKEY ROWKEY,   ORDERS.ORDERUNITS ORDERUNITS,   (CASE WHEN (ORDERS.ORDERUNITS < 2.0) THEN 'small' WHEN (ORDERS.ORDERUNITS < 4.0) THEN 'medium' ELSE 'large' END) CASE_RESULT FROM ORDERS ORDERS EMIT CHANGES;

For query topology and execution plan please run: EXPLAIN <QueryId>

Local runtime statistics
------------------------


(Statistics of the local KSQL server interaction with the Kafka topic test_topic)

Consumer Group       : _confluent-ksql-default_query_CSAS_S1_0
Kafka topic          : test_topic

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 8          | 8      | 0   
 1         | 0            | 0          | 0      | 0   
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 0          | 0      | 0   
 4         | 0            | 2          | 2      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 0          | 0      | 0   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0

ksqldb-rest-app/src/main/java/io/confluent/ksql/rest/server/execution/ListSourceExecutor.java

agavra

Thanks @jeqo! This is a really good first stab at solving this, and I think it'll be a really useful tool to have when looking at ksqlDB applications operationally. I didn't have time to give this a full review, but I figured I give you some first round comments in the meantime.

ksqldb-cli/src/main/java/io/confluent/ksql/cli/console/Console.java

ksqldb-engine/src/main/java/io/confluent/ksql/services/SandboxedServiceContext.java

ksqldb-rest-app/src/main/java/io/confluent/ksql/rest/entity/SourceDescriptionFactory.java

ksqldb-rest-app/src/main/java/io/confluent/ksql/rest/server/execution/ListSourceExecutor.java

jeqo · 2020-06-04T21:19:17Z

[clabot:check]

ghost · 2020-06-04T21:19:18Z

@confluentinc It looks like @jeqo just signed our Contributor License Agreement. 👍

Always at your service,

clabot

agavra

mostly LGTM, sorry for the delayed review cycles! a few comments still left

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaConsumerGroupClientImpl.java

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaTopicClientImpl.java

ksqldb-rest-app/src/main/java/io/confluent/ksql/rest/server/execution/ListSourceExecutor.java

ksqldb-rest-model/src/test/java/io/confluent/ksql/rest/entity/SourceDescriptionTest.java

big-andy-coates · 2020-06-25T10:48:10Z

And then a small enhancement request, if you don't mind, to make this much more useful when there are LARGE numbers of partitions...

Can we add a 'max lag' summary at the top please? That way if you have 100 or 1000 partitions, you don't need to scan down to find the worst lag: you can just look at this value. Of course, you'd still need to scan down to find which partition was lagging, but at least you can quickly see how badly, or not, the group is lagging.

For example:

Consumer Group       : _confluent-ksql-default_query_CSAS_S1_0
Kafka topic          : test_topic
Max lag:             : 8

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 8          | 8      | 0   
 1         | 9            |24          | 16     | 8  
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 0          | 0      | 0   
 4         | 0            | 2          | 2      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 10         | 8      | 2   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0

big-andy-coates

Hi @jeqo

Awesome that you've created this PR - very much appreciated. I've added some comments, nits and suggestions below.

As it stands, I'm not sure the design is quite right. I'm not sure if the relationship between topic and consumer groups are right. If I'm reading the code correctly, (apologies, I've not had time to pull it locally and debug to check), then it's comparing the partition offsets of a source's sink topic with the consumer group offsets of queries that write into the source. I believe this is wrong.

Let's look at an example to work it through:

CREATE STREAM S1 (... some columns... )
   WITH (kafka_topic='s1', ...);

CREATE STREAM S2 (... some columns... )
   WITH (kafka_topic='s2', ...);

CREATE STREAM OUTPUT WITH (
     kafka_topic='op'
   ) AS 
     SELECT *
     FROM S1
       JOIN S2 WITHIN 10 SECONDS ON S1.id = S2.id;

Then we run:

DESCRIBE EXTENDED OUTPUT;

This means:

our DataSource is OUTPUT.
it's kafkaTopicName is op.
and it has a single RunningQuery.

Importantly, the consumer group of the RunningQuery will be consuming from topics s1 and s2, and NOT op.

So we can not compare the offsets of the consumer group with the offsets of op. We must compare the offsets of the consumer group with the offsets of the topic-partitions the consumer group is consuming!

If we add in a second query:

CREATE STREAM S3 (... some columns... )
   WITH (kafka_topic='s3', ...);

INSET INTO OUTPUT
   SELECT * FROM S3;

Now there will be two RunningQuerys for OUTPUT. The second consumer group will be consuming from s3 only.

So we need to change the logic to compare offsets of the right topics. I would suggest the following logic:

Iterate over all RunningQuery
a. calculate queryApplicationId
b. describe the consumer group and get the map of TopicPartition -> offset.
Once you've iterated over all running queries you can:
a. make a single request to get the earliest offset of ALL topic partitions we're interested in, i.e. all partitions of s1, s2 and s3 from the example above.
b. make a second single request to get the latest offsets of all the tps.

This way, we minimise calls to the brokers.

ksqldb-common/src/main/java/io/confluent/ksql/util/KsqlConfig.java

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaConsumerGroupClientImpl.java

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaTopicClientImpl.java

ksqldb-engine/src/test/java/io/confluent/ksql/services/FakeKafkaConsumerGroupClient.java

ksqldb-rest-app/src/main/java/io/confluent/ksql/rest/server/execution/ListSourceExecutor.java

ksqldb-rest-model/src/main/java/io/confluent/ksql/rest/entity/SourceConsumerGroupOffset.java

ksqldb-rest-model/src/main/java/io/confluent/ksql/rest/entity/SourceDescription.java

ksqldb-rest-model/src/main/java/io/confluent/ksql/rest/entity/SourceConsumerGroupOffsets.java

jeqo · 2020-06-25T18:17:30Z

@big-andy-coates It was a bit confusing at the beginning to see how the concepts fit together internally, but your example makes it so much clearer, thanks for this!

I'd expect to go through this around next week.

jeqo · 2020-07-07T00:09:18Z

Output starts to look better:

Consumer Group       : _confluent-ksql-default_query_CSAS_OUTPUT_0
Kafka topic          : S1
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 0          | 0      | 0   
 1         | 0            | 0          | 0      | 0   
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 1          | 1      | 0   
 4         | 0            | 0          | 0      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 0          | 0      | 0   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0   


Consumer Group       : _confluent-ksql-default_query_CSAS_OUTPUT_0
Kafka topic          : S2
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 0          | 0      | 0   
 1         | 0            | 0          | 0      | 0   
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 1          | 1      | 0   
 4         | 0            | 0          | 0      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 0          | 0      | 0   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0   


Consumer Group       : _confluent-ksql-default_query_INSERTQUERY_7
Kafka topic          : S3
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 1          | 1      | 0

Will go through the specifics later.

ksqldb-common/src/main/java/io/confluent/ksql/util/QueryApplicationId.java

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaConsumerGroupClientImpl.java

jeqo · 2020-08-07T11:41:24Z

@big-andy-coates I managed to cover the new requirements, take a look to the output:

ksql> describe extended output;

/////......

Queries that write from this STREAM
-----------------------------------
CSAS_OUTPUT_0 (RUNNING) : CREATE STREAM OUTPUT WITH (KAFKA_TOPIC='OP', PARTITIONS=12, REPLICAS=1) AS SELECT   S1.NAME NAME,   S1.AGE AGE,   S2.COUNTRY COUNTRY FROM S1 S1 INNER JOIN S2 S2 WITHIN 10 SECONDS ON ((S1.NAME = S2.NAME)) EMIT CHANGES;
INSERTQUERY_7 (RUNNING) : INSERT INTO OUTPUT SELECT * FROM S3;

For query topology and execution plan please run: EXPLAIN <QueryId>

Local runtime statistics
------------------------


(Statistics of the local KSQL server interaction with the Kafka topic OP)

Consumer Groups summary:

Consumer Group       : _confluent-ksql-default_query_CSAS_OUTPUT_0

Kafka topic          : S1
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 0          | 0      | 0   
 1         | 0            | 0          | 0      | 0   
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 3          | 3      | 0   
 4         | 0            | 0          | 0      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 0          | 0      | 0   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0   


Kafka topic          : S2
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 0          | 0      | 0   
 1         | 0            | 0          | 0      | 0   
 2         | 0            | 0          | 0      | 0   
 3         | 0            | 3          | 3      | 0   
 4         | 0            | 0          | 0      | 0   
 5         | 0            | 0          | 0      | 0   
 6         | 0            | 0          | 0      | 0   
 7         | 0            | 0          | 0      | 0   
 8         | 0            | 0          | 0      | 0   
 9         | 0            | 0          | 0      | 0   
 10        | 0            | 0          | 0      | 0   
 11        | 0            | 0          | 0      | 0   


Consumer Group       : _confluent-ksql-default_query_INSERTQUERY_7

Kafka topic          : S3
Max lag              : 0

 Partition | Start Offset | End Offset | Offset | Lag 

 0         | 0            | 3          | 3      | 0

and when topics are empty for a consumer group, the message you recommend is printed.

big-andy-coates · 2020-08-07T14:17:48Z

Great work @jeqo - thanks for the contribution!

Extend the `DESCRIBE EXTENDED` output to include information about the consumer groups used to populate the source, if any. For each query populating the source, the consumer group and topic information is shown. This information allows you to see the topic's start and end offset, and the consumer group's lasts committed offset and associated lag, for each partition the consumer group is consuming from. Example new output: ``` Consumer Groups summary: Consumer Group : _confluent-ksql-default_query_CSAS_OUTPUT_0 Kafka topic : S1 Max lag : 0 Partition | Start Offset | End Offset | Offset | Lag 0 | 0 | 0 | 0 | 0 1 | 0 | 0 | 0 | 0 2 | 0 | 0 | 0 | 0 3 | 0 | 3 | 3 | 0 4 | 0 | 0 | 0 | 0 5 | 0 | 0 | 0 | 0 6 | 0 | 0 | 0 | 0 7 | 0 | 0 | 0 | 0 8 | 0 | 0 | 0 | 0 9 | 0 | 0 | 0 | 0 10 | 0 | 0 | 0 | 0 11 | 0 | 0 | 0 | 0 Kafka topic : S2 Max lag : 0 Partition | Start Offset | End Offset | Offset | Lag 0 | 0 | 0 | 0 | 0 1 | 0 | 0 | 0 | 0 2 | 0 | 0 | 0 | 0 3 | 0 | 3 | 3 | 0 4 | 0 | 0 | 0 | 0 5 | 0 | 0 | 0 | 0 6 | 0 | 0 | 0 | 0 7 | 0 | 0 | 0 | 0 8 | 0 | 0 | 0 | 0 9 | 0 | 0 | 0 | 0 10 | 0 | 0 | 0 | 0 11 | 0 | 0 | 0 | 0 Consumer Group : _confluent-ksql-default_query_INSERTQUERY_7 Kafka topic : S3 Max lag : 0 Partition | Start Offset | End Offset | Offset | Lag 0 | 0 | 3 | 3 | 0 ``` Co-authored-by: Andy Coates <[email protected]>

jeqo changed the title ~~Add consumer offsets to DESCRIBE EXTENDED~~ feat: Add consumer offsets to DESCRIBE EXTENDED May 25, 2020

agavra self-assigned this May 26, 2020

jeqo force-pushed the describe-extended-with-offsets branch from 31548ac to 257c4d0 Compare May 26, 2020 21:01

jeqo commented May 26, 2020

View reviewed changes

jeqo marked this pull request as ready for review May 26, 2020 21:31

jeqo requested a review from a team as a code owner May 26, 2020 21:31

jeqo mentioned this pull request May 26, 2020

DESCRIBE EXTENDED should show consumer offset location #3604

Closed

agavra reviewed May 29, 2020

View reviewed changes

agavra requested a review from a team May 29, 2020 20:45

jeqo force-pushed the describe-extended-with-offsets branch 2 times, most recently from d4845b4 to 14eb7a7 Compare June 3, 2020 22:03

jeqo requested a review from agavra June 4, 2020 19:22

agavra reviewed Jun 12, 2020

View reviewed changes

jeqo force-pushed the describe-extended-with-offsets branch from 46263e2 to f628d07 Compare June 16, 2020 22:08

jeqo requested a review from agavra June 17, 2020 09:14

agavra mentioned this pull request Jun 19, 2020

feat: include row count in SHOW TOPICS EXTENDED #5642

Closed

2 tasks

big-andy-coates suggested changes Jun 25, 2020

View reviewed changes

jeqo force-pushed the describe-extended-with-offsets branch from d7de0de to 8c33496 Compare July 6, 2020 20:59

jeqo force-pushed the describe-extended-with-offsets branch from d795bec to c681551 Compare July 11, 2020 16:17

jeqo commented Jul 11, 2020

View reviewed changes

ksqldb-common/src/main/java/io/confluent/ksql/util/QueryApplicationId.java Outdated Show resolved Hide resolved

ksqldb-engine/src/main/java/io/confluent/ksql/services/KafkaConsumerGroupClientImpl.java Outdated Show resolved Hide resolved

jeqo requested a review from big-andy-coates July 11, 2020 19:47

jeqo force-pushed the describe-extended-with-offsets branch from b5cda02 to 4b3964d Compare July 15, 2020 09:27

jeqo force-pushed the describe-extended-with-offsets branch from ac636f8 to 086c4b0 Compare July 22, 2020 15:53

jeqo added 16 commits August 7, 2020 10:30

move time suffix

d8063f1

test query app id builder

f1dad37

fix: remove commented out code

e6e4ab6

validate objects non null

bd0d517

add tests to list offsets

fba0a75

add test when offsets differ

e34555f

converge tests

46c5473

fix checkstyle

04f517a

fix issues with long instantiation

480e7c4

fix checkstyle

c3f5eb4

map consumer groups to topics

dc97ede

fix console tests

03d0cfc

add topic offsets summary

32b3df5

fix unused import

8e4612f

fix console test

1976131

fix json format

6f213a8

jeqo force-pushed the describe-extended-with-offsets branch from 40fbd3f to 6f213a8 Compare August 7, 2020 10:50

fix groupId mapping

40a484a

jeqo added 2 commits August 7, 2020 12:47

fix import

082b5d2

fix json order

ca007da

jeqo requested a review from big-andy-coates August 7, 2020 13:58

big-andy-coates added this to the 0.12.0 milestone Aug 7, 2020

big-andy-coates self-assigned this Aug 7, 2020

big-andy-coates added the enhancement label Aug 7, 2020

big-andy-coates merged commit 9ce3c97 into confluentinc:master Aug 7, 2020

jeqo deleted the describe-extended-with-offsets branch August 7, 2020 15:06

agavra mentioned this pull request Oct 13, 2020

Log consumer lag metrics at regular intervals #6416

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add consumer offsets to DESCRIBE EXTENDED #5476

feat: Add consumer offsets to DESCRIBE EXTENDED #5476

jeqo commented May 25, 2020 •

edited

Loading

ghost commented May 25, 2020

jeqo commented May 26, 2020

agavra left a comment

jeqo commented Jun 4, 2020

ghost commented Jun 4, 2020

agavra left a comment

big-andy-coates commented Jun 25, 2020 •

edited

Loading

big-andy-coates left a comment

jeqo commented Jun 25, 2020

jeqo commented Jul 7, 2020

jeqo commented Aug 7, 2020

big-andy-coates commented Aug 7, 2020

feat: Add consumer offsets to DESCRIBE EXTENDED #5476

feat: Add consumer offsets to DESCRIBE EXTENDED #5476

Conversation

jeqo commented May 25, 2020 • edited Loading

Description

Testing done

Reviewer checklist

Missing functionality

ghost commented May 25, 2020

jeqo commented May 26, 2020

agavra left a comment

Choose a reason for hiding this comment

jeqo commented Jun 4, 2020

ghost commented Jun 4, 2020

agavra left a comment

Choose a reason for hiding this comment

big-andy-coates commented Jun 25, 2020 • edited Loading

big-andy-coates left a comment

Choose a reason for hiding this comment

jeqo commented Jun 25, 2020

jeqo commented Jul 7, 2020

jeqo commented Aug 7, 2020

big-andy-coates commented Aug 7, 2020

jeqo commented May 25, 2020 •

edited

Loading

big-andy-coates commented Jun 25, 2020 •

edited

Loading