Add Manual host banning to PgCat #340

drdrsh · 2023-03-03T18:06:34Z

Sometimes we want an admin to be able to ban a host for some time to route traffic away from that host for reasons like partial outages, replication lag, and scheduled maintenance.

We can achieve this today using a configuration update but a quicker approach is to send a control command to PgCat that bans the replica for some specified duration.

This command does not change the current banning rules like

Primaries cannot be banned
When all replicas are banned, all replicas are unbanned

Commands added

BAN <host> <duration_seconds>;
BAN localhost 10;
    db     |     user      |  role   |   host
------------+---------------+---------+-----------
 sharded_db | other_user    | replica | localhost
 sharded_db | other_user    | replica | localhost
 sharded_db | other_user    | replica | localhost
 simple_db  | simple_user   | replica | localhost
 sharded_db | sharding_user | replica | localhost
 sharded_db | sharding_user | replica | localhost
 sharded_db | sharding_user | replica | localhost


SHOW BANS;
     db     |     user      |  role   |   host    |    reason    |          ban_time          | ban_duration_seconds | ban_remaining_seconds
------------+---------------+---------+-----------+--------------+----------------------------+----------------------+-----------------------
 sharded_db | sharding_user | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.258965 | 10                   | 8
 sharded_db | sharding_user | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.259333 | 10                   | 8
 sharded_db | sharding_user | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.259635 | 10                   | 8
 sharded_db | other_user    | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.259851 | 10                   | 8
 sharded_db | other_user    | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.262950 | 10                   | 8
 sharded_db | other_user    | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.267296 | 10                   | 8
 simple_db  | simple_user   | replica | localhost | AdminBan(10) | 2023-03-03 21:58:20.267537 | 10                   | 8


UNBAN <host>;
UNBAN localhost;
     db     |     user      |  role   |   host
------------+---------------+---------+-----------
 sharded_db | sharding_user | replica | localhost
 sharded_db | sharding_user | replica | localhost
 sharded_db | sharding_user | replica | localhost
 sharded_db | other_user    | replica | localhost
 sharded_db | other_user    | replica | localhost
 sharded_db | other_user    | replica | localhost
 simple_db  | simple_user   | replica | localhost

dat2 · 2023-03-03T18:12:28Z

src/errors.rs

+    MessageReceiveFailed,
+    FailedCheckout,
+    StatementTimeout,
+    ManualBan,


Suggested change

ManualBan,

AdminBan,

dat2 · 2023-03-03T18:13:16Z

src/admin.rs

+{
+    let host = match tokens.get(1) {
+        Some(host) => host,
+        None => return error_response(stream, "BAN command requires a hostname to ban").await,


non blocking, can be done in a follow up: do we want to accept a duration string for how long to ban it for?

levkk · 2023-03-05T15:58:55Z

src/admin.rs

+    res.put(row_description(&columns));
+
+    for (id, pool) in get_all_pools().iter() {
+        for address in pool.get_addresses_from_host(host) {


Do you think we should use the address name here instead to make this more like the other admin commands?

The common use case for admin banning (from my experience) is when a database is in a degraded state or is about to under go some maintenance. Using hostname for admin banning in these situation makes more sense as opposed to having to do an extra lookup to figure out the address name that corresponds to the host

As long as we show that host name somewhere in our stats, so the user can find it without guessing, e.g. sometimes people use IP addresses and sometimes they use DNS, and sometimes both refer to the same place.

levkk · 2023-03-05T16:01:38Z

src/admin.rs

+                _ => pool.settings.ban_time,
+            };
+            let remaining = ban_duration - (now - ban_time.timestamp());
+            if remaining <= 0 || address.role == Role::Primary {


Primary should never be added to this data structure.If it was, we may want to know.

levkk · 2023-03-05T16:02:17Z

src/admin.rs

+
+    for (id, pool) in get_all_pools().iter() {
+        for address in pool.get_addresses_from_host(host) {
+            if !pool.is_banned(&address) && address.role != Role::Primary {


The primary check should be handled by the pool ideally as it is now I believe?

levkk

Nice!

Sometimes we want an admin to be able to ban a host for some time to route traffic away from that host for reasons like partial outages, replication lag, and scheduled maintenance. We can achieve this today using a configuration update but a quicker approach is to send a control command to PgCat that bans the replica for some specified duration. This command does not change the current banning rules like Primaries cannot be banned When all replicas are banned, all replicas are unbanned

drdrsh added 2 commits March 3, 2023 12:04

Add Manual host banning to PgCat

c19f7b1

merge conflict

08c0d50

drdrsh changed the title ~~Mostafa manual ban support~~ Add Manual host banning to PgCat Mar 3, 2023

comment

cce59fb

dat2 reviewed Mar 3, 2023

View reviewed changes

drdrsh added 4 commits March 3, 2023 12:56

Make duration configurable

623f2d9

multishard for tests

8b90233

fix tests

caf90cd

build

1f5c829

drdrsh marked this pull request as ready for review March 3, 2023 21:53

drdrsh requested a review from levkk March 4, 2023 02:04

levkk reviewed Mar 5, 2023

View reviewed changes

address comments

2d31e32

levkk approved these changes Mar 6, 2023

View reviewed changes

drdrsh merged commit 2cc6a09 into postgresml:main Mar 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Manual host banning to PgCat #340

Add Manual host banning to PgCat #340

drdrsh commented Mar 3, 2023 •

edited

Loading

dat2 Mar 3, 2023

dat2 Mar 3, 2023

levkk Mar 5, 2023 •

edited

Loading

drdrsh Mar 5, 2023 •

edited

Loading

levkk Mar 6, 2023

levkk Mar 5, 2023 •

edited

Loading

levkk Mar 5, 2023

levkk left a comment

Add Manual host banning to PgCat #340

Add Manual host banning to PgCat #340

Conversation

drdrsh commented Mar 3, 2023 • edited Loading

dat2 Mar 3, 2023

Choose a reason for hiding this comment

dat2 Mar 3, 2023

Choose a reason for hiding this comment

levkk Mar 5, 2023 • edited Loading

Choose a reason for hiding this comment

drdrsh Mar 5, 2023 • edited Loading

Choose a reason for hiding this comment

levkk Mar 6, 2023

Choose a reason for hiding this comment

levkk Mar 5, 2023 • edited Loading

Choose a reason for hiding this comment

levkk Mar 5, 2023

Choose a reason for hiding this comment

levkk left a comment

Choose a reason for hiding this comment

drdrsh commented Mar 3, 2023 •

edited

Loading

levkk Mar 5, 2023 •

edited

Loading

drdrsh Mar 5, 2023 •

edited

Loading

levkk Mar 5, 2023 •

edited

Loading