[DRAFT] feat: begin modernization of symmetric crypto [#268][#289][#357] #411

rmlibre · 2024-07-16T09:39:59Z

Description

Generally, this PR is aimed at bringing best practices to the handling of secrets. That is done through making them more inaccessible, or by clearing them from memory as soon as possible. This PR also brings domain separation, input canonicalization, context commitment, & authenticated associated data to distinct encryption / decryption contexts.

More details & updates to come. Comments & reviews are open.

Domain Separation:

When using cryptography to authenticate data, or derive secrets, etc, it's important to tie those processes to their intended usage contexts by declaring the context with an unambiguous label. This allows both the production & consumption of cryptographic artifacts to ensure that they're in agreement over the label (domain) the artifact is supposed to be used within.

References:

What is meant by domain separation ...
What non-trivial benefit does including ...
Separate Your Domains: NIST PQC ...

Input Canonicalization:

Cryptography works best when its users are 100% sure & clear about what information they're processing. That can be subtlety, and surprisingly, challenging to do.

Example:

Say we're not using canonical input encoding, and we're hashing (H) a username: "green" & an email address: "housekeeper[at]email[dot]com". Here, if the delineation is unclear, then the information is not clear. Another user with username: "greenhouse" and email address: "keeper[at]email[dot]com" will produce the same hash.

H("green" + "[email protected]") == H("greenhouse" + "[email protected]")

References:

What is Canonicalization Attack?
Canonicalization Attacks Against ...

AEAD & Context Commitment:

Confusingly, the two most widely deployed authenticated ciphers (AES-GCM & ChaCha20-Poly1305) do not create ciphertexts which commit to a key. Which means that knowing the key allows one to create a single ciphertext which successfully decrypts to various different plaintext messages. In most cases this isn't considered an issue because it requires knowledge of the key. But, it does violate intuitive notions of what it means for something to be authentic when forgeries are easy to produce. It has also been shown to lead to practical problems:

Partitioning oracles can arise when encryption schemes are not committing with respect to their keys. We detail adaptive chosen ciphertext attacks that exploit partitioning oracles to efficiently recover passwords and de-anonymize anonymous communications.

~ Source: USENIX Security 2021

Because Fernet uses HMAC-SHA-256, instead of the universal hashes of the previously discussed ciphers, it doesn't suffer from this problem. But, it also doesn't have a direct interface for authenticating domain labels or other associated data. Borrowing partially from Efficient Schemes for Committing ..., which proposes deriving a fresh key by running a keyed PRF over all of the inputs, it can elegantly support the authentication of additional values.

References:

Key Committing AEADs
How to Abuse and Fix Authenticated Encryption ...
Succinctly-Committing Authenticated Encryption
Key commitment in GCM (or AEAD in general)

Remediations

... detailed descriptions coming soon

Passing Workflows:

…dsg#289]

Add pytest fixture parameterization to test under both dev & production settings.

…sg#289][scidsg#357]

…sg#357]

…scidsg#357]

…dsg#289][scidsg#357]

…cidsg#289][scidsg#357]

…][scidsg#289][scidsg#357]

…scidsg#357]

…cidsg#289][scidsg#357]

jeremywmoore

Here's a first pass review. Another pass may yield some more suggestions but I think there are some things to work out around managing the salt.

hushline/__init__.py

hushline/crypto/secrets_manager.py

…cidsg#411]

Suggested in review: scidsg#411 (comment) scidsg#411 (comment) scidsg#411 (comment) Co-authored-by: Jeremy Moore <[email protected]>

gitguardian · 2024-07-18T05:17:33Z

⚠️ GitGuardian has uncovered 6 secrets following the scan of your pull request.

Please consider investigating the findings and remediating the incidents. Failure to do so may lead to compromising the associated services or software components.

Since your pull request originates from a forked repository, GitGuardian is not able to associate the secrets uncovered with secret incidents on your GitGuardian dashboard.
Skipping this check run and merging your pull request will create secret incidents on your GitGuardian dashboard.

🔎 Detected hardcoded secrets in your pull request

GitGuardian id	GitGuardian status	Secret	Commit	Filename
-	-	Generic High Entropy Secret	`7f35ab2`	docker-compose.yaml	View secret
-	-	Generic High Entropy Secret	`d6af3e3`	dev_env.sh	View secret
-	-	Generic High Entropy Secret	`0924906`	dev_env.sh	View secret
-	-	Generic High Entropy Secret	`7f35ab2`	dev_env.sh	View secret
-	-	Generic High Entropy Secret	`294338b`	dev_env.sh	View secret
-	-	Generic High Entropy Secret	`294338b`	docker-compose.yaml	View secret

🛠 Guidelines to remediate hardcoded secrets

Understand the implications of revoking this secret by investigating where it is used in your code.
Replace and store your secrets safely. Learn here the best practices.
Revoke and rotate these secrets.
If possible, rewrite git history. Rewriting git history is not a trivial act. You might completely break other contributing developers' workflow and you risk accidentally deleting legitimate data.

To avoid such incidents in the future consider

following these best practices for managing and storing secrets including API keys and other credentials
install secret detection on pre-commit to catch secret before it leaves your machine and ease remediation.

^{🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.}

hushline/model.py

…idsg#411]

jeremywmoore

How does having a single shared salt compare to generating a per-user salt? Does using a kdf which incorporates the domain (and possible aad) obviate this need?

Although since we may still want to generate a per-user salt for passwords to combat precompute attacks.

hushline/__init__.py

hushline/crypto/secrets_manager.py

hushline/model.py

Suggested in review: scidsg#411 (review)

rmlibre

Commit to the password hash algorithm's associated encodings

hushline/model.py

…idsg#411] Suggested in review: scidsg#411 (review)

Cherry-picked from (b2cb88e) on PR branch [scidsg#411]

Cherry-picked from (e173071) on PR branch [scidsg#411]

Add a hushline specific domain label, & add name of destination hash to pre-hash domain label. These are additional good practice measures to mitigate hash shucking by eliminating external datasets from being possible sources of correlation. References: a. https://www.youtube.com/watch?v=OQD3qDYMyYQ b. https://security.stackexchange.com/a/234795 Reference (b) excerpt: "But the crucial point is that [a pre-hash] can be an otherwise uncracked [pre-hash] that happens to be present in some other leak, which you can then attack at massively increased speeds."

…scidsg#411]" This reverts commit e173071.

…idsg#411] The embedded timestamps in Fernet ciphertexts can be utilized in the case of server compromise to correlate analyzed web traffic originating from a targeted user, even when using Tor, with the encrypted database values hosted by servers running hushline. This commit switches from the Fernet cipher to the AEAD cipher ChaCha20Poly1305 which doesn't automatically generate timestamps, removing this source of unintended activity tracking correlation. Using the DKNA method (derived inputs: key, nonce, associated data), the 12 byte nonce is easily upgraded to a 32 byte salt, making repeats infeasible: >=2**-85.33 chance for each additional message after ~2**85.33 encryptions in the same context. This is referred to as the optimal bound for birthday bound security. The DKNA method also provides resistance to commitment attacks by binding the non-message inputs together into pseudo-random internal cipher states. This switch also comes with space efficiency improvements for the database. 32.65% more plaintext can fit into the same fields, due to the new raw bytes ciphertext being on average ~31.12% more efficient than Fernet's base64 encoded ciphertext. Fernet: ------- (Base64 encoded) 1 B | 8 B | 16 B | X B | 32 B Version ‖ Timestamp ‖ IV ‖ Ciphertext ‖ HMAC ChaCha20Poly1305 with DKNA: ------- (Raw bytes) ------- 32 B | X B | 16 B Salt ‖ Ciphertext ‖ Poly-Tag References: https://dataprot.net/articles/no-log-vpn/ https://martinolivier.com/open/timestamps.pdf https://github.com/Attacks-on-Tor/Attacks-on-Tor#correlation-attacks https://github.com/rmlibre/hushline/blob/7799379088e833282a0a8b9e8f9fd21756321522/docs/2-threat-model.md?plain=1#L75-L79 https://soatok.blog/2024/07/01/blowing-out-the-candles-on-the-birthday-bound/ https://crypto.stackexchange.com/q/112497

Quick-fix addressing privacy concerns discussed in PR branch [scidsg#411] commit (c772523). References: scidsg@c772523 Co-authored-by: Jeremy Moore <[email protected]>

Quick-fix addressing privacy concerns discussed in PR branch [#411] commit (c772523). References: c772523 Co-authored-by: Jeremy Moore <[email protected]>

Cherry-picked from (b2cb88e) on PR branch [#411]

rmlibre added 11 commits July 15, 2024 17:15

ci(tests): rely on actions to troubleshoot PR branch [scidsg#268][sci…

e7b7f5f

…dsg#289]

refactor: begin separation of dev & prod flows [scidsg#268][scidsg#289]

7bfd1c9

Add pytest fixture parameterization to test under both dev & production settings.

refactor!: prepare crypto.py to become a subpackage [scidsg#268][scid…

51d42a7

…sg#289][scidsg#357]

refactor: prepare new crypto subpackage [scidsg#268][scidsg#289][scid…

eeeaaaf

…sg#357]

feat(crypto): add basic secrets manager class [scidsg#268][scidsg#289][…

db5b0b4

…scidsg#357]

build(deps): prefer argon2 & proper canonicalization [scidsg#268][sci…

00f4fbe

…dsg#289][scidsg#357]

feat(crypto): implement canon domain KDF & secret wiping [scidsg#268][s…

5fd782f

…cidsg#289][scidsg#357]

refactor: install SecretsManager object without utilization [scidsg#268…

236a3af

…][scidsg#289][scidsg#357]

feat: begin modernization of symmetric crypto [scidsg#268][scidsg#289][…

b55c602

…scidsg#357]

fix(lint): make use of decryption failure exception type [scidsg#268][s…

920311b

…cidsg#289][scidsg#357]

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

160fa01

rmlibre marked this pull request as ready for review July 16, 2024 10:21

jeremywmoore reviewed Jul 17, 2024

View reviewed changes

rmlibre and others added 8 commits July 17, 2024 19:30

test(crypto): cover more failing cases of incorrect inputs [scidsg#411]

e590f77

fix(crypto): pass all required decrypt_field kwargs to vault decrypt [s…

b2349f7

…cidsg#411]

fix(crypto): avoid unannounced mutation of argument [scidsg#411]

f8e2f3e

refactor(crypto): provide clean interface parity [scidsg#411]

dda7577

style: run Ruff formatter [scidsg#411]

a1d7c6c

refactor: use consistent terms for setup flow variables [scidsg#411]

7f35ab2

refactor: use function names which better declare intent [scidsg#411]

5407394

Suggested in review: scidsg#411 (comment) scidsg#411 (comment) scidsg#411 (comment) Co-authored-by: Jeremy Moore <[email protected]>

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

a3b3f25

rmlibre commented Jul 18, 2024

View reviewed changes

hushline/model.py Outdated Show resolved Hide resolved

rmlibre added 6 commits July 18, 2024 18:34

feat(crypto): add domain labels to KDF initialization [scidsg#411]

f7405ae

style: reorder arguments for visual or semantic consistency [scidsg#411]

15b44d2

refactor: facilitate db usage for secret initialization [scidsg#411]

63dd395

test(crypto): make fast app fixtures for new row encryption tests [sc…

7f46de5

…idsg#411]

build(ruff): add Ruff formatter command to poetry Makefile [scidsg#411]

4617aa0

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

f9e2d79

jeremywmoore reviewed Jul 19, 2024

View reviewed changes

feat(crypto): switch from using 'scrypt' to 'argon2id' [scidsg#411]

6f53d35

Suggested in review: scidsg#411 (review)

rmlibre commented Jul 22, 2024

View reviewed changes

hushline/model.py Outdated Show resolved Hide resolved

hushline/model.py Outdated Show resolved Hide resolved

fix: commit to the password hash algorithm's associated encodings [sc…

c23c9be

…idsg#411] Suggested in review: scidsg#411 (review)

rmlibre added a commit to rmlibre/hushline that referenced this pull request Jul 22, 2024

fix: wrong algorithms for password reset in 'settings.py'

5621b14

Cherry-picked from (b2cb88e) on PR branch [scidsg#411]

rmlibre mentioned this pull request Jul 22, 2024

fix: wrong algorithms for password reset in 'settings.py' #447

Merged

3 tasks

rmlibre added a commit to rmlibre/hushline that referenced this pull request Jul 23, 2024

test(2fa): improve stability with broader error code catches

cd79614

Cherry-picked from (e173071) on PR branch [scidsg#411]

rmlibre mentioned this pull request Jul 23, 2024

test(2fa): improve stability with broader error code catches #450

Closed

rmlibre added 17 commits July 25, 2024 22:04

refactor: use structural pattern matching control flow [scidsg#411]

7c4596a

refactor: use explicit 'admin_secret' name & type [scidsg#411]

e0ea56f

feat(crypto): add encoding commitments to aad [scidsg#411]

eed1e5b

style: combine short like-lines [scidsg#411]

f84f976

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

0924906

fix(dev): re-add stripped newline from 'dev_env.sh' [scidsg#411]

d6af3e3

fix(crypto): commit to '_derive_key' 'size' encoding [scidsg#411]

baf26b5

fix(db): give 'value' column its property's name [scidsg#411]

f547c3d

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

5f17657

revert: "test(2fa): improve stability with broader error code catches […

c51e6e8

…scidsg#411]" This reverts commit e173071.

docs(test): comment why mypy needs a mock table [scidsg#411]

080ae11

feat(crypto): include admin db table name in 'aad' [scidsg#411]

7799379

style: combine related, short lines [scidsg#411]

0ea037e

docs: acknowledge rmlibre's contributions [scidsg#411]

b8b3a5f

merge(sync): pull 'scidsg/main' updates into PR branch [scidsg#411]

ee93c6a

rmlibre mentioned this pull request Jul 30, 2024

fix(privacy): use constant timestamp in Fernet ciphertext #466

Merged

2 tasks

jeremywmoore pushed a commit that referenced this pull request Jul 31, 2024

fix: wrong algorithms for password reset in 'settings.py'

2406857

Cherry-picked from (b2cb88e) on PR branch [#411]

This was referenced Sep 29, 2024

Encrypting the PGP public key is unnecessary #582

Open

Sessions are not encrypted and leak user_id #603

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DRAFT] feat: begin modernization of symmetric crypto [#268][#289][#357] #411

[DRAFT] feat: begin modernization of symmetric crypto [#268][#289][#357] #411

rmlibre commented Jul 16, 2024 •

edited

Loading

jeremywmoore left a comment

gitguardian bot commented Jul 18, 2024 •

edited

Loading

jeremywmoore left a comment

rmlibre left a comment

[DRAFT] feat: begin modernization of symmetric crypto [#268][#289][#357] #411

Are you sure you want to change the base?

[DRAFT] feat: begin modernization of symmetric crypto [#268][#289][#357] #411

Conversation

rmlibre commented Jul 16, 2024 • edited Loading

Description

Domain Separation:

References:

Input Canonicalization:

Example:

References:

AEAD & Context Commitment:

References:

Remediations

Passing Workflows:

jeremywmoore left a comment

Choose a reason for hiding this comment

gitguardian bot commented Jul 18, 2024 • edited Loading

⚠️ GitGuardian has uncovered 6 secrets following the scan of your pull request.

jeremywmoore left a comment

Choose a reason for hiding this comment

rmlibre left a comment

Choose a reason for hiding this comment

rmlibre commented Jul 16, 2024 •

edited

Loading

gitguardian bot commented Jul 18, 2024 •

edited

Loading