0024: A2A Forward Secrecy

  • Author: Mike Lodder

  • Start Date: 2018-08-29

Status

  • Status: ACCEPTED

  • Status Date: 2018-11-01

  • Status Note: Idea was accepted by the Indy community, but no work has ever been done on it. At this point IETF’s MLS (which provides the same feature) may be adopted before this feature is implemented.

Summary

Specify the protocol to add forward secrecy between agent to agent messaging.

Motivation

Agent to agent communication uses Elliptic-Curve Integrated Encryption Scheme (ECIES) to protect messages. While this protection is good, it does not provide forward-secrecy and key-compromise impersonation resistance.

Forward-secrecy is the idea that compromise of the end point long term keys will not compromise the session keys. For agent to agent communication, a session is agent 1 sending a message to agent 2. Each message transmitted is a session. Key compromise impersonation means an active attacker that gains knowledge of the private key and can replace (impersonate) the agent when communicating. Agents that have active (synchronous) connections can achieve this using ephemeral keys to establish each session key. This is much harder to do when messages are delivered asynchronously. Another vector of attack stems from reusing keys. The more a key is used the higher the likelihood that an attacker can deduce the long term keys. If care is not taken with how messages are encrypted then messages with the same plaintext can yield the same ciphertext which allows an attacker to correlate two messages from the same agent.

Signal is a protocol that provides the forward-secrecy and key-compromise impersonation resistance for both synchronous and asynchronous messaging. This HIPE proposes to implement the Signal protocol for agent to agent communication to improve security and privacy–specifically the double ratchet algorithm. It is assumed that the transport layer across agents isn’t secure in any way. Signal will function regardless of the transport layer.

Out of scope

Ledger communication This HIPE is not proposing to use the Signal protocol to communicate with the Indy Ledger. In this case, TLS is a good solution instead of the Signal protocol. Signal requires state variables to maintain privacy and secrecy. These state variables must be kept private or all of its benefits are void. It is also not reasonable for the ledger to store the necessary state variables to enact the Signal protocol for each connection.

Routing How messages are forwarded to their various destinations is not the purpose of this HIPE. This HIPE just covers how message forward secrecy is to be implemented.

Tutorial

Terminology

  • isk: The sending agent’s identity secret key.

  • ivk: The sending agent’s identity verification (public) key.

  • rivk: The receiving agent’s identity verification (public) key.

  • revk: The receiving agent’s ephemeral verification (public) key.

  • esk: The sending agent’s ephemeral secret key.

  • epk: The sending agent’s ephemeral public key.

  • KDF: Key Derivation Function–derives one or more secrets from a master key. Acceptable functions are SHAKE128, SHAKE256, HKDF, Argon2id, or Scrypt. If variable key lengths are not needed or smaller than the function outputs, then functions from the SHA2 or SHA3 family could be used as well.

  • ECIES: Encryption system that uses Diffie-Hellman (DH) with isk and rivk to compute a shared secret K.1 and K.2 using a KDF. This is used to encrypt a message and compute an integrity check tag. The encrypted message and tag are sent to remote party r. r uses their local isk and rivk to compute the same secrets and decrypt the message and check the tag.

  • AuthCrypt: ECIES involving both agents’ static keys that also signs the payload with isk. Provides non-reputability for the sender.

  • AnonCrypt: ECIES involving the receiving agent’s static keys and ephemeral keys from the sender. Receiver does not know who sent the message.

  • 1: Agent 1 - Alice

  • 2: Agent 2 - Bob

  • ||: byte concatenation. For example 1001 || 0011 = 10010011

Review

Microledgers Indy agents currently use either anoncrypt or authcrypt to send messages. Authcrypt provides no forward-secrecy or key-compromise resistance for either agent. Anoncrypt provides forward-secrecy for the sending agent but not the receiving agent. An attacker can still decrypt the message from the receiving agent’s long term identity key but not the sender’s. This is known as weak forward secrecy. The existing architecture does provide a secure method for changing identity keys using microledgers. When agent 1 rotates identity keys, the transaction is signed using the existing isk. The transaction and new ivk are sent to agent 2. Agent 2 verifies the transaction using agent 1’s old rivk . If the transaction is valid, agent 2 updates to the new rivk.

Signal Double Ratchet Algorithm The algorithm is implemented by each party performing a key agreement to initially seed a Diffie-Hellman (DH) ratchet and using the ratchet to generate single use encryption keys. The ratchet is seeded with a Root key. Another DH keypair is used to create an output that is combined with the initial seed as input to a KDF to derive a sending ratchet and receiving ratchet. Every message is encrypted with a unique message key. The message keys are output keys from the sending and receiving chains. Calculating the next message key uses the current ratchet value and a constant as inputs to a KDF. Part of the output replaces the existing sending/receiving ratchet value and the other part becomes the message key. A message key is a symmetric encryption key.

When 1 sends a message to 2, a sending message key is computed to encrypt the message, and the current DH public key is sent with the message. 2 calculates a receiving message key to decrypt the message. 2 takes the DH public key received from 1 and creates a new DH keypair. This is used to ratchet the Root key. This then replaces the sending and receiving seeds. This process repeats for both agents as they send and receive messages which rotates and updates their ratcheting chains. Signal also allows the header metadata to be encrypted. Encrypting the header section is desirable to prevent correlation and enhance privacy.

Overview

Channel Setup Two parties connect agents out of band by scanning QR codes or manually entering information into an app. Currently, only the long term identity keys are exchanged. This proposal adds an additional one-time DH key to be exchanged as well. Each of the keys are used to calculate seed values and initialize the microloedger in the following manner:

1 sends 2 in an initial message:

  • Identity Key ivk

  • Ephemeral public key epk

  • An initial ciphertext encrypted with some AEAD encryption scheme (AES-GCM, SALSA20 or CHACHA20 with POLY1305) using AD = ipk || ripk. AD contains identity information for both parties. The initial ciphertext should contain the first ratchet DH key.

1 calculates using her private keys and 2’s public keys:

DH1 = DH(isk, rivk)
DH2 = DH(esk, rivk)
DH3 = DH(esk, repk)

RK = KDF(DH1, DH2, DH3)

Note that DH1 provides mutual authentication while DH2 and DH3 provides forward secrecy.

When 2 receives the initial message or QR code, she repeats the same calculations as 1 and attempts to decrypt the initial ciphertext. If decryption fails, then 2 aborts the protocol and deletes the public keys. If decryption succeeds, the setup completes by 2 calculating the message and header RKs deleting the ephemeral message and header public keys, storing 1’s identity public key in the microledger, and storing the current ratchet public key for 1. 2 then sends an initial message to 1 and 1 repeats the same process.

This setup is based on Signal’s X3DH, the main difference being there are no central servers for 1 to find 2 and visa versa and 1 and 2 authenticate each other as part of the setup.

Agent-to-Agent Messages Agents may now use the signal protocol to send encrypted messages. Each agent keeps state for the following variables:

  • DHs: The current sending DH ratchet key pair.

  • DHr: The current receiving DH public key.

  • RK: 32 byte message root key.

  • MCKs, MCKr: 32 byte chain keys for sending and receiving messages.

  • HKs, HKr: 32 byte header keys for sending and receiving.

  • NHKs, NHKr: 32 byte next header keys for sending and receiving.

  • Ns, Nr: Message numbers for sending and receiving.

  • PN: Number of messages in previous sending chain.

  • MKSkipped: Dictionary of skipped-over message keys, indexed by header key ratchet public kye and message number.

Messages may be received out-of-order. Signal’s double ratchet handles this by tracking N and HK in each message. If a ratchet step is triggered, the agent will store any keys needed to decrypt missing messages later before performing the ratchet. Messages received are decrypted using the current message key. The message key is immediately deleted. Message storage can use different encryption techniques local to the agent system going forward. When a message is sent, the sending encryption key is subsequently deleted.

When a ratchet is performed, the current RK’s are updated and N is reset back to 1. Skipped message keys are derived and stored in MKSkipped based on the N received and the current expected value. MKSkipped keys should only be stored for an acceptable amount of time (highly sensitive applications may not store them at all, where less sensitive ones may store them for one week. 72 hours is an acceptable default. Storing up to 5 asymmetric updates previous chain message keys is acceptable but probably no more if lots of messages are being dropped then it could indicate a man-in-the-middle or a faulty agent. No more than 2000 skipped message keys should be stored).

Encrypted message header format

The encrypted header is the concatenation of the following fields

Version || Nonce or IV || Ciphertext || Tag || HMAC
  • Version, 8 bits

  • Nonce or IV, variable length multiple of 32 bits

  • Ciphertext, encrypted header, variable length

  • Tag, 128 bits

  • HMAC, 256 bits

Encrypted header fields

Version

This field denotes which version of the format is being used. There are three versions defined:

  • 16 (0x10): Version 1 for using Salsa20-Poly1305 (IETF or extended versions too) authenticated encryption

  • 32 (0x20): Version 1 for using Chacha20-Poly1305 (IETF or extended versions too) authenticated encryption

  • 64 (0x40): Version 1 for using AES256-GCM authenticated encryption

Nonce or IV

The 128 bit initialization vector for AES-GCM or 196 bit nonce for Salsa/Chacha. This value MUST be unique and unpredictable for each message. With a high-quality source of entropy, random selection will do this with high probability

Ciphertext

This field has variable size. It contains the unencrypted header metadata.

Tag

The tag used to authenticate the message.

HMAC

The 256-bit SHA256 HMAC, under the header or next header signing-key, of the concatenation of the following fields:

Version || Nonce or IV || Ciphertext || Tag

Associating the message

An agent may have multiple channels from which a message can come. To associate this message with the correct channel, the agent tries to HMAC and decrypt the header using known channel header keys and the AD for that channel. HMAC and decryption will fail for incorrect channels and only succeed for a correct channel. Once channel association is established, decrypting the message can begin. See Double ratchet with header encryption for more details about ratcheting header keys.

A downside to encrypted headers is the cloud agent will store the message until an edge agent tells the cloud agent to delete it. In the case of multiple edge agents, agent 1 might accidentally receive a message meant for agent 2 and will not be able to decrypt it. Agent 1 will not know if the message is bad, not for her, has been tampered with, or is spam. To eliminate this ambiguity, routing information should be included so cloud agents can know which edge agent the message is for and the edge agent can with assurance know the message is for them. This information should be encrypted so only the cloud agent can read it. The routing information can be a hash of an edge agents identity public key.

Unencrypted header format

The header is the concatenation of the following fields:

Version || Timestamp || PN || N || DHr
  • Version, 8 bits

  • Timestamp, 64 bits

  • PN, 32 bits

  • N, 32 bits

  • DHr, variable length depending on DH key scheme

Version

Denotes which version of the format is being used. There is only one version defined, with the value 128 (0x80).

Timestamp

64-bit unsigned big-endian integer. Records the number of seconds elapsed between January 1, 1970 UTC and the time the message was created.

PN

The number of messages in previous sending chain.

N

The message ID for the current sending and receiving chains.

Threats

The threat model is defined in terms of what an attacker can acheive.

Assumptions

User

  • Acts reasonably and in good faith. (Giving their private identity key to an attacker would be unreasonable).

  • Installs authentic agent software

User’s Device

  • Device correctly executes the agent software and is not compromised by malware.

Security

  • Ed25519, x25519, Salsa20, Chacha20, Poly1305, HMAC-SHA256, AES-256 are valid.

Attacks

User’s Cloud Agent

  • Can learn when a user is online by observing messages (not their contents)

  • Can learn how many messages are received and when they are received (but not who sent them).

  • Can learn message sizes.

  • Can drop or corrupt messages.

  • Can spam the user with invalid messages.

  • Can duplicate old messages.

  • Could inform a contact (even falsely) that they have been revoked or fired.

Passive attacker observing all traffic

  • Can learn who is using the cloud agent.

  • Can learn when messages are sent and where they are sent.

  • Can observe when a new channel is setup and possibly insert self as man-in-the-middle.

Physical loss of user’s device

  • Attacker can perform offline attack to unlock device obtain undeleted messages and keys.

Compromise of user’s device

  • Attacker can obtain all messages going forward.

A contact

  • Can spam a user with messages.

  • Can to some extent prove to a third-party that a message came from a user.

  • Can retain messages from a user, forever.

  • Can learn that a user has changed identity keys (but this is the point).

  • Can learn how many devices a user is using to communicate with them.

Random attacker on the internet

  • Can DoS the cloud agent or the edge agent if the edge agent connects directly online.

Edge Cases

Ratchet out of sync There will be times when two party’s ratchets could get out of sync. If this happens, it will be difficult to differentiate between a faulty or spam message. Regardless, there might be times where a ratchet resync will be needed. To perform a resync, agent 1 can authcrypt a special resync message using both party’s identity keys. The resync message includes similar data necessary to calculate new ratchet seeds. After a resync, the identity keys could be rotated using microledgers to ensure forward secrecy for the resync message.

Reference

This HIPE is designed to work with Ed25519 keys but could work with any public key crypto system.

Signal

Drawbacks

This HIPE adds complexity to agent-to-agent messaging. It requires knowledge of cryptographic functions, more local space for storing state variables, and proper management of state variables. State variables will need to be backed up to resume channels. Syncing these values across agents that belong to the same identity will be impossible. Each of Alice’s agents will need to maintain their own state variables. This inhibits the possibility of using group encryption or group signatures to hide how many agents Alice has and which of her agents she is using. But since Alice trusts Bob enough to establish a channel with him, it might be an okay tradeoff.

Performance is another consideration. Signal requires executing KDFs every time a message is sent and received to derive keys, and computing a Diffie-Hellman ratchet. Care must be taken to choose a KDF that isn’t performance inhibitive. Choosing elliptic curve keypairs can reduce the size and performance penalty for computing the Diffie-Hellman ratchet.

Rationale and alternatives

Encrypted messaging has been around for long time and is a well understood problem. PGP was used to encrypt and send messages asynchronously in the form of email but it’s not forward secure and it leaks traffic information. Forward Secret PGP has never materialized. Email is also considered insecure since email addresses are largely public. Setting up secure email is very difficult.

Indy could try to come up with its own asynchronous messaging protocol but will probably not be able to create one better than Signal nor as widely adopted.

Agents could also continuously rotate keys using the microledgers but this would require extra data in every message that includes the new key and a signed transaction. The microledger maintains transactions forever. This solution would eventually result in a massive amount of data for the microledger.

Agents could also setup short-lived sessions that use a group symmetric key stored with the cloud agent but known to members of the group. The management involved in such a scheme is more complex than to discuss here but could be the subject for future HIPEs.

Signal is supported and improved by Open Whisper Systems and the Signal Foundation. Signal has been vetted by cryptographers and security professionals alike who have found it to be secure (Signal audit and A Formal Security Analysis of the Signal Messaging Protocol). Signal has been implemented in multiple programming languages already so the protocol does not need to be written from scratch. The open source libraries can be used directly with Indy.

Prior art

As stated, encrypted messaging between two parties is a well understood problem. Multiple solutions currently exist but the most popular are Off-the-record (OTR), Silent Circle’s Silent Text, Secure Chat, iMessage, and others. Signal evolved by combining the best of many of these and fixing existing weaknesses.

Unresolved questions

  • Is it necessary for agents to be able backup and restore their state variables?

  • How should agents backup and restore their state variables?