matrix-doc/proposals/4147-including-device-keys-...

7.8 KiB

MSC4147: Including device keys with Olm-encrypted to-device messages

Summary: a proposal to ensure that messages sent from a short-lived (but genuine) device can be securely distinguished from those sent from a spoofed device.

Background

When a Matrix client receives an encrypted message, it is necessary to establish whether that message was sent from a device genuinely belonging to the apparent sender, or from a spoofed device (for example, a device created by an attacker with access to the sender's account such as a malicious server admin, or a man-in-the-middle).

In short, this is done by requiring a signature on the sending device's device keys from the sending user's self-signing cross-signing key. Such a signature proves that the sending device was genuine.

Current client implementations check for such a signature by querying the sender's device keys when an encrypted message is received.

However, this does not work if the sending device logged out in the time between sending the message and it being received. This is particularly likely if the recipient is offline for a long time. In such a case, the sending server will have forgotten the sending device (and any cross-signing signatures) by the time the recipient queries for it. This makes the received message indistinguishable from one sent from a spoofed device.

Current implementations work around this by displaying a warning such as "sent by a deleted or unknown device" against the received message, but such messaging is unsatisfactory: a message should be either trusted or not.

We propose to solve this by including a copy of the device keys in the Olm-encrypted message, along with the cross-signing signatures, so that the recipient does not have to try to query the sender's keys.

Proposal

The plaintext payload of to-device messages encrypted with the m.olm.v1.curve25519-aes-sha2 encryption algorithm is currently of the form:

{
  "type": "<type of the plaintext event>",
  "content": "<content for the plaintext event>",
  "sender": "<sender_user_id>",
  "recipient": "<recipient_user_id>",
  "recipient_keys": {
    "ed25519": "<our_ed25519_key>"
  },
  "keys": {
    "ed25519": "<sender_ed25519_key>"
  }
}

We propose to add a new property: sender_device_keys, which is a copy of what the server would return in response to a /keys/query request, as the device keys for the sender's device. In other words, the plaintext payload will now look something like:

{
  "type": "<type of the plaintext event>",
  "content": "<content for the plaintext event>",
  "sender": "<sender_user_id>",
  "recipient": "<recipient_user_id>",
  "recipient_keys": {
    "ed25519": "<our_ed25519_key>"
  },
  "keys": {
    "ed25519": "<sender_ed25519_key>"
  },
  "sender_device_keys": {
    "algorithms": ["<supported>", "<algorithms>"],
    "user_id": "<user_id>",
    "device_id": "<device_id>",
    "keys": {
      "ed25519:<device_id>": "<sender_ed25519_key>",
      "curve25519:<device_id>": "<sender_curve25519_key>"
    },
    "signatures": {
      "<user_id>": {
        "ed25519:<device_id>": "<device_signature>",
        "ed25519:<ssk_id>": "<ssk_signature>",
      }
    }
  }
}

If this property is present, the keys.ed25519 property of the plaintext payload must be the same as the sender_device_keys.keys.ed25519:<DEVICEID> property. If they differ, the recipient should discard the event.

As the keys property is now redundant, it may be removed in a future version of the Matrix specification.

Potential issues

Adding this property will increase the size of the to-device message. We found it increased the length of a typical encrypted m.room_key message from about 1400 to 2400 bytes (a 70% increase). This will require increased storage on the recipient homeserver, and increase bandwidth for both senders and recipients. See Alternatives for discussion of mitigation strategies.

This proposal is not a complete solution. In particular, if the sender resets their cross-signing keys, and also logs out the sending device, the recipient still has no way to verify the sending device. The device signature in the Olm message is meaningless. A full solution would require the recipient to be able to obtain a history of cross-signing key changes, and to expose that information to the user; that is left for the future.

Alternatives

Minor variations

The sender_device_keys property could be added to the cleartext. That is, it could be added as a property to the m.room.encrypted event. This information is already public, as it is accessible from /keys/query (while the device is logged in), and does not need to be authenticated as it is protected by the self-signing signature, so it does not seem to need to be encrypted. However, there seems to be little reason not to encrypt the information. In addition, by including it in the encrypted payload, it leaves open the possibility of it replacing the keys property, which must be part of the encrypted payload to prevent an unknown key-share attack.

The sender_device_keys property could be added to the cleartext by the sender's homeserver, rather than by the sending client. Possibly within an unsigned property, as that is where properties added by homeservers are customarily added. It is not clear what advantage there would be to having this information being added by the client.

To mitigate the increased size of to-device events under this proposal, the sender_device_keys could be sent only in pre-key messages (Olm messages with type: 0 in the m.room.encrypted event) — with the rationale that if the Olm message is a normal (non-pre-key) message, this means that the recipient has already decrypted a pre-key message that contains the information, and so does not need to be re-sent the information), or if the signatures change (for example, if the sender resets their cross-signing keys), or if the sender has not yet sent their device_keys. However, this requires additional bookkeeping, and it is not clear whether this extra complexity is worth the reduction in bandwidth.

Alternative approach

A more radical proposal to decrease the overhead in to-device messages is to instead specify that /keys/query must include deleted devices as well as active ones, so that they can be reliably queried. Since the origin server might be unreachable at the time the recipient receives the message, such device lists would need to be cached on the recipient homeserver.

In other words, this approach would require all homeservers to keep a permanent record of all devices observed anywhere in the federation, at least for as long as there are undelivered to-device events from such devices.

Transparently: we have not significantly explored this approach. We have a working solution, and it is unclear that the advantages of this alternative approach outweigh the opportunity cost and delay in rollout of an important security feature. If, in future, the overhead of including the device keys in the to-device messages is found to be significant, it would be worth revisiting this.

Security considerations

If a device is logged out, there is no indication why it was logged out. For example, an attacker could steal a device and use it send a message. The user, upon realizing that the device has been stolen, could log out the device, but the message may still be sent, if the user does not notice the message and redact it.

Unstable prefix

Until this MSC is accepted, the new property should be named org.matrix.msc4147.device_keys.

Dependencies

None