matrix-doc/proposals/4102-unthreaded-read-receip...

6.2 KiB

MSC4102: Clarifying precedence in threaded and unthreaded read receipts in EDUs

This MSC assumes some knowledge around threaded receipts. Please read MSC3771: Read receipts for threads as for brevity some concepts won't be re-explained.

This proposal clarifies previously undefined behaviour around how read receipts are expressed over the CSAPI and SSAPI. This clarification reliably provides clients with sufficient information to determine read receipts, without placing additional burdens on client implementations.

Background

MSC3771: Read receipts for threads defined an API shape for threaded read receipts over the /sync API. They look like this:

{
  "content": {
    "$thread_reply": {
      "m.read": {
        "@rikj:jki.re": {
          "ts": 1436451550453,
          "thread_id": "$thread_root" // or "main" or absent
        }
      }
    }
  },
  "room_id": "!jEsUZKDJdhlrceRyVU:example.org",
  "type": "m.receipt"
}

This is problematic because it expresses receipt uniqueness based on the 3-uple (event ID, receipt type, user ID). In reality, uniqueness is based on the 4-uple of (event ID, receipt type, user ID, thread ID). This makes it impossible to express certain receipt aggregations, such as:

      Receipts Table
room | user | event | thread
-----+------+-------+--------
!foo | @bob | $abc  | NULL         // unthreaded receipt
!foo | @bob | $abc  | "some_id"    // threaded receipt

It is impossible to express these two receipts in a single m.receipt EDU, as the presence of thread_id: "some_id" by definition removes the absence of a thread ID, turning the receipt from an unthreaded receipt into a threaded receipt.

MSC3771: Read receipts for threads does not provide rules around how to combine these receipts. This has led to undefined server behaviour. In practice, both Synapse and the Sliding Sync Proxy will combine them in a "last write wins" manner. If the unthreaded receipt is sent after the threaded receipt, then /sync will always return the unthreaded receipt and vice versa. This is not a hypothetical problem: Element-Web will send both threaded and unthreaded receipts for the same event if the event in question is the most recent event in the room, which is fairly common.

This becomes a major problem when the threaded receipt "wins" and is returned instead of the unthreaded receipt. Some clients, such as Element X, do not handle threads and will ignore receipts with a non-main thread_id, assuming that clients will also be sending unthreaded receipts. When the threaded receipt "wins" however, these clients will simply not see the receipt, despite the sender sending it.

Proposal

When a server is combining receipts into an EDU, if there are multiple receipts for the same (user, event, receipt type), always choose the receipt which is unthreaded (has no thread_id) when aggregating into an EDU.

This change will apply to all m.receipt EDUs, which includes both CSAPI and Federation endpoints.

For example, given these two EDUs:

// EDU 1
{
    "content": {
        "$1435641916114394fHBLK:matrix.org": {
            "m.read": {
                "@erikj:jki.re": {
                    "ts": 1550000000000
                }
            }
        }
    },
    "type": "m.receipt"
}

// EDU 2
{
    "content": {
        "$1435641916114394fHBLK:matrix.org": {
            "m.read": {
                "@erikj:jki.re": {
                    "ts": 1559999999999,
                    "thread_id": "foo"
                }
            },
            "m.read.private": {
                "@self:example.org": {
                    "ts": 1660000000000,
                    "thread_id": "bar"
                }
            }
        }
    },
    "type": "m.receipt"
}

The unthreaded receipt wins and the combined EDU should become:

// Combined EDU
{
    "content": {
        "$1435641916114394fHBLK:matrix.org": {
            "m.read": {
                "@erikj:jki.re": {
                    "ts": 1550000000000
                }
            },
            "m.read.private": {
                "@self:example.org": {
                    "ts": 1660000000000,
                    "thread_id": "bar"
                }
            }
        }
    },
    "type": "m.receipt"
}

Note:

  • the data from the unthreaded receipt (the ts) is used.
  • the threaded receipt for type m.read.private is kept, as the 3-uple (user, event, receipt type) must match for the precedence rules to kick in.

Potential issues

Some data is lost when aggregating EDUs, as the fact that a threaded receipt was sent will be lost. This is not a problem in practice because an unthreaded receipt always takes precedence over a threaded receipt due to it being a superset of the same data.

Thread-unaware clients need to inspect thread_id: "main" receipts to accurately get read receipts from thread-aware clients, as that is how thread-aware clients send receipts for unthreaded events. Despite this, the main thread ID has no significance in this proposal, and should be treated as any other thread ID when determining precedence. This is because an unthreaded receipt (no thread ID) is still a superset of the main thread, just like any other thread.

Alternatives

Servers could split m.receipt EDUs and send threaded receipts in one EDU and unthreaded EDUs in another. This creates additional EDUs, increasing bandwidth usage, processing time and increases complexity as servers need to handle cases where EDUs are sent incompletely to remote clients/servers.

Servers could instead always choose the threaded receipt instead of the unthreaded receipt. This would make it harder to implement thread-unaware Matrix clients, as some receipt information is lost and can only be found by introspecting the threaded receipts.

Security considerations

None.

Unstable prefix

No unstable prefix required as no new endpoints / JSON keys are added in this proposal.

Dependencies

None.