Internet-Draft | CoMETRE | July 2023 |
Steele, et al. | Expires 11 January 2024 | [Page] |
This specification describes verifiable data structures and associated proof types for use with COSE. The extensibility of the approach is demonstrated by providing CBOR encodings for RFC9162.¶
This note is to be removed before publishing as an RFC.¶
Source for this draft and an issue tracker can be found at https://github.com/ietf-scitt/draft-steele-cose-merkle-tree-proofs.¶
This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.¶
Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.¶
Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."¶
This Internet-Draft will expire on 11 January 2024.¶
Copyright (c) 2023 IETF Trust and the persons identified as the document authors. All rights reserved.¶
This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License.¶
Merkle trees are one of many verifiable data structures that enable tamper-evident, secure information storage through their ability to protect the integrity of batches of documents or collections of statements.¶
Merkle trees can be constructed from simple operations such as concatenation and digest via a cryptographic hash function; however, more advanced constructions enable proofs of different properties of the underlying verifiable data structure.¶
Verifiable data structure proofs can be used to prove that a document is in a database (proof of inclusion), that a database is append-only (proof of consistency), that a smaller set of statements is contained in a larger set of statements (proof of disclosure, a special case of proof of inclusion), or that certain data is not yet present in a database (proof of non-inclusion).¶
Differences in the representation of verifiable data structures and of their proof types can increase the burden for implementers and create interoperability challenges for transparency services.¶
This document describes how to convey verifiable data structures and associated proof types in COSE envelopes.¶
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.¶
Verifiable Data Structure: A data structure which supports one or more Proof Types.¶
Proof Type: A verifiable process that proves properties of one or more Verifiable Data Structures.¶
Proof Value: An encoding of a Proof Type in CBOR.¶
Proof Signature: A COSE Sign1 encoding of a specific Proof Type for a specific Verifiable Data Structure.¶
This section describes representations of verifiable data structure proofs in CBOR.¶
Different verifiable data structures can support the same proof types, but the representations of the proofs vary greatly.¶
For example, the construction of a Merkle tree leaf, or of an inclusion proof from a leaf to a Merkle root, might have several different representations, depending on the verifiable data structure used.¶
Some differences in representations are necessary to support efficient verification of different kinds of proofs and for compatibility with specific implementations.¶
Some proof types benefit from standard envelope formats for signing and encryption, whilst others require no further cryptographic intervention at all.¶
In order to improve interoperability we define two extension points for enabling verifiable data structures with COSE, and we provide concrete examples for the structures and proofs defined in [RFC9162].¶
This document establishes a registry of verifiable data structure algorithms, with the following initial contents:¶
Identifier | Algorithm | Reference |
---|---|---|
0 | N/A | |
1 | RFC9162_SHA256 | [RFC9162] |
Proof types are specific to their associated "verifiable data structure"; for example, different Merkle trees might support different representations of "inclusion proof" or "consistency proof".¶
Implementers should not expect interoperability across "verifiable data structures", but they should expect conceptually similar properties across registered proof types.¶
For example, two different Merkle-tree-based verifiable data structures might both support proofs of inclusion. Protocols requiring proof of inclusion ought to be able to preserve their functionality while switching from one verifiable data structure to another, so long as both structures support the same proof types.¶
This document establishes a registry of verifiable data structure proof type tags, with the following initial contents:¶
Identifier | Proof Type | Reference |
---|---|---|
0 | N/A | |
TBD_2 | inclusion | Section 4.2 |
TBD_3 | consistency | Section 4.3 |
Editor's note: The registry requirements need to address the case of multiple proofs of a given type.¶
Inclusion proofs provide a mechanism for a verifier to validate set membership.¶
The integer identifier for this Proof Type is TBD_2. The string identifier for this Proof Type is "inclusion".¶
Section 5.2 provides a concrete example.¶
Consistency proofs provide a mechanism for a verifier to validate the consistency of a verifiable data structure.¶
The integer identifier for this Proof Type is TBD_3. The string identifier for this Proof Type is "consistency".¶
Section 5.3 provides a concrete example.¶
This section defines how the data structures described in [RFC9162] are mapped to the terminology defined in this document, using CBOR and COSE.¶
RFC9162_SHA256 requires the following:¶
The integer identifier for this Verifiable Data Structure is 1. The string identifier for this Verifiable Data Structure is "RFC9162_SHA256".¶
See Section 3.1.¶
See [RFC9162], 2.1.1. Definition of the Merkle Tree, for a complete description of this verifiable data structure.¶
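As a non-normative illustration of the structure referenced above, the following Python sketch computes the Merkle Tree Hash in the way [RFC9162], Section 2.1.1 describes it, using SHA-256. The function names are hypothetical and chosen only for this example.

```python
import hashlib


def leaf_hash(entry: bytes) -> bytes:
    """Leaf hash: SHA-256(0x00 || entry)."""
    return hashlib.sha256(b"\x00" + entry).digest()


def node_hash(left: bytes, right: bytes) -> bytes:
    """Interior node hash: SHA-256(0x01 || left || right)."""
    return hashlib.sha256(b"\x01" + left + right).digest()


def merkle_tree_hash(entries: list[bytes]) -> bytes:
    """Merkle Tree Hash over an ordered list of entries."""
    n = len(entries)
    if n == 0:
        # The hash of an empty list is the hash of an empty string.
        return hashlib.sha256(b"").digest()
    if n == 1:
        return leaf_hash(entries[0])
    # k is the largest power of two strictly smaller than n.
    k = 1 << ((n - 1).bit_length() - 1)
    return node_hash(merkle_tree_hash(entries[:k]),
                     merkle_tree_hash(entries[k:]))
```

The 0x00 and 0x01 prefixes separate the leaf and interior-node hash domains, so a leaf can not be presented as an interior node or the reverse.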
See [RFC9162], 2.1.3.1. Generating an Inclusion Proof, for a complete description of this verifiable data structure proof type.¶
The CBOR representation of an inclusion proof for RFC9162_SHA256 is:¶
inclusion-proof = #TBD_2([
    tree-size: int
    leaf-index: int
    inclusion-path: [+ bstr]
])¶
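The three fields of inclusion-proof are enough to recompute a root hash from a leaf hash. The following Python sketch follows the verification steps of [RFC9162], Section 2.1.3.2, but returns the recomputed root instead of comparing it against a supplied one. The function and parameter names are assumptions made for illustration.

```python
import hashlib


def leaf_hash(entry: bytes) -> bytes:
    """Leaf hash per RFC 9162: SHA-256(0x00 || entry)."""
    return hashlib.sha256(b"\x00" + entry).digest()


def node_hash(left: bytes, right: bytes) -> bytes:
    """Interior node hash per RFC 9162: SHA-256(0x01 || left || right)."""
    return hashlib.sha256(b"\x01" + left + right).digest()


def root_from_inclusion_proof(leaf: bytes, leaf_index: int, tree_size: int,
                              inclusion_path: list[bytes]) -> bytes:
    """Recompute the Merkle root implied by a leaf hash and its inclusion path."""
    if not 0 <= leaf_index < tree_size:
        raise ValueError("leaf_index out of range")
    fn, sn, r = leaf_index, tree_size - 1, leaf
    for p in inclusion_path:
        if sn == 0:
            raise ValueError("inclusion path is too long")
        if (fn & 1) or fn == sn:
            # The sibling p is on the left of the running hash.
            r = node_hash(p, r)
            # Skip levels where this node has no right sibling.
            while fn != 0 and (fn & 1) == 0:
                fn >>= 1
                sn >>= 1
        else:
            # The sibling p is on the right of the running hash.
            r = node_hash(r, p)
        fn >>= 1
        sn >>= 1
    if sn != 0:
        raise ValueError("inclusion path is too short")
    return r
```

A verifier would compare the returned value with a trusted root, or use it as the payload when checking a signature over the root.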
In a signed inclusion proof, the previous Merkle tree root maps to tree-size-1 and is a detached payload.¶
Other specifications refer to signed inclusion proofs as "receipts"; profiles of proof signatures are encouraged to make additional protected header parameters mandatory.¶
TODO: reference to scitt receipts.¶
The protected header for an RFC9162_SHA256 inclusion proof signature is:¶
Editor's note: Recommend removing crit and mandating kid. See issue 21.¶
The unprotected header for an RFC9162_SHA256 inclusion proof signature is:¶
The payload of an RFC9162_SHA256 inclusion proof signature is:¶
A previous Merkle tree hash as defined in [RFC9162].¶
The payload MUST be detached.¶
Detaching the payload forces verifiers to recompute the root from the inclusion proof signature. This protects against implementation errors where the signature is verified but the root does not match the inclusion proof.¶
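To illustrate how a detached payload enters verification, the Python sketch below builds the COSE Sig_structure for a COSE_Sign1 message (as specified for COSE in RFC 9052), with the recomputed root supplied as the payload. The tiny CBOR encoder and all function names are assumptions of this example; the signature check itself is left to a COSE or cryptography library and is not shown.

```python
def cbor_head(major: int, n: int) -> bytes:
    """Encode a CBOR initial byte plus length argument for a major type."""
    if n < 24:
        return bytes([(major << 5) | n])
    if n < 1 << 8:
        return bytes([(major << 5) | 24, n])
    if n < 1 << 16:
        return bytes([(major << 5) | 25]) + n.to_bytes(2, "big")
    if n < 1 << 32:
        return bytes([(major << 5) | 26]) + n.to_bytes(4, "big")
    return bytes([(major << 5) | 27]) + n.to_bytes(8, "big")


def cbor_bstr(value: bytes) -> bytes:
    """Encode a CBOR byte string (major type 2)."""
    return cbor_head(2, len(value)) + value


def cbor_tstr(value: str) -> bytes:
    """Encode a CBOR text string (major type 3)."""
    encoded = value.encode("utf-8")
    return cbor_head(3, len(encoded)) + encoded


def sig_structure(protected: bytes, recomputed_root: bytes,
                  external_aad: bytes = b"") -> bytes:
    """Bytes to be verified for a COSE_Sign1 whose payload is detached.

    protected is the serialized protected header exactly as it appears
    in the message; recomputed_root is derived from the inclusion proof.
    """
    return (cbor_head(4, 4)                 # array of four items
            + cbor_tstr("Signature1")       # context string for COSE_Sign1
            + cbor_bstr(protected)
            + cbor_bstr(external_aad)
            + cbor_bstr(recomputed_root))   # stands in for the detached payload
```

Because the root never travels in the message, a signature can only verify if the verifier derived the same root from the leaf and the inclusion path.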
The following example needs to be converted to proper CDDL:¶
# COSE_Sign1
18([

  # Protected Header
  h'a2012604588368747470733a2f2f73636974742e78797a2f75726e3a696574663a706172616d733a7472616e733a696e636c7573696f6e3a726663393136325f7368613235363a303a65343263333764326638306361613464323035353635376534303463386538363838313534346136663264313731356530663564616435643436343833633531',
  # {
  #   "alg" : "ES256",
  #   1 : -7,
  #   "verifiable-data-structure" : "RFC9162_SHA256",
  #   TBD_1 : 1,
  # }

  # Unprotected Header
  {
    # "inclusion-proof" : "h'3133312c322c302c3132392c3231362c36342c38382c33322c3235342c3132382c33392c34392c3131382c312c3230352c38372c3235332c3136312c31332c3136312c38352c3139302c3133322c3234312c3137332c34352c3132372c32302c35302c35342c31332c3134342c33332c3233372c3234382c3132382c32332c3138392c3133352c3932'"
    TBD_2 : h'3133312c322c302c3132392c3231362c36342c38382c33322c3235342c3132382c33392c34392c3131382c312c3230352c38372c3235332c3136312c31332c3136312c38352c3139302c3133322c3234312c3137332c34352c3132372c32302c35302c35342c31332c3134342c33332c3233372c3234382c3132382c32332c3138392c3133352c3932'
  },

  # Detached Payload

  # Signature
  h'4862c1dced27ceeb1f7a6277d13be127a8969a7171ae000ffa90ef5757b817ca8ee61d57645d1a087251a97f06eb61aec46ecf958e4a0fb94ae37f410c7f77ea'
])¶
See [RFC9162], 2.1.4.1. Generating a Consistency Proof, for a complete description of this verifiable data structure proof type.¶
The CBOR representation of a consistency proof for RFC9162_SHA256 is:¶
consistency-proof = #TBD_3([
    tree-size-1: int           ; size of the tree, when the previous root was produced.
    tree-size-2: int           ; size of the tree, when the latest root was produced.
    consistency-path: [+ bstr] ; consistency path, from previous root to latest root.
])¶
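As a non-normative aid, the Python sketch below checks a consistency path between two tree sizes by following the verification steps of [RFC9162], Section 2.1.4.2. It only covers the case 0 < tree-size-1 < tree-size-2, and the function and parameter names are assumptions made for this example.

```python
import hashlib


def node_hash(left: bytes, right: bytes) -> bytes:
    """Interior node hash per RFC 9162: SHA-256(0x01 || left || right)."""
    return hashlib.sha256(b"\x01" + left + right).digest()


def verify_consistency(first: int, first_root: bytes,
                       second: int, second_root: bytes,
                       consistency_path: list[bytes]) -> bool:
    """Return True if the path proves that second_root extends first_root."""
    # This sketch handles only 0 < first < second with a non-empty path.
    if not (0 < first < second) or not consistency_path:
        return False
    path = list(consistency_path)
    if (first & (first - 1)) == 0:
        # first is an exact power of two: the old root is itself a node
        # of the new tree, so it is prepended to the path.
        path.insert(0, first_root)
    fn, sn = first - 1, second - 1
    while fn & 1:
        fn >>= 1
        sn >>= 1
    fr = sr = path[0]
    for c in path[1:]:
        if sn == 0:
            return False
        if (fn & 1) or fn == sn:
            # c is a left sibling in both the old and the new tree.
            fr = node_hash(c, fr)
            sr = node_hash(c, sr)
            while fn != 0 and (fn & 1) == 0:
                fn >>= 1
                sn >>= 1
        else:
            # c is a right sibling that exists only in the new tree.
            sr = node_hash(sr, c)
        fn >>= 1
        sn >>= 1
    return fr == first_root and sr == second_root and sn == 0
```

The check succeeds only if the same path reproduces both roots, which is what shows that the larger tree is an append-only extension of the smaller one.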
Editor's note: tree-size-1 could be omitted if an inclusion proof is always present, since the inclusion proof contains tree-size-1.¶
In a signed consistency proof, the latest Merkle tree root maps to tree-size-2 and is an attached payload.¶
The protected header for an RFC9162_SHA256 consistency proof signature is:¶
Editor's note: Recommend removing crit and mandating kid. See issue 21.¶
The unprotected header for an RFC9162_SHA256 consistency proof signature is:¶
The payload of an RFC9162_SHA256 consistency proof signature is:¶
The latest Merkle tree hash as defined in [RFC9162].¶
The payload MUST be attached.¶
The following example needs to be converted to proper CDDL:¶
# COSE_Sign1
18([

  # Protected Header
  h'a2012604588568747470733a2f2f73636974742e78797a2f75726e3a696574663a706172616d733a7472616e733a636f6e73697374656e63793a726663393136325f7368613235363a303a66653830323733313736303163643537666461313064613135356265383466316164326437663134333233363064393032316564663838303137626438373563',
  # {
  #   "alg" : "ES256",
  #   1 : -7,
  #   "verifiable-data-structure" : "RFC9162_SHA256",
  #   TBD_1 : 1,
  # }

  # Unprotected Header
  {
    # "consistency-proof" : "h'3133312c312c312c3132392c3231362c36342c38382c33322c3235342c3132382c33392c34392c3131382c312c3230352c38372c3235332c3136312c31332c3136312c38352c3139302c3133322c3234312c3137332c34352c3132372c32302c35302c35342c31332c3134342c33332c3233372c3234382c3132382c32332c3138392c3133352c3932'"
    TBD_3 : h'3133312c312c312c3132392c3231362c36342c38382c33322c3235342c3132382c33392c34392c3131382c312c3230352c38372c3235332c3136312c31332c3136312c38352c3139302c3133322c3234312c3137332c34352c3132372c32302c35302c35342c31332c3134342c33332c3233372c3234382c3132382c32332c3138392c3133352c3932'
  },

  # Protected Payload
  h'fe8027317601cd57fda10da155be84f1ad2d7f1432360d9021edf88017bd875c',

  # Signature
  h'fe476fcddb783805fe344fc96837f4a5531c2e5a56d6f6353831e84e17ac69d4407a5a0d6eadf27f3a570bcf604181fd11b4692d3ac17b116c6226ba43726113'
])¶
See the privacy considerations section of:¶
Although the word transparency implies to some degree read access, it is important to note that transparency logs might include sensitive information.¶
Depending on the verifiable data structure used, a service provider might be able to count unique entries.¶
In the case that an entry is produced from a COSE_Sign1 envelope, adding information to the unprotected header can be used to produce a unique entry.¶
However, this could impact privacy, and some transparency service operators might prefer that only integrity-protected content be made transparent.¶
In cases where a single Merkle root and multiple inclusion paths are used to prove inclusion for multiple payloads, there is a risk that an attacker may be able to learn the content of undisclosed payloads by brute-forcing the values adjacent to the disclosed payloads through application of the cryptographic hash function and comparison to the disclosed inclusion paths. This kind of attack can be mitigated by including a cryptographic nonce in the construction of the leaf; however, this nonce must then be disclosed alongside an inclusion proof, which increases the size of multiple-payload signed inclusion proofs.¶
Tree algorithm designers are encouraged to comment on this property of their leaf construction algorithm.¶
Implementers wishing to leverage multiple inclusion proofs to support selective disclosure can prepend each payload with extra data before computing the inclusion proof, where the extra data is a cryptographic nonce.¶
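One possible reading of this mitigation is sketched below in Python: the leaf input is the nonce followed by the payload, hashed with the [RFC9162] leaf prefix. The 16-byte nonce length and the function name are assumptions of this example, not requirements of this document.

```python
import hashlib
import secrets
from typing import Optional


def salted_leaf(payload: bytes,
                nonce: Optional[bytes] = None) -> tuple[bytes, bytes]:
    """Return (nonce, leaf hash) for a payload prefixed with a random nonce.

    The nonce has to be disclosed together with the payload and its
    inclusion path so that a verifier can recompute the leaf hash.
    """
    if nonce is None:
        nonce = secrets.token_bytes(16)  # assumed length: 128 bits
    # RFC 9162 leaf hash over (nonce || payload).
    leaf = hashlib.sha256(b"\x00" + nonce + payload).digest()
    return nonce, leaf
```

With a fresh nonce per payload, guessing a low-entropy payload no longer lets an attacker reproduce the sibling hashes that appear in disclosed inclusion paths.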
See the security considerations section of:¶
The choice of cryptographic hash function is the primary factor affecting the security of authenticating payload inclusion in a Merkle root. Tree algorithm designers should review the latest guidance on selecting a suitable cryptographic hash function.¶
This document requests IANA to add new values to the 'COSE Algorithms' and 'COSE Header Algorithm Parameters' registries in the 'Standards Action With Expert Review' category.¶
Editor's note: Authors are discussing how to avoid flooding the COSE header parameters registry with new proof types.¶
IANA will be asked to establish a registry of tree algorithm identifiers, named "Verifiable Data Structures" to be administered under a Specification Required policy [RFC8126].¶
Template:¶
IANA will be asked to establish a registry of proof type identifiers, named "Verifiable Data Structures Proof Types" to be administered under a Specification Required policy [RFC8126].¶
Template:¶
Note to RFC Editor: Please remove this section as well as references to [BCP205] before AUTH48.¶
This section records the status of known implementations of the protocol defined by this specification at the time of posting of this Internet-Draft, and is based on a proposal described in [BCP205]. The description of implementations in this section is intended to assist the IETF in its decision processes in progressing drafts to RFCs. Please note that the listing of any individual implementation here does not imply endorsement by the IETF. Furthermore, no effort has been spent to verify the information presented here that was supplied by IETF contributors. This is not intended as, and must not be construed to be, a catalog of available implementations or their features. Readers are advised to note that other implementations may exist.¶
According to [BCP205], "this will allow reviewers and working groups to assign due consideration to documents that have the benefit of running code, which may serve as evidence of valuable experimentation and feedback that have made the implemented protocols more mature. It is up to the individual working groups to use this information as they see fit".¶
An open-source implementation was initiated and is maintained by Transmute Industries Inc. (Transmute).¶
An application demonstrating the concepts is available at https://scitt.xyz.¶
An open-source implementation is available at:¶
The current version ('main') implements the tree algorithm, inclusion proof and consistency proof concepts of this draft.¶
The project and all corresponding code and data maintained on GitHub are provided under the Apache License, version 2.¶
The implementation builds on concepts described in SCITT [I-D.ietf-scitt-architecture] (https://scitt.io/).¶
The implementation uses the Concise Binary Object Representation [RFC7049] (https://cbor.io/).¶
The implementation uses an implementation of CBOR Object Signing and Encryption [RFC9053], maintained at https://github.com/erdtman/cose-js.¶
The implementation uses an implementation of [RFC9162], maintained at:¶