Merkle Trees
If you have been in the world of crypto for a while, it is highly likely that you have heard the term Merkle Tree, also known as Merkle Hash Tree.
Definition
A Merkle Tree is a hash-based data structure, and can serve as a cryptographic commitment scheme.
You can commit to a set of values using a Merkle Tree, such as:
- Evaluations of a function
- Coefficients of a polynomial
- Files in your database
Here is an example, where we commit to a vector using a Merkle Tree:
graph BT subgraph Merkle Tree h1["h1 (Merkle Root)"] h2 --> h1; h3 --> h1 h4 --> h2; h5 --> h2 h6 --> h3; h7 --> h3 h8 --> h4; h9 --> h4 h10 --> h5; h11 --> h5 h12 --> h6; h13 --> h6 h14 --> h7; h15 --> h7 end m -- hash --> h8 y -- hash --> h9 v -- hash --> h10 e -- hash --> h11 c -- hash --> h12 t -- hash --> h13 o -- hash --> h14 r -- hash --> h15
In a Merkle Tree, every node is made up of the hash of its children. In this example binary tree, that would be:
- and so on.
The leaf nodes are the hashes of elements of the committed set of data. The final hash at the root of the tree is called the Merkle Root.
Merkle Trees are often implemented as binary trees, but the concept works for -ary trees as well, where each node has children.
Merkle Proof
At some point, we may be asked to show that indeed some element of the comitted data exists at some position. For our example above, a verifier could ask "is there really a at position 6?".
A naive method here would be give the entire comitted set of data, and let them prove the Merkle Root; however, we can do much better than that! With Merkle Trees, we can answer such queries in a much more efficient way, and without revealing any other data than the one we are asked to reveal.
The trick is to provide the hashes needed to compute all the way from the requested element up to the Merkle Root. In total, we only need to provide one hash per level (in the case of a binary-tree Merkle Tree) and the verifier can compute the root!
For instance, to show that we have at position 6, we need to provide the hashes that are used to compute the parent nodes.
- is computed by verifier.
- requires to be provided.
- requires to be provided.
- requires to be provided.
- is our commitment, which the verifier knows already.
The proof is visualized below for the same tree, where the values known & computed by the verifier are colored green and the values provided by the prover are colored blue:
graph BT classDef g fill:#afa classDef b fill:#aaf subgraph Merkle Tree h1["h1 (Merkle Root)"]:::g h2:::b --> h1 h3:::g --> h1 h4[fa:fa-eye-slash] --> h2 h5[fa:fa-eye-slash] --> h2 h6:::g --> h3 h7:::b --> h3 h8[fa:fa-eye-slash] --> h4 h9[fa:fa-eye-slash] --> h4 h10[fa:fa-eye-slash] --> h5 h11[fa:fa-eye-slash] --> h5 h12:::b --> h6 h13:::g --> h6 h14[fa:fa-eye-slash] --> h7 h15[fa:fa-eye-slash] --> h7 end t:::g -- hash --> h13
You see, we only needed to provide 3 hashes here, although our data had 8 elements! In fact, if you have elements you only need to provide elements to the verifier, this is so much more efficient than the naive method of sending all the data to the verifier.
A Merkle Root can serve as a cryptographic commitment to a set of data.
- It is hiding because you can't find the preimage of an hash efficiently.
- It is binding because assuming otherwise would require you to find a hash-collision efficiently, which is known to be intractable.
To reveal that some value is part of the comitted set of data at a specific point, you only need to reveal the path from that node to the root, along with the value itself, as described above.
Further Reading
- The original paper is "A Digital Signature Based on a Conventional Encryption Function" by Ralph Merkle.
- "Providing Authentication and Integrity in Outsourced Databases using Merkle Hash Tree's" by Mykletun, Narasimha, Tsudik is another paper that is slightly easier to read.
- RareSkills Merkle Tree second-preimage attack blog post.