Practical Advice

Foundation

Keep personal data off-chain

Store the record off-chain, keep only an artefact on-chain

Whenever possible, no personal data should be written to the chain at all. Store the record relating to a data subject completely off-chain and put only a verification artefact (e.g. a hash or a commitment) on-chain. This keeps the on-chain footprint minimal and preserves the ability to delete or correct the underlying data off-chain.

Never combine a hash with additional data on-chain

Combining a hash with further data on the chain creates a linkage problem: because the original object relates to an identifiable person, the on-chain hash can then tie that person to everything stored alongside it. Hashing only the sensitive parts of a record still leaves the rest of the record linkable to the data subject — so a record must be stored off-chain as a whole.

Cryptography

Hashing done right — salt & pepper

A hash on a blockchain can leak data in three ways. Understanding them tells you when salting or peppering is needed.

The three ways a hash can leak

Lack of entropy: if the input can be guessed, the hash merely confirms a correct guess and thus reconstructs the original data.
A broken algorithm: a future break could reconstruct parts of the input. This risk is currently low — even with the insecure MD5, the practical effect is only collisions, not reconstruction.
Verification adds additional information: if the verification adds information not contained in the document itself, the verification itself might qualify as personal data. For example, endorsements or revocation statements could do so. There might still be a justification for doing so on a public blockchain — e.g. the rectification of a certified document.

Note. The EDPB treats decentralised verification artefacts — hashes, commitments or zero-knowledge proofs that can be used to verify personal data — as personal data, while centralised verification artefacts such as qualified electronic certificates for qualified electronic seals, which serve the same purpose and come with the same risks, are not regarded as personal data. Nothing in the GDPR or in the jurisprudence of the CJEU justifies this different treatment of decentralised and centralised verification artefacts.

Salt and pepper — borrowed from password storage

Salt is a random key added per entry before hashing; it defeats parallel and rainbow-table attacks and ensures two equal inputs do not share a hash. It can be stored next to the hash. Pepper is a constant but secret key kept in a secure place; it additionally blocks rainbow-table attacks and makes brute force much harder. Unlike passwords, many on-chain objects already carry enough entropy, and a pepper that must be known to a larger group cannot stay secret.

When to add salt:

Enough entropy in the object → no salt needed.
Not enough entropy → supplement with random data, if the object allows it.
Not enough entropy and supplementing would invalidate the object → use a separate random key (salt).

When to add pepper:

A secret key (pepper) should be added if the hashing serves as an endorsement separate from the document itself. By deleting the secret key (pepper), this separate endorsement can be deleted. If the verification does not add additional information, it does not need to be separated by an additional key.

Note. The EDPB regards all decentralised verification artefacts for documents containing personal data as personal data as long as the verification artefacts can be used. It therefore demands that all verification artefacts can be deactivated, e.g. by deleting a secret key (pepper). This seems to be based neither on the text of the GDPR nor on the jurisprudence of the CJEU.

Advanced techniques

Commitments & zero-knowledge proofs

Prove without revealing

Commitments and zero-knowledge proofs allow — as do hashes — to separate the proof from the information itself. Unlike hashes, a zero-knowledge proof can even prove the possession of an information without having to reveal that information, while a hash requires the information itself to verify it against the hash.

Note. From the EDPB's view, commitments and zero-knowledge proofs are still regarded as personal data if they allow the verification of personal data. This unfortunate view — not based on the text of the GDPR nor on the jurisprudence of the CJEU — adds a compliance risk to the use of privacy-preserving technologies.

Advanced techniques

Chameleon hashes & pruning

Chameleon hashes / redactable ledgers

Chameleon hashes allow editing hashes for those who know a secret key. This could allow invisible redactions. For some use cases — probably centralised use cases — this might be the appropriate technical architecture. See the analysis of Accenture's blockchain redaction approach: Forrester (Bennett, Matzke, Hoppermann, McPherson).

Pruning

A blockchain might also include a pruning mechanism that automatically removes transaction information under certain conditions, e.g. after a certain time. A book-keeping blockchain could automatically erase all old blocks after 10 years. Any information that is required for a longer period needs to be copied before.

Design & compliance

Choose the right chain — and run a DPIA

Match the architecture to the purpose

Decide deliberately between permissionless public and permissioned chains, and identify who — if anyone — determines purposes and means at the transaction, smart-contract and infrastructure level. The architecture choice drives both the privacy properties and the allocation of GDPR roles.

Use the DPIA as an enabler

Where processing is likely to result in high risk, a data protection impact assessment is required — and even where it is not, it helps document necessity, proportionality and mitigations. Treat it as a design tool rather than a hurdle. See the DPIA section on the overview page. If you are considering a DPIA, I am happy to support you in carrying one out.

Examples

Privacy-enhancing applications

Cases from the BC4EU report where decentralised technology is used to strengthen — not weaken — the privacy of data subjects. Read the full report (PDF).

●

Certifying diplomas via a smart contract

A smart contract stores only the hash of a diploma; a flag is added if it is revoked. Without revocation the hash is a mere verification artefact and reveals nothing about a person. The design enables independent, durable verification even after the issuer ceases to exist, limits the revocation information to those who hold a copy of the diploma, and — unlike a centralised revocation registry — leaves no trace of who checks a diploma, when or how often.

●

Proof of reserves

To prove that a custodian's liabilities are covered without exposing individual balances, account balances are hashed into a Merkle tree and a zero-knowledge proof attests the total of all liabilities. The Merkle root (optionally anchored on-chain) is only a verification artefact and not personal data; each customer can still verify that their balance is included — privacy by design instead of an auditor inspecting personal data.

●

TRISA travel-rule data exchange

Crypto-asset service providers must exchange originator and beneficiary data under the travel rule (FATF Rec. 16; in the EU the Transfer of Funds Regulation). With TRISA/TRP, the identity data is exchanged off-chain via mutually authenticated, encrypted, field-minimised messages (IVMS 101), while on-chain data stays pseudonymous and limited to what is technically required — a privacy-by-design implementation of a legal obligation.