21 Aug 2024

Let’s All Agree to Use Seeds as ML-KEM Keys

Last week, NIST published the final version of the ML-KEM¹ specification, FIPS 203. One change from the draft is that the final document explicitly allows storing the private decapsulation key as a seed. This is a plea to the cryptography engineering community: let’s all agree to only use seeds as the storage format of ML-KEM keys, and forget that a serialized format for expanded decapsulation keys even exists.

Seeds have multiple advantages. The most obvious is size: a seed is 64 bytes, while an expanded decapsulation key is 1 632, 2 400, or 3 168 bytes depending on the ML-KEM parameter set.

More importantly, though, a 64-byte seed is always valid, while an expanded decapsulation key needs to be validated. FIPS 203, Section 7.3, requires the following check

H(dk[384𝑘 : 768𝑘 + 32])) == dk[768𝑘 + 32 : 768𝑘 + 64]

to ensure two parts of the input key are consistent: the pre-computed hash of the encapsulation key, and the encapsulation key itself.

That’s not all, though. The decapsulation key expanded format is

ByteEncode₁₂(s) || ByteEncode₁₂(t) || ρ || H(ekPKE) || z

where s and t are vectors of NTT elements. An NTT element is in turn a vector of field elements, and a field element is a number between zero and 3 329, encoded as a 16-bit integer. What if an encoded field element is higher than 3 329? It’s invalid! What then? Well. If you find one in an encapsulation key you are required to reject it by FIPS 203, Section 7.2. But what if you find it in the encapsulation key that’s part of a decapsulation key (the t value)? And what if you find it in a different part of a decapsulation key (the s value)? The spec doesn’t say!

This whole can of worms just stays on the shelf with seeds, because it’s the implementation itself that derives s, t, and H(ekPKE), so it can be certain they are valid.

Ok, so seeds are smaller and simpler, reducing the margin for error, but are they expensive? They are not, to the point that loading the expanded decapsulation key over a Gigabit link is almost as expensive as expanding a seed², but most importantly it doesn’t matter!

Keys have implicitly two representations: one on the wire, one in memory. The latter can include precomputed values to enable faster operations and doesn’t need a fixed bytes format. The cost of going from one to the other—the cost of seed expansion—is not important for private keys: either keys are ephemeral, in which case they can go straight into the in-memory representation, or they are reused a lot, in which case the deserialization cost is amortized.

The ML-KEM expanded decapsulation key format is actually not even a good in-memory format, because it doesn’t include the full expanded matrix A but its seed ρ. It’s a weird in-between where some values are expanded (so you need to validate them) but some others are not (so you need to expand them).

Seeds have obvious advantages and are explicitly allowed by the specification, so we’ll definitely see them used in the wild. If we also support the expanded format, we’re going to see interoperability issues, and we’ll have to carry all the complexity of both code paths. Let’s just not.

Speaking of interoperability, FIPS 203 does a great job of specifying everything in terms of byte sequences³… except the seed, which is defined as ‌(d, z). I think every implementation I’ve seen just stores them by concatenating ⁴ them as a 64-byte buffer, so hopefully we can all agree on that. If anyone suggests an ASN.1 SEQUENCE of BIT STRINGs or whatever I will quit.

To really put this to rest, we’re going to need (sooner rather than later!) a specification like RFC 8410 which specifies the key format and assigns OIDs for use in e.g. PKCS #8. Interestingly, the OID arc used by RFC 8410 for Ed25519, 1.3.101, is now managed as an IANA registry with policy Specification Required, not RFC Required. See RFC 8411. That means that anyone could make e.g. a C2SP document referencing FIPS 203 and request OID assignments for ML-KEM. Unless work is already underway to assign OIDs, this might very well be the fastest path, and the longer we wait the more we risk ecosystem fractures.

EDIT (minutes after pressing send): Well, scratch all that, I was a day behind on my mailing lists inbox and I just saw that NIST has published OIDs for all ML-KEM parameters and linked to draft-ietf-lamps-kyber-certificates-03 for the key format specification. The “Private Key Format” section of that draft doesn’t say what the actual private key format is, and hopefully it will land on 64-byte seeds.

Concretely, for implementations, I am suggesting not implementing expanded key parsing and validation at all, or relegating it to an internal function like the derandomized variants, not exposed in the public API. That’s what I did for filippo.io/mlkem768 and probably what we will do for the Go standard library. We will also ensure that the test vectors in CCTV and Wycheproof don’t require it (i.e. that they provide seeds as inputs). This will make it hard or impossible to test issues that require malformed decapsulation keys but that’s the point: if you don’t expose the API you don’t need to test its failure cases. All other vectors should be reproducible by bruteforcing seeds.⁵

If you got this far, you might also want to follow me on Bluesky at @filippo.abyssdomain.expert or on Mastodon at @filippo@abyssdomain.expert.

The picture

In Madeira there is this incredible forest, Fanal, full of centenary twisty trees in a mysterious eery atmosphere. One of a few places, along with Barcelona, that are best experienced with the fog. (I grew up in the Padan Plain, and was recently almost ran over by a pack of spooked cows in the thick fog driving a motorcycle up the Matese Apennines, don’t @ me about fog.)

A group of gnarled, moss-covered trees with thick trunks and twisted branches standing on a foggy, green field, creating a mysterious and enchanting atmosphere.

All this work is funded by the awesome Geomys clients: Latacora, Interchain, Smallstep, Ava Labs, Teleport, SandboxAQ, Charm, and Tailscale. Through our retainer contracts they ensure the sustainability and reliability of our open source maintenance work and get a direct line to my expertise and that of the other Geomys maintainers. (Learn more in the Geomys announcement.)

Here are a few words from some of them!

Latacora — Latacora bootstraps security practices for startups. Instead of wasting your time trying to hire a security person who is good at everything from Android security to AWS IAM strategies to SOC2 and apparently has the time to answer all your security questionnaires plus never gets sick or takes a day off, you hire us. We provide a crack team of professionals prepped with processes and power tools, coupling individual security capabilities with strategic program management and tactical project management.

Teleport — For the past five years, attacks and compromises have been shifting from traditional malware and security breaches to identifying and compromising valid user accounts and credentials with social engineering, credential theft, or phishing. Teleport Identity Governance & Security is designed to eliminate weak access patterns through access monitoring, minimize attack surface with access requests, and purge unused permissions via mandatory access reviews.

Ava Labs — We at Ava Labs, maintainer of AvalancheGo (the most widely used client for interacting with the Avalanche Network), believe the sustainable maintenance and development of open source cryptographic protocols is critical to the broad adoption of blockchain technology. We are proud to support this necessary and impactful work through our ongoing sponsorship of Filippo and his team.

SandboxAQ — SandboxAQ’s AQtive Guard is a unified cryptographic management software platform that helps protect sensitive data and ensures compliance with authorities and customers. It provides a full range of capabilities to achieve cryptographic agility, acting as an essential cryptography inventory and data aggregation platform that applies current and future standardization organizations mandates. AQtive Guard automatically analyzes and reports on your cryptographic security posture and policy management, enabling your team to deploy and enforce new protocols, including quantum-resistant cryptography, without re-writing code or modifying your IT infrastructure.

Charm — Have you ever created an image of your code or terminal output from your terminal? This little command line tool makes it easy ❄️ 📸 https://github.com/charmbracelet/freeze

ML-KEM, f.k.a. Kyber, is a post-quantum key exchange mechanism, which we can use alongside or in place of Elliptic Curve Diffie-Hellman to protect encrypted data from future quantum computers. ↩
Expanding a seed on my M2 takes 40µs, loading 2 400 bytes over Gigabit takes 19µs. This is not a fair comparison, though: after you load the expanded key you still need to check the hash and expand the A matrix, both of which you would have done “for free” as part of expanding the seeds. ↩
Unironically, maybe the most important progress of cryptography engineering in the last twenty years has been transitioning to specifying APIs in terms of bytes. Everything starts and ends as bytes, if you write a specification in terms of “integers modulo P” or whatever, it just means implementations will YOLO the deserialization, validation, and serialization parts of the spec. ↩
X-Wing briefly had them reversed, highlighting the importance of specifying all the way to bytes. ↩
I strongly believe that edge cases should be either so common that they can be easily tested with random inputs (ideally > 2⁻¹⁶ chance, but anything > 2⁻⁴⁰ we’ll bruteforce a vector for) or so unlikely that they can be ignored and replaced with a exit(1) (< 2⁻¹²⁰ chance). ML-KEM seems to match this ideal. ↩