Support 64-byte seed form of ML-KEM private keys #1985

bifurcation · 2024-11-08T19:11:54Z

Describe the bug
There is increasing community consensus that ML-KEM private keys should be stored in the form of 64-byte seeds (i.e., the inputs to ML-KEM.KeyGen_internal()), not in the expanded form described in FIPS 203. See, for example:

https://words.filippo.io/dispatches/ml-kem-seeds/
https://datatracker.ietf.org/meeting/121/materials/slides-121-pquip-fips-issues-with-deploying-ml-kem-and-ml-dsa-04

It is impossible to implement protocols that are defined in terms of seeds. The OQS_KEM_ml_kem_XXX_keypair() method produces an expanded private key, and OQS_KEM_ml_kem_XXX_decaps() consumes an expanded private key.

The minimal possible fix here would be to split out a method that creates an expanded private key from a seed. That way an app could mostly just use the current API, except that when generating a completely fresh key, they would have to fill the seed with random data. To even that out, you could make a parallel seed-only API, either by refactoring the existing interface or creating a parallel one. Anything short of removing the current API of course, will leave the risk that a caller will use expanded keys, with the associated risk of corruption.

If we can agree on an approach here, I would be happy to send a PR.

To Reproduce
N/A

Expected behavior
N/A

Screenshots
N/A

Environment (please complete the following information):

liboqs version: main branch

Additional context
N/A

The text was updated successfully, but these errors were encountered:

baentsch · 2024-11-09T07:01:25Z

Thanks for that proposal and offer to contribute. Can I ask how you'd want to go about this? Would you want to provide patches to the upstream code liboqs pulls in? Are you willing to maintain them in the face of upstream code changes? Either way, this proposal goes against two of the most core liboqs design principles, though:

Algorithm independence: all algs behave identically under the same API
No algorithm maintenance: all core crypto code is developed and maintained in a responsible manner by upstream sources specialized in this, not by liboqs.

If we drop 1) adding algorithm-specific APIs integration of liboqs to other applications becomes much harder and more support-intensive for the OQS team ("which algorithm are you using which API with?").
If we drop 2), liboqs becomes a totally different project by taking ownership of the implementations of specific algorithms. I'm not convinced the project has the right people for this on board.

Thus, what about the suggestion that you propose a general change of (private) key format to the upstreams that liboqs pulls its code from? If those change their logic to generate/return/process private keys in the "seed" format "under the hood", liboqs can be maintained in its current form.

bifurcation · 2024-11-10T14:55:47Z

Thanks for the clarification @baentsch. I didn't have the context that liboqs isn't owning these implementations.

It actually looks like no upstream changes are needed, because the required API is already supported by the underlying implementation. If I understand correctly, The coins argument to pqcrystals_kyberXXX_ref_keypair_derand() is what I have been referring to as the "seed". So we should be able to respect principle (2) with no problem.

Principle (1) then seems like it would guide towards the "parallel seed-only API" approach. We would create another set of algorithm identifiers (say, ML-KEM-XXX-seedonly, following the current), which would use the derand API for key generation and decapsulation (encap would be identical to current ML-KEM).

Does that seem acceptable?

baentsch · 2024-11-10T16:12:43Z

Thanks for these explanations, @bifurcation . This proposal looks very good, if I get it right: It would allow liboqs to retain its (KEM) API (keygen/encaps/decaps), require no patches to the upstream code (but use different functions/params) and require downstream code just to change algorithm name (and happily make use of shorter private keys), right? That really sounds too good to be true :-) So I guess to judge that on merit, allow me to come back to your original offer

I would be happy to send a PR

--> looking forward to reviewing that. It will be unique in that it actually creates 2 implementations out of 1 upstream when running copy_from_upstream, so it may not be quite as straightforward as we think....

bhess · 2024-11-11T11:08:31Z

Thanks @bifurcation for this proposal!

Principle (1) then seems like it would guide towards the "parallel seed-only API" approach. We would create another set of algorithm identifiers (say, ML-KEM-XXX-seedonly, following the current), which would use the derand API for key generation and decapsulation (encap would be identical to current ML-KEM).

I like the idea to have separate algorithm identifiers to support the seed-only variant.
Do I understand the proposal correctly that the OQS_KEM_keypair won't be changed and in this case just returns the 64-byte private key as secret_key? While encaps would be identical to current ML-KEM, it seems that decaps would need to be changed because the upstream API expects the expanded private key as input. With the current upstream implementation this could be solved by calling pqcrystals_kyberXXX_ref_keypair_derand as part of decaps.

Related to this - as a heads-up - there is ongoing discussion in the PQCP ML-KEM implementation pq-code-package/tsc#4 on extending the upstream API. PQCP is a candidate to replace the pq-crystals upstream in liboqs in the future. Some of the goals are to efficiently accommodate aspects like seed-only representation of private keys, handing expanded keys and supporting key validation. Some of the adaptations there would help us here.

baentsch · 2024-11-11T11:28:12Z

it seems that decaps would need to be changed because the upstream API expects the expanded private key as input. With the current upstream implementation this could be solved by calling pqcrystals_kyberXXX_ref_keypair_derand as part of decaps

This is exactly how I understand the proposal by @bifurcation : Same OQS API calling a different pqcrystals API for the new key type.

Some of the goals are to efficiently accommodate aspects like seed-only representation of private keys, handing expanded keys and supporting key validation. Some of the adaptations there would help us here

That now begs the question: What upstreams does OQS keep supporting and (by/until) when? It surely would be great if PQCP would provide API, support and quality warranties OQS currently doesn't get from any upstream (and accordingly, cannot provide such qualities on to users of liboqs). Maybe worth while a separate discussion thread @SWilson4 @dstebila ?

dstebila · 2024-11-12T16:30:54Z

This discussion seems related to a few other discussions in liboqs and PQ Code Package: pq-code-package/tsc#4 and #1877.

baentsch · 2024-11-12T18:32:06Z

Thanks for the pointer to the PQCP discussion, @dstebila. Do you also read it such that PQCP may decide to support only one SK representation option (e.g., as per NIST guidance)? If so, I see a problem triggered by the reminder of @SWilson4 as to the original mission of OQS, namely to support research -- and as such to enable all options all algorithms permit: If PQCP were to only support one variant, where does OQS get the other from? Keep using a different upstream just for that seems inefficient / unnecessary work.

github-project-automation bot added this to liboqs planning Nov 8, 2024

github-project-automation bot moved this to Todo in liboqs planning Nov 8, 2024

baentsch mentioned this issue Nov 9, 2024

Add ML-DSA / FIPS 204 final #1919

Open

8 tasks

baentsch added enhancement New feature or request help wanted Asking for support from non-core team labels Nov 11, 2024

bhess mentioned this issue Nov 11, 2024

Determine any cross-implementation API requirements pq-code-package/tsc#4

Open

bifurcation linked a pull request Nov 13, 2024 that will close this issue

Use seed as private key format for ML-KEM #1994

Draft

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support 64-byte seed form of ML-KEM private keys #1985

Support 64-byte seed form of ML-KEM private keys #1985

bifurcation commented Nov 8, 2024

baentsch commented Nov 9, 2024

bifurcation commented Nov 10, 2024

baentsch commented Nov 10, 2024

bhess commented Nov 11, 2024 •

edited

Loading

baentsch commented Nov 11, 2024

dstebila commented Nov 12, 2024

baentsch commented Nov 12, 2024

Support 64-byte seed form of ML-KEM private keys #1985

Support 64-byte seed form of ML-KEM private keys #1985

Comments

bifurcation commented Nov 8, 2024

baentsch commented Nov 9, 2024

bifurcation commented Nov 10, 2024

baentsch commented Nov 10, 2024

bhess commented Nov 11, 2024 • edited Loading

baentsch commented Nov 11, 2024

dstebila commented Nov 12, 2024

baentsch commented Nov 12, 2024

bhess commented Nov 11, 2024 •

edited

Loading