rfc: proposal for standard private key handling by frrist · Pull Request #54 · storacha/RFC

frrist · 2025-06-03T00:01:44Z

alanshaw

👍 I'm fine with this, for the reason of switching to PEM is mildy better from the standpoint of developer familiarity - I'm not convinced there's really any concrete advantages beyond that, which the existing format doesn't already support.

alanshaw · 2025-06-03T06:10:40Z

+- PEM files (with inconsistent encryption practices).
+- JSON files (non-standard, service-specific schemas).
+- Base64 encoded strings (raw key material, no metadata).
+- Multibase base64 padded strings (uncommon in standard tooling).


Ok a few observations:

I believe you introduced PEM.

I don't know if we've ever actually used JSON to pass around private keys although I know the w3 CLI has the option to print them as JSON.

I don't believe we've ever passed plain base64 encoded strings around but maybe I'm incorrect.

So there was only really the 1 format prior - for which some of the reasons below don't apply.

I'm not trying to oppose the proposal here but I think if this was phrased more as "switch from multibase base64 padded private keys to PEM" then I think it would be more accurate.

You're absolutely right on all points - thank you for the clarification.

I don't believe we've ever passed plain base64 encoded strings around but maybe I'm incorrect.

You're correct - looking at my shell history, we've been using multibase base64 padded strings, not plain base64.

I believe you introduced PEM

Yes, I did introduce PEM to the storage node. The current state is:

storage id gen returns JSON with a DID and multibase-base64-padded private key.

I added --type=pem option for PEM output (needed for Curio's API authentication).

I added storage id parse to convert between formats, and to extract DIDs from PEM files.

The PEM format was specifically introduced to work with Curio's API, which requires PEM-formatted public keys for authentication.

I think if this was phrased more as "switch from multibase base64 padded private keys to PEM" then I think it would be more accurate

Yeah, the accurate framing is probably: "Switch from multibase base64 padded private keys to PEM". JSON is only used as an output wrapper in some cases, not as a key format itself.

It's worth noting that at the end of this doc the need for custom tooling to extract DIDs from PEM files is outlined. Curious what your thoughts are on this.

alanshaw · 2025-06-03T06:13:54Z

+
+### Why PEM?
+
+1. **Universal tooling support**: OpenSSL, ssh-keygen, and all major crypto libraries support PEM natively.


Yes, but typically we want to display the DID at the same time, so we have to dip back into our own tooling anyways.

Yeah, mentioned that here: https://github.com/storacha/RFC/pull/54/files#diff-d7bd9ff16dbeeb26dbb100c755bc613b14dfd9603e87cbcd5a7e56a694586c9cR170

alanshaw · 2025-06-03T06:18:08Z

+### Why PEM?
+
+1. **Universal tooling support**: OpenSSL, ssh-keygen, and all major crypto libraries support PEM natively.
+2. **Self-documenting format**: Clear headers (`-----BEGIN PRIVATE KEY-----`) identify content type.


🤷 the multibase encoded string is also self describing - the decoded bytes are also prefixed with the alogorithm. PEM has the advantage of being human readable to some extent.

My main concerns with multibase for private key material center around tooling, file conventions, and security.

Most common cryptographic tools (OpenSSL, ssh-keygen, keytool, etc.) have native support for PEM format when working with private keys. With multibase-encoded keys, we typically need custom tooling or conversion steps for common operations like key generation, inspection, or format conversion.

PEM has well-established conventions - .pem files, standard permissions (600), and clear practices around key storage. With multibase strings, we'd need to define our own conventions: What file extension? Plain text files? JSON wrapper?

PEM has built-in support for passphrase encryption. For multibase, we'd need to implement our own encryption scheme or wrap it in another format, essentially recreating what PEM already provides

At the end of the day, specifically for the storage node, I'd prefer to maintain key material in a file, rather than a string passed over the CLI or environment variable.

alanshaw · 2025-06-03T06:18:41Z

+
+1. **Universal tooling support**: OpenSSL, ssh-keygen, and all major crypto libraries support PEM natively.
+2. **Self-documenting format**: Clear headers (`-----BEGIN PRIVATE KEY-----`) identify content type.
+3. **Algorithm agnostic**: Supports our current Ed25519 keys and future algorithms (RSA, ECDSA, etc.).


Again, same for existing format.

alanshaw · 2025-06-03T06:25:55Z

+2. **No environment variables**: Private keys MUST NOT be passed via environment variables.
+3. **Encryption at rest**: Keys SHOULD be encrypted when stored.
+4. **Memory handling**: Keys SHOULD be loaded once at startup, not repeatedly read.
+5. **Secure deletion**: Key material SHOULD be securely overwritten when removed from memory.


We have historically used AWS SSM for deployed services and would continue to do so regardless of encoding (I understand this is not the same situation for storage nodes).

frrist · 2025-06-03T15:42:13Z

+## DID Derivation from Private Keys
+
+Users often need to derive a DID (Decentralized Identifier) from the private key for identity purposes. This section briefly covers how to extract the DID from a PEM-encoded private key.
+
+### What is a DID and How do we Encode ours?
+
+A DID is a decentralized identifier that corresponds to your public key. In our system, Ed25519 keys produce DIDs in the format: `did:key:...` where the suffix is a Base58BTC-encoded representation of the public key with multicodec prefixes.
+
+### Deriving DID from PEM
+
+At present, there does not exist any standard tooling to derive a DID used by our system from a PEM File containing a private key, or public key for that matter. This necessitates the creation of a custom CLI tool which allows users to derive a DID string from their public key.


@alanshaw assuming we align on PEM as our mechanism for handling key material, curious to hear your thoughts on these bits.

rfc: proposal for standard private key handling

7f2f552

frrist self-assigned this Jun 3, 2025

frrist requested a review from a team June 3, 2025 00:03

alanshaw approved these changes Jun 3, 2025

View reviewed changes

frrist commented Jun 3, 2025

View reviewed changes

alanshaw mentioned this pull request Jun 4, 2025

AgentData exists and can persist to filesystem storacha/guppy#21

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rfc: proposal for standard private key handling#54

rfc: proposal for standard private key handling#54
frrist wants to merge 1 commit intomainfrom
frrist/rfc/private-keys

frrist commented Jun 3, 2025

Uh oh!

alanshaw left a comment

Uh oh!

alanshaw Jun 3, 2025

Uh oh!

frrist Jun 3, 2025

Uh oh!

alanshaw Jun 3, 2025

Uh oh!

frrist Jun 3, 2025

Uh oh!

alanshaw Jun 3, 2025

Uh oh!

frrist Jun 3, 2025

Uh oh!

alanshaw Jun 3, 2025

Uh oh!

alanshaw Jun 3, 2025

Uh oh!

frrist Jun 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		### Why PEM?

		1. Universal tooling support: OpenSSL, ssh-keygen, and all major crypto libraries support PEM natively.

Conversation

frrist commented Jun 3, 2025

Uh oh!

alanshaw left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants