Codex32 (BIP93) BIP85 Application reference implementation #68

BenWestgate · 2025-08-19T19:58:49Z

Closes #67
WIP

Deterministically generates codex32 share sets from entropy using the BIP85-DRNG instead of physical dice rolls.

I still need to add support to the cli, add BIP93 test vectors and keep the BIP85 pull request concordant with this implementation.

The BIP85 pull request diff (also draft) can be viewed here:
bitcoin/bips@master...BenWestgate:bips:codex32


akarve · 2025-09-02T17:57:56Z

@BenWestgate howdy i didn't see this until now. contributions are welcome. mentioning this because it's now closed and i was not sure if you closed because it wasn't reviewed? in any case, that was not a silent no.

BenWestgate · 2025-09-02T21:09:55Z

@akarve I closed this because I ended up forking ethankosakovsky/bip85@master...BenWestgate:bip85:master to create a reference implementation of #67, so merging this is no longer necessary. If it appears likely that bip85 application 93' (codex32) will merge to bips/master, reopening this should save you a few minutes updating bipsea.

akarve · 2025-09-02T22:33:51Z

@BenWestgate copy. idk if ethan will ever review your PR tho as we were unable to get a hold of him for recent BIP-85 changes. in any case i'm open to a new app (still grokking details though). did you look at and/or consider the dice application? it can generate passwords of any length over any character set (e.g. bech32)?

BenWestgate · 2025-09-03T00:26:32Z

@akarve I don't expect ethan to ever review ethankosakovsky/bip85@master...BenWestgate:bip85:master, in the BIP-0085 text PR I linked to benwestgate/bip85 as the v1.4.0 reference.

Yes, I did consider using the dice application. However, that could have led to some differences between implementations. We also need derivation path indexing to make sure the DRNG is seeded with unique entropy whenever the identifier, threshold, n, bit length or hrp parameters change. Using the dice application would mean having to store which indices were already used, rather than feeding the entire header and payload length into the derivation path. Unintentionally generating two identical child backups would be a huge security flaw.

Also, a codex32 secret is analogous to a bip39 mnemonic and serves the same objective, so if both are BIPs, it seems they both belong in BIP85.

BenWestgate · 2025-09-10T22:15:39Z

@akarve Thank you for offering to help! I just saw your ML feedback.

It's unavoidable that codex32 needs the most parameters of any BIP85 application. So what would help me most is if you adapt the cli to accept more parameters and write the tests for my codex32 library. Then I'll finish bipsea's bip93 application implementation, making it as easy to read as possible with a simplified derivation path, add the application's tests and test vectors, and then update the BIP-0085.mediawiki document.

Regarding path simplification:

we can eliminate t, n, byte_length and the 4 identifier indices and have implementations sum them into one value.

I'm partial to a derivation path of the form {hrp}'/{t*5 + n + byte_length}'/{index}', where the identifier is computed from the index: index=0 gives qqqq, index=1 gives qqqp, etc. This prevents reusing the identifier for different sets of shares.

We can drop the complication of allowing individual identifier characters to be defaulted, so once the index hits 1048576 and beyond, the whole thing can default to the fingerprint.
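The index-to-identifier mapping described above could be sketched roughly as follows. This is only an illustration, not code from the PR: the function name and the `fingerprint_ident` parameter (standing in for whatever 4-character default the fingerprint would yield) are hypothetical, and the big-endian digit order is an assumption chosen so that index=1 gives qqqp.

```python
# Standard bech32 alphabet; each character encodes 5 bits.
BECH32_CHARSET = "qpzry9x8gf2tvdw0s3jn54khce6mua7l"
IDENT_LEN = 4
IDENT_SPACE = 32 ** IDENT_LEN  # 1048576 distinct 4-character identifiers


def index_to_identifier(index: int, fingerprint_ident: str) -> str:
    """Map a bip85 index to a 4-character identifier (hypothetical helper).

    Indexes at or beyond 32^4 fall back to the caller-supplied default,
    which stands in for the fingerprint-based identifier.
    """
    if index < 0:
        raise ValueError("index must be non-negative")
    if index >= IDENT_SPACE:
        return fingerprint_ident
    # Most significant 5-bit group first, so index 1 ends in 'p'.
    return "".join(
        BECH32_CHARSET[(index >> shift) & 31] for shift in (15, 10, 5, 0)
    )
```

With this ordering, consecutive indexes produce identifiers that differ in the last character first, which matches the qqqq, qqqp progression in the comment.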

akarve · 2025-09-13T23:13:30Z

@BenWestgate

> So what would help me most is you adapt the cli to accept more parameters

easiest thing to do is probably take raw paths (arbitrary length and structure) with a new escape hatch cli switch. it'll be a while before i can get to this; lots to do at day job :/

> Regarding path simplification:

no need to retrofit the spec to the CLI. i just want to make sure that we have exactly as many segments as needed and no more? if it's already at a minimum, yolo.

i had some comments on byte draws and unpacking as well, wdyt?

> Drawing and truncating a single byte per character is certainly clear to understand but I keep wondering if implementers shouldn't draw ((chars * 5) // 8) bytes in one shot? These bytes aren't expensive or anything but it's less iterations, less total reads, etc. If my suggestion becomes hard to read or write then feel free to keep as is.
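For illustration, a one-shot draw followed by 5-bit unpacking might look like the sketch below. The helper names are hypothetical and nothing here is taken from bipsea. One assumption worth flagging: the byte count is rounded up rather than floored, because flooring (chars * 5) // 8 leaves too few bits for some lengths (4 characters need 20 bits, but 2 bytes hold only 16).

```python
# Standard bech32 alphabet; each character encodes 5 bits.
BECH32_CHARSET = "qpzry9x8gf2tvdw0s3jn54khce6mua7l"


def bytes_needed(chars: int) -> int:
    """Smallest number of whole bytes holding chars * 5 bits (rounded up)."""
    return (chars * 5 + 7) // 8


def unpack_5bit(data: bytes, chars: int) -> str:
    """Read the leading chars * 5 bits of data as bech32 characters.

    Any trailing bits beyond chars * 5 are discarded.
    """
    total_bits = len(data) * 8
    if total_bits < chars * 5:
        raise ValueError("not enough bytes for the requested characters")
    acc = int.from_bytes(data, "big") >> (total_bits - chars * 5)
    return "".join(
        BECH32_CHARSET[(acc >> (5 * (chars - 1 - i))) & 31]
        for i in range(chars)
    )
```

A caller would read `bytes_needed(chars)` bytes from the DRNG once and pass them to `unpack_5bit`, instead of reading and truncating one byte per character.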

BenWestgate · 2025-09-14T19:40:57Z

> just want to make sure that we have exactly as many segments as needed and no more? if it's already at a minimum, yolo.

The only segment that could be dropped is the identifier:

It SHOULD be unique across different seeds like the bip85 {index} and we could encode the bip85 index to 4 bech32 characters and use it as the identifier. Then default to a fingerprint as identifier after index 32^4.
Doesn't require 4 paths to encode 20 bits, a single path can do.

My other consolidation was to combine threshold, n and byte_length into a single path segment, by concatenating them as decimals. I don't see the downside of this versus separate path depths for each.
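One way to read "concatenating them as decimals" is sketched below, with hypothetical function names. It assumes fixed-width decimal fields (two digits each for n and byte_length), which is not stated in the thread. The reason for that assumption: plain string concatenation can collide once a leading zero is dropped, e.g. t=0, n=29, byte_length=16 and t=2, n=9, byte_length=16 would both become 2916.

```python
from typing import Tuple


def pack_params(threshold: int, num_shares: int, byte_length: int) -> int:
    """Pack t, n and byte_length into one integer path segment.

    Layout (an assumption): t * 10000 + n * 100 + byte_length, i.e. the
    decimal digits "t nn ll" with n and byte_length zero-padded to 2 digits.
    """
    if threshold == 1 or not 0 <= threshold <= 9:
        raise ValueError("threshold must be 0 or 2 through 9")
    if not 1 <= num_shares <= 31:
        raise ValueError("num_shares must be 1 through 31")
    if not 16 <= byte_length <= 64:
        raise ValueError("byte_length must be 16 through 64")
    return threshold * 10_000 + num_shares * 100 + byte_length


def unpack_params(segment: int) -> Tuple[int, int, int]:
    """Inverse of pack_params: recover (threshold, num_shares, byte_length)."""
    threshold, rest = divmod(segment, 10_000)
    num_shares, byte_length = divmod(rest, 100)
    return threshold, num_shares, byte_length
```

The largest packed value under these ranges is 93164, well below the 2^31 limit for a hardened BIP32 index.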

Give me your thoughts on encoding the bip85 index into the codex32 identifier.

It's 4 characters intended to disambiguate different backups. You want them to all have unique identifiers, so an identifier in the derivation path seems to enable risky behavior (different backups with the same identifier) when the bip85 index is incremented.

My original idea dropped the index for this reason but that breaks bip85 conventions.

> i had some comments on byte draws and unpacking as well, wdyt?

I prefer your suggestion to draw threshold * byte_length bytes in one shot, because then our initial t byte strings can be padded to a multiple of 5 bits with a CRC for more error detection.

akarve · 2025-10-06T02:13:24Z

@BenWestgate howdy i didn't forget about this and hoping to get to it later this quarter

BenWestgate · 2025-10-06T03:39:03Z

@akarve I haven't forgotten about it either. I will adapt your feedback into a bipsea pull request this month.

BenWestgate · 2025-10-06T22:44:01Z

I got a start on it, added a codex32 application

ben@zenbook15:~/Documents/GitHub/bipsea$ poetry run bipsea codex32 --help
Usage: bipsea codex32 [OPTIONS]

  Generate a BIP-93 codex32 backup from `secrets.randbits`.

Options:
  -h, --hrp TEXT                  Codex32 human-readable prefix.
  -l, --length INTEGER RANGE      Number of secret bytes.  [16<=x<=64]
  -t, --threshold INTEGER RANGE   Number of shares required to reconstruct the
                                  secret.  [0<=x<=9]
  -n, --num-shares INTEGER RANGE  Total number of shares to generate.
                                  [1<=x<=31]
  -i, --identifier TEXT           Optional identifier to include in each
                                  share.
  -p, --indices TEXT              String of unique characters to use as share
                                  indices.
  --pretty / --not-pretty         Print a number before, and a newline after,
                                  each codex32 share.
  --help                          Show this message and exit.

While this grabs from /dev/urandom and not bip85 entropy it's directly applying the byte_length bytes rather than drawing characters. Most of it will be reused when I do the bip85 app.

akarve · 2025-10-06T23:16:56Z

one thing to consider especially if it makes your life easier: what about just implementing:

bipsea apply -a arbitrary -p "what/ever/derivation/path/the/user/wants"

i mention this because 1-2 other new applications are in PR now and i plan to implement the same (but you are welcome to tackle it) because it basically makes all future apps just work. and then we don't need to even touch the reference app for most cases.

i have not thought carefully about other params and switches per application so there might be more to plan here. we could possibly soft warn if it's not a recognized application code.

akarve · 2025-10-06T23:20:03Z

> there might be more to plan here

maybe we need a python protocol or something and then new implementations just have their own path handlers under arbitrary? and they could even register proper apps if they want to. just thinking out loud, do whatever is best for codex32.
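A rough sketch of what such a protocol plus registry could look like is below. Everything in it is hypothetical (the `Bip85App` protocol, `register_app`, `apply_app`, the stub handler); none of these names exist in bipsea, and the hex fallback for unrecognized codes is just one guess at what "arbitrary" might do.

```python
from typing import Callable, Dict, Protocol, Sequence


class Bip85App(Protocol):
    """Hypothetical callback shape: derived entropy plus path segments in,
    application-encoded string out."""

    def __call__(self, entropy: bytes, path: Sequence[str]) -> str: ...


_REGISTRY: Dict[str, Bip85App] = {}


def register_app(code: str) -> Callable[[Bip85App], Bip85App]:
    """Decorator that registers a handler under an application code."""

    def decorator(handler: Bip85App) -> Bip85App:
        if code in _REGISTRY:
            raise ValueError(f"application {code} is already registered")
        _REGISTRY[code] = handler
        return handler

    return decorator


def apply_app(code: str, entropy: bytes, path: Sequence[str]) -> str:
    """Dispatch to a registered handler, or fall back to raw hex output."""
    handler = _REGISTRY.get(code)
    if handler is None:
        # Unrecognized code: behave like an "arbitrary" path and emit hex.
        return entropy.hex()
    return handler(entropy, path)


@register_app("93'")
def codex32_stub(entropy: bytes, path: Sequence[str]) -> str:
    # Placeholder only; a real handler would call into the codex32 package.
    return f"codex32 stub: {len(entropy)} bytes, path={'/'.join(path)}"
```

Under this shape, an external package would only need to import the decorator and register its handler, and the reference app would not change per application.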

BenWestgate · 2025-10-07T08:18:46Z

> one thing to consider especially if it makes your life easier: what about just implementing:
>
> bipsea apply -a arbitrary -p "what/ever/derivation/path/the/user/wants"
> ...and then we don't need to even touch the reference app for most cases.

That sounds nice for bipsea maintenance, but the point of applications is that they standardize how the derived entropy is encoded.

The derivation itself is not useful, any BIP32 library will provide that.

BenWestgate · 2025-10-07T12:17:30Z

> do whatever is best for codex32.

I published a codex32 project on PyPI so Bipsea can import it for the proposed bip85 application 93'.

recover: recovers a codex32 secret from a list of strings, hopefully. It should also support space and comma separation. An optional --target derives a share other than the secret; it should probably warn somewhere to be sure the target index is unused in this backup. The --codex32 flag for xprv accepts a validated set of codex32 strings or piped input from bipsea recover and

BenWestgate · 2025-10-13T23:52:34Z

See if these changes to the README.md are amenable to you before I implement them all.

BenWestgate@965f020

If you want the scope limited then I'll move them to codex32[cli] however they're analogous to the mnemonic, validate and xprv commands for BIP39.

You can skim the diff below to see what functionality is new.

My python-codex32 library has full passing tests from bip93.

The sooner you let me know functionality to leave out of bipsea, the better.
https://github.com/BenWestgate/bipsea/pull/2/files

akarve · 2025-10-14T00:06:48Z

haven't looked at this yet what i'm hoping to do is create some kind of python protocol for apps and you just register callbacks. it might be easier if yours is just a PyPI package that bipsea depends on but I haven't thought about that yet. the protocol would then call into your module.

more importantly did you see these comments on your PR? that's a more fundamental bridge for us to cross first.

akarve · 2025-10-14T00:08:35Z

src/bipsea/bip93.py

@@ -0,0 +1,358 @@
+#!/bin/python3


this for sure should be a module dependency. idk if the authors have it on PyPI but you could perhaps host it as part of a codex module?

BenWestgate (reply): I put it on PyPI this week! And a much cleaner library at that, with full BIP-93 tests.

https://pypi.org/project/codex32/

Do you want me to put the codex32[cli] stuff in there so we can drop commands like:
bipsea codex32
bipsea recover

and just PR in the bipsea derive -a codex32 needed for the BIP-0085 application? And some tests for that new bip85 application?

I'll try to make it composable so with both libraries installed users can do things like:

bipsea derive -a codex32 -t3 | codex32 recover to print a codex32 secret from a derived codex32 backup


BenWestgate · 2025-10-14T03:45:03Z

> what i'm hoping to do is create some kind of python protocol for apps and you just register callbacks.

That sounds like a great way to do this.

> more importantly did you see these comments on your PR? that's a more fundamental bridge for us to cross first.

Yes, he convinced me to drop some parameters.

The bip85 app should just produce the initial k (threshold) shares, those are the only ones that need deterministic entropy.

We only need: hrp in the first derivation level, the second can be length (bytes) like other applications, and the final can be bip85 index like other applications.

Index should affect the encoded identifier IF we allow users to provide their own, otherwise the default bip32 fingerprint should be unique for each seed and vice versa.

num_shares is dropped and so is share_idx, we don't need those to get the minimum data that defines a set (the initial threshold of strings).


BenWestgate · 2025-10-14T05:27:48Z

src/bipsea/bip85.py

+    elif app == APPLICATIONS["codex32"]:
+        header, n_bytes = indexes[:2]
+        hrp, data = bech32_decode(header)
+        if hrp not in INDEX_TO_HRP:
+            raise ValueError(f"Unsupported human-readable prefix: {hrp}.")
+        k = int(CHARSET[data[0]])
+        if k == 1:
+            raise ValueError(
+                f"Threshold '{k}' is not an allowed value (2 through 9, or 0)."
+            )
+        ident = header[1:5]
+        byte_length = int(n_bytes.rstrip("'"))
+        if not 16 <= byte_length <= 64:
+            raise ValueError(
+                f"Byte length '{byte_length}' is not an allowed value (16 through 64)."
+            )
+        drng = DRNG(entropy)
+        alphabetized_charset = "sacdefghjk"  # threshold above 9 is invalid
+        shares = []
+        for share_idx in alphabetized_charset[bool(k) : k + 1]:
+            # append the whole string; `shares += <str>` would add it one character at a time
+            shares.append(
+                Codex32String.from_seed(
+                    drng.read(byte_length), ident, hrp, k, share_idx
+                ).s
+            )
+
+        return {
+            "entropy": entropy,
+            "application": " ".join(shares),
+        }


I think this new proposed path and codex32 derivation address both your and scgbckbone's feedback:

Fewer derivation levels: {header}, {n_bytes}, {index}
Fewer parameters: {share_idx} and {num_shares} are dropped

Indices output are deterministic based on k. No derived shares, just the initial k

{header} is the first 8 characters of a codex32 string, it would be some serialization of {hrp}|{threshold}{identifier} and fits if converted from bech32 to an int.

I also want to feed the bip85 app {index} into the ident as it should be unique for different seeds
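To make the "indices are deterministic based on k" point concrete, here is a standalone sketch of just the index selection and payload draw, separate from the codex32 encoding. The function names are hypothetical. It assumes the BIP85-DRNG is a SHAKE256 stream seeded with the derived entropy, and it relies on SHAKE256 output being prefix-consistent, so slicing one long digest matches successive `drng.read(byte_length)` calls.

```python
import hashlib
from typing import Dict

# 's' is the secret index; the rest are share indices in alphabetical order.
SHARE_INDEX_ORDER = "sacdefghjk"


def initial_share_indices(k: int) -> str:
    """Indices of the strings generated for threshold k.

    k == 0 yields only the unshared secret 's'; k >= 2 yields the first
    k share indices ('a', 'c', 'd', ...).
    """
    if k == 1 or not 0 <= k <= 9:
        raise ValueError("threshold must be 0 or 2 through 9")
    return SHARE_INDEX_ORDER[int(bool(k)) : k + 1]


def draw_payloads(entropy: bytes, k: int, byte_length: int) -> Dict[str, bytes]:
    """Draw one byte_length payload per initial index from a SHAKE256 stream."""
    indices = initial_share_indices(k)
    stream = hashlib.shake_256(entropy).digest(len(indices) * byte_length)
    return {
        idx: stream[i * byte_length : (i + 1) * byte_length]
        for i, idx in enumerate(indices)
    }
```

For example, `draw_payloads(entropy, 3, 16)` returns payloads for indices a, c and d; any further shares would be derived from those by interpolation rather than drawn from the DRNG.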

BenWestgate · 2025-10-21T20:02:20Z

> more importantly did you see these comments on your PR? that's a more fundamental bridge for us to cross first.

@akarve I gave his excellent feedback a reply. If you also like the proposal I can implement it in bipsea in the coming week or so.

I'm now thinking: each bip85 index = each seed. Regardless of the threshold and identifier. Threshold only matters when extracting from the DRNG, so share payloads are unique at different thresholds.

For the identifier, we either default to a bip32 fingerprint and let users relabel the strings to change it, or, if we want "resharing" functionality (multiple sets, same threshold), feed it into the DRNG seed so that different identifiers produce different share payloads (but still recover the same secret).

BenWestgate added 2 commits August 19, 2025 14:36

Add codex32 application to bip85.py

4271f95

fix imports, ValueError, don't assign unused data, remove junk code b…

37e4c9c

…lock

BenWestgate marked this pull request as draft August 19, 2025 19:59

BenWestgate added 6 commits August 19, 2025 18:26

BIP93: fix t-n = 1 bug, fixup imports, sanity checks

0c1bcc7

Create bip93.py

a505d0d

bip85 vectors: add BIP93 test vectors

fbdb997

WIP: placeholder for the full BIP93 test vectors

275bf26

Unsure to find these, haven't looked, may have to make them from the BIP93 text.

test_BIP85: add test_codex32 to validate vectors

e9d4c11

Checks the derived entropy, codex32 strings, hrp, threshold, identifier, bytes length, share count match the vector and that the set is valid.

WIP: Create test_bip93.py need to find test vector

535d47b

BenWestgate closed this Sep 2, 2025

akarve mentioned this pull request Sep 2, 2025

Proposal: Add Codex32 (BIP93) as a BIP85 Application #67

Open

BenWestgate reopened this Sep 10, 2025

akarve mentioned this pull request Sep 10, 2025

BIP85: Add Codex32 as application 93' bitcoin/bips#1958

Open

BenWestgate added 7 commits October 6, 2025 17:34

Add INDEX_TO_HRP dict

3abca70

Remove electrum, add entropy_to_codex32

276f0e5

Create settings.json

66ca66d

Add cli tool "codex32" for generating codex32

bfa13bd

Update bip93_vectors.py

b1873d5

fix import typo

193aae3

Update test_bip93.py

33777e0

BenWestgate added 5 commits October 12, 2025 10:54

fix codex32 help string as it uses token_bytes

d93858d

Add codex32 & recover features & bip85 codex32 app

965f020

Add codex32 dependency for bip93 encoding.

9a4ae77

bip85: Import things from codex32 we'll need

65bd0b1

akarve reviewed Oct 14, 2025

View reviewed changes

BenWestgate added 4 commits October 13, 2025 22:17

Update README.md

2c63697

Update README.md

3b34cc7

Update pyproject.toml

86308ba

Merge pull request #3 from BenWestgate/patch-2

676b864

Combine patches 1 and 2

BenWestgate added 11 commits October 13, 2025 23:17

Delete test_bip93.py

5b2f1e3

Importing the codex32 library which has this test coverage

Delete bip93_vectors.py

f24e6ac

Importing the codex32 library which tests these vectors

import codex32, delete non-bip85 codex32 commands

9c26d8b

Update pyproject.toml to import codex32

143e81e

doc: add example of the proposed bip85 codex32 app

df2a599

Delete bip93.py

be05e2e

Merge branch 'patch-1' of https://github.com/BenWestgate/bipsea into …

ddb119e

…patch-1

Delete settings.json

14fd2d8

Update .gitignore

38814cb

Update bipsea.py

0a9708c

new bip85 bip93 derive proposal

a1b860b

BenWestgate commented Oct 14, 2025

View reviewed changes
