
Conversation

@popduke (Contributor) commented Nov 7, 2025

  • Base-KV improvements:
    Loosens the merge prerequisite on config alignment, enhances the RedundantRangeRemovalBalancer to detect and purge zombie ranges, refactors the local engine for pluggability, and adds pluggable split hinters. Also rolls out multiple optimizations: snapshot session reuse, obsolete snapshot filtering, a leaner installation flow, lower split/merge memory pressure, faster distributed range lookup, RocksDB internal metrics toggles, redefined WAL compaction thresholds, an improved WAL read path, and better default local-engine settings.

  • Inbox service optimizations:
    Cuts read/write overhead in InboxStoreCoProc and accelerates BatchInsertRequest marshalling.

  • Multi-tenancy enhancements:
    Adds tenant-level observability metrics and introduces a tenant-level switch to publish will messages during shutdown.

  • Bug fixes:
    Resolves a NoSuchElementException when parsing AgentMetadata from CRDT and eliminates task duplication in BatchQueryCall.

Commits (titles truncated in the page capture):

  • …equisite
    2. RedundantRangeRemovalBalancer supports detecting and removing 'zombie' ranges
  • …sDB Snapshot
    2. enable heuristic compaction for data engine by default for dist/inbox/retain
  • … WAL compaction
    2. add a metric to monitor linearization latency
  • …earDown and improve server shutdown handling
    1. move signalling fetch to dedicated executor
    2. reduce flush operation
    2. enable merge behavior for EngineConfig
    3. close rocksdb object in preferred order
*/
public static <T> T parse(ByteString bytes, Parser<T> parser) throws InvalidProtocolBufferException {
    CodedInputStream input = bytes.newCodedInput();
    input.enableAliasing(true); // zero-copy: parsed bytes fields may alias the backing array of 'bytes'
    return parser.parseFrom(input); // assumed completion of the excerpt, shown here for readability
}
@Gujiawei-Edinburgh (Contributor) commented Nov 10, 2025

With this PR, the zero-copy parser is used more widely. What about the case of direct buffer memory usage?

@popduke (Contributor, Author) replied:

The zero-copy parser is only applied when parsing a PB from a ByteString backed by a byte array. For direct buffers, things get much more complex; that will be considered in a future release.

@Gujiawei-Edinburgh (Contributor) replied:

I'm not quite sure. The enableAliasing Javadoc says: "Enables ByteString aliasing of the underlying buffer, trading off on buffer pinning for data copies. Only valid for buffer-backed streams." So I think the buffer pinning here means the direct buffer memory would stay pinned until the parsed PB object is garbage collected.
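For readers following this thread, a small hypothetical sketch of the trade-off being discussed (not code from this PR; it uses the stock BytesValue wrapper message from protobuf-java): with aliasing enabled, a bytes field of the parsed message can be a view over the input buffer instead of a copy, which is exactly why the input stays reachable, i.e. pinned, for as long as the parsed object is alive.

```java
import com.google.protobuf.ByteString;
import com.google.protobuf.BytesValue;
import com.google.protobuf.CodedInputStream;

public final class AliasingSketch {
    public static void main(String[] args) throws Exception {
        // Serialize a BytesValue wrapper carrying a small payload.
        ByteString wire = BytesValue.newBuilder()
            .setValue(ByteString.copyFromUtf8("hello"))
            .build()
            .toByteString();

        CodedInputStream input = wire.newCodedInput();
        input.enableAliasing(true); // zero-copy: bytes fields may alias 'wire' instead of copying

        BytesValue parsed = BytesValue.parser().parseFrom(input);

        // parsed.getValue() may share the buffer behind 'wire'; no defensive copy was made,
        // so the source buffer (a direct buffer, in the case raised above) stays reachable
        // for as long as 'parsed' is alive.
        System.out.println(parsed.getValue().toStringUtf8());
    }
}
```

The point of the sketch is only the lifetime coupling: once aliasing is enabled, the parsed object can keep the source buffer alive.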

for (IKVEngineProvider p : providers.values()) {
    if (p.type().equalsIgnoreCase(type)) {
        if (found != null) {
            throw new IllegalStateException("Duplicate storage engine provider type: " + type);
        }
        found = p; // assumed continuation of the excerpt
    }
}
@Gujiawei-Edinburgh (Contributor) commented Nov 11, 2025

Would it be better to log a warning and ignore the duplicate providers?

@popduke (Contributor, Author) replied:

This is by design. It's dangerous to have local engine implementations with conflicting type names running in a BifroMQ process, since the loading order is not guaranteed across restarts.
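A minimal sketch of the loading-order concern, assuming providers are discovered through java.util.ServiceLoader (an assumption for illustration, not necessarily the mechanism used in the PR): discovery follows classpath order, which is not a stable contract across restarts or repackaging, so silently keeping "the first provider seen" could resolve to a different engine after a redeploy.

```java
import java.util.ServiceLoader;

// Hypothetical SPI shape; the real IKVEngineProvider interface in the PR has more to it.
interface StorageEngineProviderSpi {
    String type();
}

final class ProviderDiscoverySketch {
    public static void main(String[] args) {
        // Discovery order depends on the classpath, so a warn-and-ignore policy for
        // duplicate types would make the surviving provider an accident of packaging.
        for (StorageEngineProviderSpi p : ServiceLoader.load(StorageEngineProviderSpi.class)) {
            System.out.println("discovered provider type: " + p.type());
        }
    }
}
```

Failing fast with an IllegalStateException, as the excerpt above does, turns that nondeterminism into an immediate, visible startup error instead of a silent behavior change.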

boundary.getEndKey().toStringUtf8()));
try (IKVSpaceIterator itr = new RocksDBKVSpaceIterator(new RocksDBSnapshot(dbHandle, null), boundary,
new IteratorOptions(false, 52428))) {
for (itr.seekToFirst(); itr.isValid(); itr.next()) {
@Gujiawei-Edinburgh (Contributor) commented:

During this loop, if an error occurs midway, what happens to the data that has already been flushed in the same loop? Each operation checks shouldFlush.

@popduke (Contributor, Author) replied:

Data migration is not transactional at this level; the split/merge process ensures consistency.
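To illustrate what "not transactional at this level" means in practice, here is a small hypothetical sketch (plain maps instead of BifroMQ's KV spaces; the flush threshold stands in for the shouldFlush check): entries are flushed batch by batch, so a failure midway leaves the earlier batches in the target, and it is the surrounding split/merge protocol that decides whether to retry the migration or discard the partially populated range.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class RangeCopySketch {
    // Copy source entries into target, flushing whenever the pending batch exceeds flushBytes.
    static void copy(Map<String, byte[]> source, Map<String, byte[]> target, int flushBytes) {
        List<Map.Entry<String, byte[]>> batch = new ArrayList<>();
        int pending = 0;
        for (Map.Entry<String, byte[]> e : source.entrySet()) {
            batch.add(e);
            pending += e.getKey().length() + e.getValue().length;
            if (pending >= flushBytes) {   // analogous to the shouldFlush check in the loop above
                flush(batch, target);      // once flushed, these entries are not rolled back on failure
                batch.clear();
                pending = 0;
            }
        }
        flush(batch, target);              // flush the final partial batch
    }

    private static void flush(List<Map.Entry<String, byte[]>> batch, Map<String, byte[]> target) {
        for (Map.Entry<String, byte[]> e : batch) {
            target.put(e.getKey(), e.getValue());
        }
    }

    public static void main(String[] args) {
        Map<String, byte[]> src = Map.of("a", new byte[] {1}, "b", new byte[] {2, 3});
        Map<String, byte[]> dst = new HashMap<>();
        copy(src, dst, 4);
        System.out.println("copied " + dst.size() + " entries");
    }
}
```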

@popduke merged commit 2b342e9 into apache:main on Nov 12, 2025
6 checks passed
@popduke deleted the feat-merge-protocol-upgrade branch on November 12, 2025 at 01:18