LiteBalancer

LiteBalancer routes LiteAPI requests across liteservers.

Peer Metadata

A complete peer record should include:

liteserver socket address,
public key,
connection state,
last masterchain seqno,
latency EWMA,
in-flight request count,
last success time,
last failure time,
archival capability,
reconnect attempt count.

Peer States

Healthy: normal candidate.
Suspect: degraded candidate.
Dead: not eligible.
Recovering: reconnect or probe in progress.

Scoring

Priority should consider:

state,
archive requirement,
masterchain freshness,
EWMA latency,
in-flight load,
recent failures.

Retry Policy

Retry on:

connection reset,
timeout,
ADNL transport error,
end of stream.

Do not blindly retry on:

liteServer.error,
contract execution exit code,
malformed local request,
proof verification failure.

Current offline tests cover representative retry and non-retry paths with in-memory peers: typed helper calls retry after ADNL transport errors, do not retry liteServer.error, and do not retry local BoC decode failures. The same tests pin the current failure bookkeeping behavior: failed attempts decrement in-flight counters, update request statistics, remove the peer from the alive set, and mark the peer Dead.

Rate Limiting

The balancer supports two independent limiter placements:

per-peer limits stored on each owned LiteClient;
a global limit stored on LiteBalancer.

The global limiter is acquired in the shared request execution path after a peer has been selected and before in-flight counters are incremented. Retries therefore consume additional tokens. send_message also consumes one global token per peer attempt, not one token for the high-level method call.

Missing Work

Reconnect descriptors.
EWMA implementation.
Reconnect and timeout state-machine tests.
Shared method dispatch with LiteClient.
Live validation against rented liteserver quota behavior.

Keyboard shortcuts