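What this document actually covers
The capacity controller documentation explains how advanced orchestration keeps workload, delegation, or concurrency from exceeding the operator's control surface.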
deepseek-tui includes an opt-in capacity-aware context controller. In the default V4 path it is disabled, because its active interventions can rewrite the live prompt and break prefix-cache affinity. Treat it as telemetry or an experimental guardrail unless capacity.enabled = true is set explicitly.
Each checkpoint computes:
- H_hat (runtime pressure proxy)
- C_hat (model capacity prior)
- slack = C_hat - H_hat
- a rolling profile over the last N=8 observations
Runtime pressure (H_hat)
- action_complexity_bits = log2(1 + action_count_this_turn)
- tool_complexity_bits = log2(1 + tool_calls_recent_window)
- ref_complexity_bits = log2(1 + unique_reference_ids_recent_window)
- context_pressure_bits = 6.0 * context_used_ratio
Formula:
H_hat = 0.35*action_complexity_bits + 0.30*tool_complexity_bits + 0.20*ref_complexity_bits + 0.15*context_pressure_bits
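As a worked instance of the formula above, here is a minimal Python sketch. The function name `runtime_pressure` and the sample inputs are illustrative assumptions and are not taken from deepseek-tui. Only the four terms and their weights come from this page.

```python
import math

# Weights copied from the H_hat formula on this page.
WEIGHTS = {"action": 0.35, "tool": 0.30, "ref": 0.20, "context": 0.15}


def runtime_pressure(action_count_this_turn, tool_calls_recent_window,
                     unique_reference_ids_recent_window, context_used_ratio):
    """Return H_hat for one checkpoint (hypothetical helper, not the real API)."""
    action_bits = math.log2(1 + action_count_this_turn)
    tool_bits = math.log2(1 + tool_calls_recent_window)
    ref_bits = math.log2(1 + unique_reference_ids_recent_window)
    context_bits = 6.0 * context_used_ratio
    return (WEIGHTS["action"] * action_bits
            + WEIGHTS["tool"] * tool_bits
            + WEIGHTS["ref"] * ref_bits
            + WEIGHTS["context"] * context_bits)


# Sample turn: 3 actions, 7 recent tool calls, 15 unique references,
# and half of the context window in use.
h_hat = runtime_pressure(3, 7, 15, 0.5)
print(round(h_hat, 2))  # prints 2.85
```

With these inputs the four terms contribute 0.70, 0.90, 0.80, and 0.45. Because the first three terms grow logarithmically, doubling the tool-call count adds roughly 0.30 to H_hat, while context usage contributes linearly up to 0.90 at a full window.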
Capacity prior (C_hat)
Per-model priors:
- deepseek_v3_2_chat = 3.9
- deepseek_v3_2_reasoner = 4.1
- deepseek_v4_pro = 3.5
- deepseek_v4_flash = 4.2
- fallback default = 3.8 (used for other DeepSeek IDs, including future releases)
Failure probability (p_fail)
Using rolling profile fields:
- final_slack
- min_slack
- violation_ratio
- slack_volatility
- slack_drop
Formula:
z = -1.65*final_slack -0.85*min_slack +1.35*violation_ratio +0.70*slack_volatility +0.28*slack_drop -0.12
p_fail = sigmoid(z) clamped to [0,1].
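The following Python sketch evaluates the p_fail formula above. The function name `failure_probability` is an assumption for illustration. How the five rolling profile fields are derived from the observation window is not described here, so the sketch takes them as inputs.

```python
import math


def failure_probability(final_slack, min_slack, violation_ratio,
                        slack_volatility, slack_drop):
    """Return p_fail = sigmoid(z), clamped to [0, 1] (hypothetical helper)."""
    z = (-1.65 * final_slack
         - 0.85 * min_slack
         + 1.35 * violation_ratio
         + 0.70 * slack_volatility
         + 0.28 * slack_drop
         - 0.12)
    # Numerically stable sigmoid: never exponentiates a large positive number.
    if z >= 0:
        p = 1.0 / (1.0 + math.exp(-z))
    else:
        e = math.exp(z)
        p = e / (1.0 + e)
    return min(1.0, max(0.0, p))


# Neutral profile: zero slack, no violations, no volatility, no drop.
p = failure_probability(0.0, 0.0, 0.0, 0.0, 0.0)
print(round(p, 3))  # about 0.47
```

The signs match the intuition behind the formula: positive slack lowers z and therefore p_fail, while violations, volatility, and slack drops raise it. A neutral profile lands just under the default low_risk_max of 0.50.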
Risk bands:
- low: p_fail <= low_risk_max
- medium: p_fail <= medium_risk_max
- high: otherwise
Action mapping when the controller is explicitly enabled:
- low -> NoIntervention
- medium -> TargetedContextRefresh
- high and severe (min_slack <= severe_min_slack or violation_ratio >= severe_violation_ratio) -> VerifyAndReplan
- high otherwise -> VerifyWithToolReplay
When enabled, the engine evaluates controller policy at:
- (… MessageRequest assembly)
Interventions are not part of the default v0.7.5 V4 path. The default path is to append messages, preserve prefix-cache reuse, suggest a manual /compact near real model pressure, and use overflow recovery only if the request would exceed the model input budget.
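The risk bands and action mapping that apply when the controller is enabled can be sketched in Python as follows. Several things here are assumptions: the band labels "low", "medium", and "high", the pairing of the medium band with TargetedContextRefresh and of the non-severe high band with VerifyWithToolReplay (read from the order of the lists), and the names `CapacityConfig`, `risk_band`, and `choose_action`. Thresholds use the documented [capacity] defaults. Cooldowns and min_turns_before_guardrail are deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CapacityConfig:
    """Subset of the [capacity] keys, with the documented defaults."""
    low_risk_max: float = 0.50
    medium_risk_max: float = 0.62
    severe_min_slack: float = -0.25
    severe_violation_ratio: float = 0.40


def risk_band(p_fail, cfg):
    """Classify p_fail into an assumed low / medium / high band."""
    if p_fail <= cfg.low_risk_max:
        return "low"
    if p_fail <= cfg.medium_risk_max:
        return "medium"
    return "high"


def choose_action(p_fail, min_slack, violation_ratio, cfg=CapacityConfig()):
    """Map a risk band to an intervention name (assumed pairing, see lead-in)."""
    band = risk_band(p_fail, cfg)
    if band == "low":
        return "NoIntervention"
    if band == "medium":
        return "TargetedContextRefresh"
    # High band: escalate to a replan only when the profile is severe.
    severe = (min_slack <= cfg.severe_min_slack
              or violation_ratio >= cfg.severe_violation_ratio)
    return "VerifyAndReplan" if severe else "VerifyWithToolReplay"


print(choose_action(0.47, min_slack=0.10, violation_ratio=0.0))   # NoIntervention
print(choose_action(0.80, min_slack=-0.30, violation_ratio=0.0))  # VerifyAndReplan
```

Keeping the thresholds in one immutable config object makes it easy to see how a change to low_risk_max or medium_risk_max shifts which turns would trigger an intervention.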
- TargetedContextRefresh: … (compact_messages_safe) when possible.
- VerifyWithToolReplay
- VerifyAndReplan
  - … (max_replay_per_turn).
Path:
- DEEPSEEK_CAPACITY_MEMORY_DIR (if set)
- ~/.deepseek/memory/<session_id>.jsonl
- <workspace>/.deepseek/memory/<session_id>.jsonl when home path is unavailable/unwritable
Record fields:
- id, ts, turn_index, action_trigger
- h_hat, c_hat, slack, risk_band
- canonical_state
- source_message_ids
- replay_info
The loader utility supports fetching the last K snapshots for rehydration.
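To illustrate fetching the last K snapshots from a per-session .jsonl file, here is a minimal Python sketch. The function name `load_last_snapshots`, the demo file name, and the sample values are hypothetical. The actual loader in deepseek-tui may differ, and only a few of the record fields listed above are written here.

```python
import json
import tempfile
from collections import deque
from pathlib import Path


def load_last_snapshots(path, k):
    """Return the last k JSONL records from path, oldest first."""
    if k <= 0:
        return []
    # A bounded deque keeps only the newest k records while streaming the file.
    tail = deque(maxlen=k)
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                tail.append(json.loads(line))
    return list(tail)


# Demo: write five snapshot records to a temporary file, then read back two.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "demo-session.jsonl"
    with open(path, "w", encoding="utf-8") as handle:
        for turn in range(5):
            record = {"id": f"snap-{turn}", "turn_index": turn,
                      "h_hat": 2.0 + 0.25 * turn, "c_hat": 3.5}
            record["slack"] = record["c_hat"] - record["h_hat"]
            handle.write(json.dumps(record) + "\n")
    recent = load_last_snapshots(path, 2)

print([r["turn_index"] for r in recent])  # prints [3, 4]
```

Streaming through a bounded deque keeps memory use proportional to K rather than to the length of the session log, which matters for long sessions that append one record per checkpoint.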
[capacity] keys:
- enabled (default false)
- low_risk_max (default 0.50)
- medium_risk_max (default 0.62)
- severe_min_slack (default -0.25)
- severe_violation_ratio (default 0.40)
- refresh_cooldown_turns (default 6)
- replan_cooldown_turns (default 5)
- max_replay_per_turn (default 1)
- min_turns_before_guardrail (default 4)
- profile_window (default 8)
- deepseek_v3_2_chat_prior (default 3.9)
- deepseek_v3_2_reasoner_prior (default 4.1)
- deepseek_v4_pro_prior (default 3.5)
- deepseek_v4_flash_prior (default 4.2)
- fallback_default_prior (default 3.8)
Equivalent environment overrides are available with DEEPSEEK_CAPACITY_*.
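As a compact summary of the defaults above, the Python sketch below overlays user-supplied [capacity] values on the documented defaults. The function name `resolve_capacity_config`, the rejection of unknown keys, and the check that low_risk_max does not exceed medium_risk_max are assumptions made for illustration. They are not documented behavior, and the sketch does not model the DEEPSEEK_CAPACITY_* environment overrides.

```python
# Documented [capacity] defaults from this page.
DEFAULTS = {
    "enabled": False,
    "low_risk_max": 0.50,
    "medium_risk_max": 0.62,
    "severe_min_slack": -0.25,
    "severe_violation_ratio": 0.40,
    "refresh_cooldown_turns": 6,
    "replan_cooldown_turns": 5,
    "max_replay_per_turn": 1,
    "min_turns_before_guardrail": 4,
    "profile_window": 8,
    "deepseek_v3_2_chat_prior": 3.9,
    "deepseek_v3_2_reasoner_prior": 4.1,
    "deepseek_v4_pro_prior": 3.5,
    "deepseek_v4_flash_prior": 4.2,
    "fallback_default_prior": 3.8,
}


def resolve_capacity_config(overrides):
    """Overlay overrides on DEFAULTS (hypothetical helper with assumed checks)."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise KeyError(f"unknown [capacity] keys: {sorted(unknown)}")
    merged = {**DEFAULTS, **overrides}
    # Assumed sanity check: the bands only make sense if low <= medium.
    if merged["low_risk_max"] > merged["medium_risk_max"]:
        raise ValueError("low_risk_max must not exceed medium_risk_max")
    return merged


cfg = resolve_capacity_config({"enabled": True, "medium_risk_max": 0.70})
print(cfg["enabled"], cfg["medium_risk_max"], cfg["profile_window"])  # True 0.7 8
```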