1
0
health_monitor/端到端主流程.md

9.9 KiB
Raw Permalink Blame History

PRDv5 全流程时序图(严格依据 v5 文档与引用约束)

说明:

  • 本文仅使用 PRDv5 中明确存在的对象、状态与动作;不引入任何未在文档出现的状态或动作。
  • 对象与状态集合(摘自 PRD
    • Task.status ∈ {created, running, finished, failed, revoking, revoked}
    • Mention.status ∈ {active, invalid}
    • Candidate.status ∈ {pending, locked, orphaned}
    • Confirm.status ∈ {CONFIRMED, PUBLISHED, ROLLED_BACK}
    • sync_status ∈ {PENDING, SYNCING, SYNCED, FAILED}
  • 统一过滤约束revoking / revoked 的查询/内部消费需执行过滤(参见 2.11)。

A. 端到端主流程(创建任务 → 回调写入 → 生成候选 → 审核确认 → 发布 → 图投影)

sequenceDiagram
autonumber
actor Admin as Admin管理员
participant UI as 前端UI
participant API as 后端API服务
participant PG_Task as PostgreSQLkg_task
participant Ext as 外部抽取服务
participant PG_M as PostgreSQLkg_entity_mention/kg_relation_mention
participant PG_Cand as PostgreSQLkg_*_candidate
participant PG_Conf as PostgreSQLkg_*_confirm
participant Audit as PostgreSQLaudit_log
participant SyncW as 同步Worker
participant Neo4j as Neo4j投影

Note over Admin,PG_Task: 任务创建与门禁3.1.3.1 / 3.1.3.1.1
Admin->>UI: 配置任务Schemaext_entity_codes / ext_relation_codes
UI->>API: 提交创建任务请求(创建门禁检查)
API->>PG_Task: 创建kg_taskstatus=created持久化tenant上下文
API-->>UI: 返回task信息external_task_id在触发抽取时获取

Note over API,Ext: 任务运行期status: running → finished/failed
API->>Ext: 触发抽取外部异步执行分配external_task_id
Ext-->>PG_Task: 任务状态更新running → finished/failed由系统对接P0仅约束状态口径

Note over Ext,API: 抽取结果回调5.1 回调/导入)
Ext->>API: POST /api/callback/extractexternal_task_id, records...
API->>PG_Task: 基于external_task_id反查kg_task仅接受status=finished
API->>PG_Task: 行级锁/门禁(单消费者/重复通知幂等处理)
API->>PG_M: 同事务写入Mention
activate API
API->>API: Schema白名单裁决 + Task生命周期过滤\n→ mention.status=active/invalid3.2.4
API->>Audit: 写审计filtered_by_schema / callback_*
deactivate API
API-->>Ext: 回调处理结果(成功/冲突)

Note over Admin,PG_Cand: 候选生成3.3 / 5.3
Admin->>UI: 在任务详情发起“由mentions生成candidate”
UI->>API: POST /api/candidates/from-mentions
API->>PG_M: 仅读取status=active 且 task非revoking/revoked 的mentions\n统一可见性过滤 2.11
API->>PG_Cand: 单事务聚合去重 + 规范化 + 决策canonical\n→ candidate.status=pending3.3.4
API-->>UI: 返回候选数据

Note over Admin,PG_Conf: 审核确认至Confirm3.4 / 5.3
Admin->>UI: 在候选审核页选择通过approve
UI->>API: POST /api/candidates/{id}/approve
API->>PG_Conf: Confirm Upsert复合键/UID→ status=CONFIRMED\n统一置 sync_status='PENDING'3.4/4.3
API->>PG_Cand: 将被锁定的候选设为lockedlocked_by_confirm=true
API->>Audit: 写审计review.accept / merge 等)
API-->>UI: 返回审核结果

Note over Admin,PG_Conf: 发布6.1 / 5.4
Admin->>UI: 发起发布publish
UI->>API: POST /api/publish?task_id=...(影响预估/阻断策略)
API->>PG_Conf: 事务内将相关Confirm置 status=PUBLISHED\n统一置 sync_status='PENDING'
API->>Audit: 写审计publish
API-->>UI: 返回发布受理结果

Note over SyncW,Neo4j: 同步器执行4.3 指令集)
SyncW->>PG_Conf: 扫描 sync_status IN ('PENDING','FAILED')
SyncW->>PG_Conf: 原子置 SYNCING加锁避免并发
alt status=PUBLISHED
  SyncW->>Neo4j: 幂等MERGE/SET节点/关系及属性)
else status=ROLLED_BACK
  SyncW->>Neo4j: 物理删除DETACH DELETE
end
SyncW->>PG_Conf: 写回 sync_status='SYNCED' 或 'FAILED'
SyncW->>Audit: 写审计action='sync',记录结果)

B. 回滚全过程(任务 → Mention → Candidate → Confirm → 投影清理)

sequenceDiagram
autonumber
actor Admin as Admin管理员
participant UI as 前端UI
participant API as 后端API服务
participant PG_Task as PostgreSQLkg_task
participant PG_M as PostgreSQLkg_*_mention
participant PG_Cand as PostgreSQLkg_*_candidate
participant PG_Conf as PostgreSQLkg_*_confirm
participant Audit as PostgreSQLaudit_log
participant SyncW as 同步Worker
participant Neo4j as Neo4j投影

Note over Admin,PG_Task: 回滚触发6.2
Admin->>UI: 发起回滚rollback
UI->>API: POST /api/rollback?task_id=...(影响预估/阻断策略)
API->>PG_Task: 事务首步将task.status置为revoking强中间态可见
API->>Audit: 写审计rollback 受理)

Note over API,PG_M: 生命周期与可见性2.11 / 3.2.4
API->>PG_M: 与该task关联的mentions在查询/下游消费中被过滤\nrevoking/revoked 全局过滤契约)
API-->>UI: Candidate/图谱界面呈现“回撤处理中,只读提示”

Note over API,PG_Conf: Confirm 源集合更新3.4 / 6.2
API->>PG_Conf: 从source_tasks中移除该task_id若为空→active_flag=false\n必要时关系随实体失活按3.7.5约束)
API->>PG_Conf: 统一置 sync_status='PENDING'(脏标记)
API->>Audit: 写审计rollback 细节)

Note over API,PG_Task: 任务完成回撤
API->>PG_Task: 将task.status置为revoked终态
API-->>UI: 返回回滚完成

Note over SyncW,Neo4j: 投影清理4.3
SyncW->>PG_Conf: 扫描 sync_status IN ('PENDING','FAILED')
SyncW->>PG_Conf: 原子置 SYNCING
alt status=ROLLED_BACK
  SyncW->>Neo4j: 物理删除DETACH DELETE
else status=PUBLISHED保持发布态但属性/来源已变更)
  SyncW->>Neo4j: 幂等MERGE/SET更新属性、来源集合
end
SyncW->>PG_Conf: 写回 sync_status='SYNCED' 或 'FAILED'
SyncW->>Audit: 写审计action='sync'

约束摘录:

  • revoking 首步可见2.11 / 6.2所有读取统一过滤revoking/revoked 不可被消费)。
  • 关系恢复与失活遵循 3.7.5:任一端实体失活 → 关系失活;回滚的物理扩散通过 sync_status=PENDING 驱动同步器清理 Neo4j。

C. 候选与确认的联动与并发安全(审核通过 → Confirm Upsert → 锁定候选)

sequenceDiagram
autonumber
participant API as 后端API服务
participant PG_Cand as PostgreSQLkg_*_candidate
participant PG_Conf as PostgreSQLkg_*_confirm
participant Audit as PostgreSQLaudit_log

API->>PG_Cand: 读取待审核候选status=pending\n统一过滤task非revoking/revoked
API->>PG_Conf: Upsert Confirm实体或关系\n- 命中ROLLED_BACK则复位为CONFIRMED复活规则\n- 统一置 sync_status='PENDING'
API->>PG_Cand: 条件更新pending→lockedlocked_by_confirm=true
API->>Audit: 审计写入review.accept / relation_uid_mismatch 等)

并发与一致性要点(摘自 3.3.3、3.4、4.3

  • 候选状态更新采用条件更新或行级锁,禁止盲写。
  • Confirm Upsert、复活与脏标记sync_status=PENDING必须同事务完成。
  • 审核拒绝不引入 Candidate 新状态(保持 pending通过 audit_log 表达。

D. 同步器标准指令集(实体/关系投影的一致性保障)

sequenceDiagram
autonumber
participant SyncW as 同步Worker
participant PG_Conf as PostgreSQLkg_*_confirm
participant Neo4j as Neo4j投影
participant Audit as PostgreSQLaudit_log

SyncW->>PG_Conf: 扫描 sync_status ∈ {'PENDING','FAILED'}(批次/按租户)
SyncW->>PG_Conf: 原子置 SYNCING加锁防重
alt Confirm.status == 'PUBLISHED'
  SyncW->>Neo4j: MERGE 节点/关系 + SET 属性 + 合并source_tasks
else Confirm.status == 'ROLLED_BACK'
  SyncW->>Neo4j: 物理删除DETACH DELETE
else Confirm.status == 'CONFIRMED'
  Note right of SyncW: 保持仅事实确认不投影(不对图操作)
end
alt 成功
  SyncW->>PG_Conf: 更新 sync_status='SYNCED'
  SyncW->>Audit: 写审计action='sync', result='success'
else 失败
  SyncW->>PG_Conf: 更新 sync_status='FAILED'
  SyncW->>Audit: 写审计action='sync', result='failed', error_msg
end

要点4.3

  • 脏标记必须与业务写入同事务PENDING 幂等)。
  • 同步器不改变业务事实,仅保障投影一致性与可观测性。

E. 统一可见性过滤(跨层防脏读约束)

sequenceDiagram
autonumber
participant DAO as 统一DAO/Service封装
participant PG_Task as PostgreSQLkg_task
participant Reader as 读取者Candidate生成器/查询)

Reader->>DAO: selectVisible(...)(封装统一过滤)
DAO->>PG_Task: Join kg_task 并过滤\nWHERE kg_task.status NOT IN ('revoking','revoked')
DAO-->>Reader: 返回过滤后的可见数据集

要点2.11 / 3.2.4 / 3.3.3

  • 过滤逻辑必须统一封装调用(例如 baseMapper.selectVisible禁止散落实现。
  • Candidate 生成器、图谱读取等内部流程均需复用本封装。

F. 状态机摘录对照PRD冻结口径

  • Task3.7.1
    • created → running → finished/failed → revoking → revoked
  • Mention3.7.2
    • — → activetask.finished\nactive → invalidtask.revoking/revoked\ninvalid → activetask.re-import 同一任务)
  • Candidate3.7.3
    • — → pending由mentions生成\npending → lockedconfirm.publish 或映射既有Confirm\nlocked → locked/pendingrollback 后视 source_tasks 是否为空)\npending → orphaned无来源孤立\norphaned → pending新来源出现
  • Confirm3.4、3.7.4/3.7.5
    • CONFIRMED → PUBLISHED → ROLLED_BACK复活命中ROLLED_BACK时复位为CONFIRMED
  • sync_status4.3
    • PENDING → SYNCING → SYNCED / FAILED失败可重试

以上时序与状态转换严格沿用 PRDv5 所定义的动作与状态集合。