Skill · data-pipeline-walkthrough · Satellite Agent end to end in 10 steps

The 10 steps, one by one

The number to the left of each step is its task id; the commands can be copied and run as-is.

STEP 1 · SCRAPE

Scrape X posts

expert/X/x_agent scrape: twikit + cookies, 47 seed companies

/Users/john/InvesResearch/agent/scripts/run_x_scrape.sh \
  --only SpaceX \
  --tweets-per-account 30

elapsed 19s · SpaceX 55→57

⚠ elonmusk is not in SEED_COMPANIES, so it is silently ignored and no error is raised
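
The silent drop described above could be surfaced with a small validation step. The following is only a sketch: the contents of `SEED_COMPANIES` and the `resolve_only` helper are hypothetical stand-ins, not the actual x_agent code.

```python
import logging

logger = logging.getLogger("x_scrape")

# Hypothetical stand-in for the real seed list (47 companies per this walkthrough).
SEED_COMPANIES = {"SpaceX", "ExampleCo"}


def resolve_only(only, seeds):
    """Return the --only names that are in seeds, warning about the rest."""
    known = [name for name in only if name in seeds]
    for name in only:
        if name not in seeds:
            # Surface the mismatch instead of dropping the name silently.
            logger.warning("--only %r is not in SEED_COMPANIES; skipping", name)
    return known
```

With this in place, `resolve_only(["SpaceX", "elonmusk"], SEED_COMPANIES)` keeps `SpaceX` and logs one warning for `elonmusk`.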

STEP 2 · INGEST

Ingest scraped data

x.sqlite3 → ontology classification → agent.db.events

X_SQLITE_PATH=/Users/john/InvesResearch/expert/X/data/x.sqlite3 \
python3 -m satellite_agent.cli job run \
  x-ingest-daily

fetched 500 · filtered 147 · ingested 0 · dup 353

⚠ The fetch CLI does not support x-sqlite; it can only be invoked indirectly through a job
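
The counters above add up (147 + 0 + 353 = 500), which is consistent with each fetched row ending in exactly one of three outcomes. A minimal sketch of that bookkeeping, assuming a primary-key based dedup; the table layout and the `ingest` helper are illustrative, not the real satellite_agent schema:

```python
import sqlite3


def ingest(rows, conn):
    """Insert (event_id, relevant, text) rows and return per-outcome counts."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events (event_id TEXT PRIMARY KEY, text TEXT)"
    )
    stats = {"fetched": 0, "filtered": 0, "ingested": 0, "dup": 0}
    for event_id, relevant, text in rows:
        stats["fetched"] += 1
        if not relevant:
            stats["filtered"] += 1  # dropped by classification
            continue
        cur = conn.execute(
            "INSERT OR IGNORE INTO events (event_id, text) VALUES (?, ?)",
            (event_id, text),
        )
        # rowcount is 1 for a new row and 0 when the key already exists.
        if cur.rowcount == 1:
            stats["ingested"] += 1
        else:
            stats["dup"] += 1
    conn.commit()
    return stats
```

Running the same batch twice against one database yields `ingested 0` on the second pass, with everything relevant counted as `dup`, which matches the shape of the numbers above.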

STEP 3 · DECISION

Weekly decision report (CEO + investment)

Phase 3a dual-view weekly report, rule-based V1.1

python3 -m satellite_agent.cli decision \
  --view both --window 7 --format md

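window events 0 · all 5 threads on watch

⚠ When the window is empty, every Δ=0; that reflects what is actually in the window and is not a bug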

STEP 4 · THESIS

Thread score baseline

Baseline for the 5 threads in the thesis_state table (a 30-day window is recommended)

python3 -m satellite_agent.cli thesis \
  --refresh --window 30

核心网 (core network) 3.25 · 芯片 (chips) 1.2 · 终端 (terminals) 0.95

⚠ thesis --refresh must run first; otherwise the Step 8 report has no non-zero scores
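
The ordering constraint above can be made explicit with a small runner. This is a sketch only: the `run_steps` helper and the dry-run default are assumptions for illustration, while the two CLI invocations are the ones shown in Steps 4 and 8.

```python
import subprocess

CLI = ["python3", "-m", "satellite_agent.cli"]

# Order matters: thesis --refresh populates thesis_state before report reads it.
STEPS = [
    ("thesis", ["thesis", "--refresh", "--window", "30"]),
    ("report", ["report", "--window", "30", "--format", "md"]),
]


def run_steps(steps, dry_run=True):
    """Run each step in order; a failing step stops the sequence."""
    executed = []
    for name, args in steps:
        cmd = CLI + args
        if dry_run:
            print("would run:", " ".join(cmd))
        else:
            # check=True raises CalledProcessError on a non-zero exit code.
            subprocess.run(cmd, check=True)
        executed.append(name)
    return executed
```

Calling `run_steps(STEPS)` only prints the commands; `run_steps(STEPS, dry_run=False)` executes them and aborts before the report if the refresh fails.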

STEP 5 · DEBATE

Multi-agent debate

Bull/Bear/Judge × N threads, rule-based / LLM versions

python3 -m satellite_agent.cli debate \
  --thread 核心网 --window 30 --format md

核心网 bullish ×1.30 · 终端 bullish ×1.12 · 芯片 split ×0.96

⚠ When bear=0 the verdict goes blindly bullish and hits the 1.3 cap
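
To illustrate why a missing bear side can pin the multiplier at the cap, here is a hypothetical net-ratio rule. The formula, the 0.3 weight, the 0.70 floor, and the `prior` pseudo-count are all assumptions made for this sketch; only the 1.3 cap comes from the walkthrough, and the real Judge rule may differ.

```python
CAP = 1.30    # upper bound quoted in this walkthrough
FLOOR = 0.70  # assumed symmetric lower bound, not confirmed by the source


def naive_multiplier(bull, bear):
    """Net-ratio rule: saturates at CAP whenever bear == 0 and bull > 0."""
    total = bull + bear
    if total == 0:
        return 1.0
    raw = 1.0 + 0.3 * (bull - bear) / total
    return max(FLOOR, min(CAP, raw))


def smoothed_multiplier(bull, bear, prior=4):
    """Same rule, but `prior` pseudo-counts pull thin evidence toward 1.0."""
    raw = 1.0 + 0.3 * (bull - bear) / (bull + bear + prior)
    return max(FLOOR, min(CAP, raw))
```

Under these assumptions a single bull argument with no bear argument already reaches the cap in the naive rule, while the smoothed variant only approaches it as evidence accumulates.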

STEP 6 · TRIGGER + ALERT

Threshold triggers + risk alerts

trigger set/check + alerts query

python3 -m satellite_agent.cli trigger set \
  --thread 核心网 \
  --type thread_sentiment_below \
  --params '{"thread":"核心网","threshold":1.0,"window":7}'

30-day alerts 2

⚠⚠⚠ thread_sentiment_below reads the window sentiment, not thesis_state.score, so an empty window always raises a false alert
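
The false-alert mechanism and a possible `min_events` guard can be sketched as follows. This assumes an empty window evaluates to a sentiment of 0.0, which is an inference from the behaviour described above; the function names are hypothetical and do not mirror the real trigger code.

```python
from typing import List, Optional


def window_sentiment(scores: List[float]) -> float:
    """Mean sentiment of the window's events; 0.0 when there are none (assumed)."""
    return sum(scores) / len(scores) if scores else 0.0


def sentiment_below(scores: List[float], threshold: float,
                    min_events: int = 0) -> Optional[bool]:
    """Return the check result, or None when the window has too few events."""
    if len(scores) < min_events:
        return None  # too little data to judge, so do not alert
    return window_sentiment(scores) < threshold
```

With `threshold=1.0`, `sentiment_below([], 1.0)` fires on an empty window, whereas `sentiment_below([], 1.0, min_events=1)` returns `None` and the caller can skip the alert.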

STEP 7 · VALIDATE

ADVICE D V1 comparison

corpus v2.0 (96 samples), 4 fields scored exact/partial/mismatch

python3 -m satellite_agent.cli validate \
  --format md

overall 27% · threads 37% · impact 55%

⚠ Rule-based baseline; raising it is left to the real LLM compare in D V2

STEP 8 · REPORT

Weekly report

7-day investment research weekly report: 5 threads + key events + risks + watchlist

python3 -m satellite_agent.cli report \
  --window 30 --format md

events 8 · risks 2 · companies named 6

STEP 9 · NOTIFY

Feishu notification

EH-3 Feishu webhook delivery check

python3 -m satellite_agent.cli notify-test \
  --webhook 'https://open.feishu.cn/open-apis/bot/v2/hook/XXX' \
  --card

this run: actual send skipped

⚠ The webhook URL is a secret: keep it in .env and never commit it to the repo
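
A sketch of reading the webhook from the environment and building a signed payload. The environment variable names and the `build_payload` helper are hypothetical. The signing scheme follows my understanding of Feishu's custom-bot signature check (HMAC-SHA256 keyed by the timestamp, a newline, and the secret, over an empty message, then base64) and should be verified against the official documentation before use.

```python
import base64
import hashlib
import hmac
import os
import time
from typing import Optional


def feishu_sign(timestamp: int, secret: str) -> str:
    """HMAC-SHA256 with key '<timestamp>' + newline + '<secret>' and an empty message."""
    key = f"{timestamp}\n{secret}".encode("utf-8")
    digest = hmac.new(key, digestmod=hashlib.sha256).digest()
    return base64.b64encode(digest).decode("utf-8")


def build_payload(text: str, secret: Optional[str] = None,
                  now: Optional[int] = None) -> dict:
    """Build a text message; add timestamp and sign only when a secret is set."""
    payload = {"msg_type": "text", "content": {"text": text}}
    if secret:
        ts = int(time.time()) if now is None else now
        payload["timestamp"] = str(ts)
        payload["sign"] = feishu_sign(ts, secret)
    return payload


# Hypothetical variable names; the real .env keys are not given in this document.
WEBHOOK_URL = os.environ.get("FEISHU_WEBHOOK_URL", "")
WEBHOOK_SECRET = os.environ.get("FEISHU_WEBHOOK_SECRET")
```

Nothing is sent here; the payload would be posted as JSON to `WEBHOOK_URL` by whatever HTTP client the project already uses.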

STEP 10 · ARCHIVE

skill + docs

This SKILL.md + this HTML page + 6 walkthrough md files

ls agent/reports/walkthrough-2026-06-09/
ls skills/data-pipeline-walkthrough/

archived 6 md · skill SKILL.md + skill.html

ADVICE D V1 comparison table

field	exact	partial	mismatch	n/a	exact rate
threads	26	16	29	25	37%
thesis_impact	53	0	43	0	55%
strategy	51	0	45	0	53%
thread_in_focus	33	0	36	27	48%
overall (≥ 3 fields exact)	26/96	-	-	-	27%
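
The per-field exact rates in the table are reproduced when n/a samples are left out of the denominator. That exclusion is inferred from the numbers rather than stated in the source; the `exact_rate` helper below is a sketch of the arithmetic, not the validator's code.

```python
def exact_rate(exact, partial, mismatch):
    """Share of scored samples that are exact; n/a samples are excluded."""
    scored = exact + partial + mismatch
    return exact / scored if scored else 0.0


ROWS = {
    "threads": (26, 16, 29),
    "thesis_impact": (53, 0, 43),
    "strategy": (51, 0, 45),
    "thread_in_focus": (33, 0, 36),
}

for field, counts in ROWS.items():
    print(f"{field}: {exact_rate(*counts):.0%}")
# threads: 37%
# thesis_impact: 55%
# strategy: 53%
# thread_in_focus: 48%
```

The overall row uses a different rule: 26 of the 96 samples have at least 3 fields exact, and 26/96 rounds to 27%.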

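Findings from this run + follow-up

Real issues exposed while running the pipeline end to end; they are not fixed in this skill and go to NEXT-STEPS

⚠ 5 candidate bugs / improvements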

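Step 1, elonmusk silently ignored: when the --only value is not in SEED_COMPANIES, no warning is logged. A 10-line code follow-up.
Step 2, fetch CLI lacks x-sqlite support: ingestion can only go through job run. A --source x-sqlite option could be added.
⚠⚠⚠ Step 6, trigger false alerts (important): thread_sentiment_below looks at the window sentiment rather than thesis_state.score, so an empty window (no new events) always fires, even with the thesis at 3.25. Suggested fix: read thesis_state instead, or add a min_events guard.
Step 9, Feishu webhook is unsigned: notify.py currently supports only the unsigned mode; adding sign mode would improve security.
Step 7, ADVICE 27% baseline: the blind spots of the rule-based version are now exposed; the real LLM compare in D V2 is left as a follow-up.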
