Responsibility Boundaries + Team Structure
Source: Notion | Last edited: 2025-10-30 | ID: 29b2d2dc-3ef...
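🧭 I. Overall Logic: Divide by “Functional Domain”, Not by Tech Stack
Following the natural layering and data flow of QuantOS, we split the work into six core teams + two supporting teams.
Each team can operate independently and can also collaborate with the others through explicit API / contract interfaces.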
┌────────────────────────────────┐
│ AI Agent / Strategy Studio     │ ← Research / DSL / Experimentation
├────────────────────────────────┤
│ Quant Compiler & Orchestration │ ← Translate DSL → DAG → Execution
├────────────────────────────────┤
│ Data & Feature Layer           │ ← Ingest, clean, store, serve features
├────────────────────────────────┤
│ Execution Layer / Engine       │ ← Risk, OMS, Routing, Gateways
├────────────────────────────────┤
│ Governance & Observability     │ ← Metadata, lineage, audit, policies
├────────────────────────────────┤
│ Infra & DevOps Platform        │ ← CI/CD, secrets, K8s, Terraform
├────────────────────────────────┤
│ Security & Compliance          │ ← Vault, OPA, KMS, internal audit
├────────────────────────────────┤
│ Product & System Integration   │ ← Spec alignment, UX, docs, roadmap
└────────────────────────────────┘
🧩 II. Team Structure and Responsibilities
1. Team A — Data & Feature Layer
Keywords: ingestion · schema · feature engineering · ML infra
Responsibilities:
- Design a unified Data Schema Registry and Feature Store (offline + online);
- Build ETL / streaming pipelines (Redpanda + Iceberg + ClickHouse);
- Maintain the Data Lake and Feature Catalog;
- Provide efficient data access interfaces (Arrow Flight / Trino / Feast).
Required skills:
- Python / SQL / Airbyte / dbt / Spark / Flink / ClickHouse;
- Data modeling, schema design, time-series understanding;
- Familiarity with S3 / Iceberg / data versioning.
Security considerations:
- Tiered data access (read-only / write / admin);
- Data masking and PII segregation;
- Schema version pinning to prevent feature drift.
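The schema version pinning item above can be illustrated with a small sketch. It assumes a hypothetical in-memory `SchemaRegistry` and `FeatureSchema` (neither name comes from QuantOS; a real registry would be a persistent service). The idea shown is only that a published version is immutable and that consumers request an explicit version rather than "latest".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeatureSchema:
    name: str
    version: int
    columns: tuple  # ordered (column_name, dtype) pairs


class SchemaRegistry:
    """Hypothetical in-memory registry used only for illustration."""

    def __init__(self):
        self._schemas = {}

    def register(self, schema):
        key = (schema.name, schema.version)
        if key in self._schemas and self._schemas[key] != schema:
            # A published version is locked: any change needs a new version.
            raise ValueError(f"{schema.name} v{schema.version} is already locked")
        self._schemas[key] = schema

    def resolve(self, name, pinned_version):
        # Consumers always pass an explicit version, never "latest".
        try:
            return self._schemas[(name, pinned_version)]
        except KeyError:
            raise LookupError(f"{name} v{pinned_version} is not registered") from None
```

Under this sketch, changing a column dtype requires registering version 2, and any pipeline still pinned to version 1 keeps resolving the original definition.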
2. Team B — Strategy Compiler & Orchestration
Keywords: DSL · DAG · Argo · Ray · autoscaling
Responsibilities:
- Design the QuantOS DSL and compiler;
- Maintain DAG generation logic and runtime parameter management;
- Orchestrate training, backtesting, and experiment jobs (Argo / Ray);
- Own scheduling policy, compute scheduling, and job lifecycle management;
- Manage the model artifacts registry and lineage.
Required skills:
- Python (DSL parser, compiler AST), Kubernetes, Argo, Ray;
- Terraform, Helm, YAML schema, event-driven systems;
- Familiarity with gRPC, JSON schema validation.
Security considerations:
- Execution sandbox;
- Secrets injection control;
- Job-level permission scopes;
- Quota management.
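The DSL → DAG → execution flow that this team owns can be sketched in a few lines. The spec format below (a dict with `steps`, each with a `name` and an optional `after` list) is a hypothetical stand-in, not the QuantOS DSL. The ordering uses `graphlib.TopologicalSorter` from the Python standard library (3.9+).

```python
from graphlib import TopologicalSorter


def compile_to_dag(spec):
    """Map each step name to the set of step names it depends on."""
    dag = {}
    for step in spec["steps"]:
        dag[step["name"]] = set(step.get("after", []))
    # Reject references to steps that were never declared.
    unknown = {dep for deps in dag.values() for dep in deps} - set(dag)
    if unknown:
        raise ValueError(f"unknown dependencies: {sorted(unknown)}")
    return dag


def execution_order(dag):
    # static_order() raises graphlib.CycleError if the spec contains a cycle.
    return list(TopologicalSorter(dag).static_order())


# Illustrative spec: features feed training, training feeds the backtest.
spec = {
    "steps": [
        {"name": "features"},
        {"name": "train", "after": ["features"]},
        {"name": "backtest", "after": ["train"]},
    ]
}
order = execution_order(compile_to_dag(spec))
```

In a real deployment the ordered steps would be handed to an orchestrator such as Argo or Ray rather than run in-process. The sketch only shows the validation and ordering stage.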
3. Team C — Execution Layer / Trading Engine
Keywords: OMS · router · risk · exchange gateway
Responsibilities:
- Own execution, order placement, routing, and risk control for live trading signals;
- Implement the closed loop of intent → order → execution report;
- Implement Simulation / Paper trading;
- Provide a unified execution API to Orchestration;
- Support multiple asset classes (crypto, equities, futures, options).
Required skills:
- Async Python / Rust / Go;
- Redis, ClickHouse;
- WebSocket / FIX / REST interfaces;
- Quant trading experience, pre/post-trade logic.
Security considerations:
- Vault-managed exchange credentials;
- Pre-trade compliance & OPA rules;
- Immutable audit trail;
- Throttling & risk guard.
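A minimal sketch of the intent → order → execution report loop, with a pre-trade risk guard, is shown below. All class names (`Intent`, `Order`, `ExecutionReport`, `PaperEngine`) and the single quantity limit are assumptions made for illustration. A real engine would route to an exchange gateway and apply far richer checks.

```python
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class Intent:
    symbol: str
    side: str        # "buy" or "sell"
    quantity: float


@dataclass(frozen=True)
class Order:
    order_id: int
    intent: Intent


@dataclass(frozen=True)
class ExecutionReport:
    order_id: int
    status: str      # "filled" or "rejected"
    reason: str = ""


class PaperEngine:
    """Hypothetical paper-trading engine: fills instantly unless the guard rejects."""

    def __init__(self, max_quantity):
        self.max_quantity = max_quantity
        self._ids = count(1)
        self.audit_trail = []  # appended to on every submission, never edited

    def submit(self, intent):
        order = Order(next(self._ids), intent)
        # Pre-trade risk guard: reject before anything reaches a gateway.
        if intent.quantity <= 0 or intent.quantity > self.max_quantity:
            report = ExecutionReport(order.order_id, "rejected", "quantity outside risk limit")
        else:
            report = ExecutionReport(order.order_id, "filled")
        self.audit_trail.append((order, report))
        return report
```

Every submission produces exactly one report and one audit entry, including rejections, which is what closes the loop for the caller.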
4. Team D — Governance & Observability
Keywords: lineage · metadata · audit · metrics
Responsibilities:
- Unify system-wide metadata, lineage, and audit trail;
- Integrate OpenLineage + OpenMetadata;
- Manage the model registry and dataset registry;
- Metrics monitoring (Prometheus / Grafana / Loki / Tempo).
Required skills:
- Python / SQL / YAML;
- Prometheus, OpenTelemetry, ClickHouse;
- Data catalog / MLflow / OpenMetadata integration.
Security considerations:
- RBAC access to metadata;
- Immutable logs;
- Alerting policy on data quality & lineage integrity.
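One common way to approximate the immutable logs item is a hash chain, where each entry commits to the one before it. The sketch below assumes a hypothetical `AuditLog` class. It makes tampering detectable rather than impossible, so real immutability would still rely on write-once storage.

```python
import hashlib
import json


class AuditLog:
    """Hypothetical hash-chained log: each entry stores the hash of its predecessor."""

    GENESIS = "0" * 64

    def __init__(self):
        self._entries = []

    def append(self, event):
        prev_hash = self._entries[-1]["hash"] if self._entries else self.GENESIS
        payload = json.dumps(event, sort_keys=True)
        digest = hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()
        self._entries.append({"event": payload, "prev": prev_hash, "hash": digest})
        return digest

    def verify(self):
        # Recompute the chain from the start; any edited entry breaks it.
        prev_hash = self.GENESIS
        for entry in self._entries:
            expected = hashlib.sha256((prev_hash + entry["event"]).encode("utf-8")).hexdigest()
            if entry["prev"] != prev_hash or entry["hash"] != expected:
                return False
            prev_hash = entry["hash"]
        return True
```

A periodic `verify()` run could feed the alerting policy on lineage integrity listed above.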
5. Team E — AI Agent & Strategy Studio
Keywords: LLM integration · experiment automation · meta-learning
Responsibilities:
- Build the Agent research environment (DSL → compile → run → analyze);
- Own agentic strategy generation, evaluation, and optimization;
- Train the meta-controller (strategy selection, reward shaping);
- Establish the “AI Quant Researcher” pipeline.
Required skills:
- Python, PyTorch / JAX;
- LLM orchestration frameworks (LangChain, OpenDevin, AutoGen);
- RL / bandit optimization;
- Familiarity with quant evaluation metrics (IC, IR, Sortino, drawdown).
Security considerations:
- Sandboxed execution;
- Isolation of experiment results;
- Prevent agents from injecting into the execution layer;
- API permission gating.
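Two of the evaluation metrics named above, maximum drawdown and the Sortino ratio, are short enough to sketch directly. The conventions here are assumptions: returns are per-period and not annualized, and downside deviation is averaged over all periods rather than only the losing ones. Other conventions exist and give different numbers.

```python
import math


def max_drawdown(equity):
    """Largest peak-to-trough decline of an equity curve, as a fraction of the peak."""
    peak = equity[0]
    worst = 0.0
    for value in equity:
        peak = max(peak, value)
        worst = max(worst, (peak - value) / peak)
    return worst


def sortino_ratio(returns, target=0.0):
    """Mean excess return divided by downside deviation (per period, not annualized)."""
    excess = [r - target for r in returns]
    downside = math.sqrt(sum(min(0.0, e) ** 2 for e in excess) / len(excess))
    if downside == 0.0:
        # No period fell below the target, so the ratio is undefined;
        # this sketch reports infinity by convention.
        return math.inf
    return (sum(excess) / len(excess)) / downside
```

For example, an equity curve of 100, 120, 90, 130 has a maximum drawdown of 25%, measured from the peak at 120 to the trough at 90.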
6. Team F — Infra & DevOps Platform
Keywords: Terraform · K8s · CI/CD · monitoring
Responsibilities:
- Maintain all environments (dev / staging / prod);
- Manage Kubernetes clusters, ArgoCD, Vault, networking;
- Build the CI/CD pipeline (build → test → deploy);
- Unify observability & incident response.
Required skills:
- Terraform, Helm, ArgoCD;
- AWS / GCP / on-prem;
- GitHub Actions / Jenkins;
- Network / security ops knowledge.
Security considerations:
- Network segmentation;
- Secrets lifecycle;
- Backup / DR;
- Audit of infrastructure changes.
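The build → test → deploy pipeline has one property worth stating explicitly: a failed stage must stop everything after it. The sketch below assumes a hypothetical `run_pipeline` helper that takes plain Python callables. In practice the stages would be GitHub Actions or Jenkins jobs rather than functions.

```python
def run_pipeline(stages):
    """Run (name, callable) stages in order and stop at the first failure.

    Returns the names of the completed stages and the name of the failing
    stage, or None if every stage succeeded.
    """
    completed = []
    for name, stage in stages:
        if not stage():
            return completed, name
        completed.append(name)
    return completed, None


# Illustrative run: the test stage fails, so deploy is never invoked.
result = run_pipeline([
    ("build", lambda: True),
    ("test", lambda: False),
    ("deploy", lambda: True),
])
```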
7. Team G — Security & Compliance
Keywords: Vault · OPA · KMS · audit
Responsibilities:
- Manage system keys and credentials;
- Define OPA policy rules;
- Maintain compliance standards (internal, exchange, jurisdiction);
- Conduct periodic security audits.
Required skills:
- OPA / Rego / Vault / KMS;
- Familiarity with regulatory compliance (crypto + finance);
- Secure configuration pipelines.
Security considerations:
- Policy-based enforcement;
- Credential rotation;
- Zero-trust network access;
- Regular penetration testing.
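Credential rotation is easier to enforce when overdue credentials can be listed mechanically. The sketch below assumes a hypothetical mapping from credential name to issue time. The 90-day window in the example is illustrative, not a stated QuantOS policy, and a real check would read issue times from Vault or KMS metadata.

```python
from datetime import datetime, timedelta, timezone


def credentials_due_for_rotation(issued_at, max_age, now=None):
    """Return the sorted names of credentials whose age has reached max_age."""
    now = now or datetime.now(timezone.utc)
    return sorted(name for name, issued in issued_at.items() if now - issued >= max_age)


# Illustrative inventory with timezone-aware issue times.
issued_at = {
    "exchange_api_key": datetime(2025, 7, 1, tzinfo=timezone.utc),
    "db_password": datetime(2025, 10, 1, tzinfo=timezone.utc),
}
due = credentials_due_for_rotation(
    issued_at,
    max_age=timedelta(days=90),
    now=datetime(2025, 10, 30, tzinfo=timezone.utc),
)
```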
8. Team H — Product & System Integration
Keywords: roadmap · UX · docs · vision alignment
Responsibilities:
- Align technology with business goals;
- Manage the system-level roadmap;
- Maintain Notion / docs / diagrams;
- Organize cross-team reviews;
- Act as the external interface (potential partners, investors).
Required skills:
- System thinking;
- Technical writing;
- Product sense + storytelling;
- Understanding of trading and AI strategy lifecycle.
Security considerations:
- Tiered information classification;
- Document redaction;
- Approval process for external-facing communication.