Agent-driven market data interpretation pipeline · feed → LLM → signal → trading system
On this page
- TL;DR
- Wiki route
- Four-stage pipeline · stage-by-stage
- Stage 1 · Ingest
- Stage 2 · Interpret · LLM ingestion
- Stage 3 · Route · signal → trading system
- Stage 4 · Execute · deterministic algo + pre-trade controls
- Real-world deployments · 2026-05 public disclosures
- Goldman Marquee + Marquee AI
- JPMorgan IndexGPT / SpectrumGPT
- BlackRock Aladdin Copilot
- Morgan Stanley AI @ Morgan Stanley
- Bloomberg Terminal AI · BloombergGPT productized
- Renaissance / Two Sigma / Citadel internal AI research
- Japan-specific · Mizuho / SMBC / MUFG internal LLMs
- Hallucination control · five-layer defense
- Audit trail requirement · compliance composition
- Latency tier · what LLM can / cannot do
- Composition with agent-economy stack
- Sources
- Related
TL;DR
The 2026 agent-driven market data pipeline is a four-stage compound: (1) ingest raw market data from Bloomberg, Reuters / Refinitiv, Nikkei, exchange direct feeds, regulator releases, corporate filings; (2) interpret via LLM (BloombergGPT, JPM internal LLM, Anthropic Claude or OpenAI GPT through licensed deployment) to produce structured features (sentiment, event-extraction, summary, scenario); (3) route the structured features as signals into downstream trading / risk / portfolio systems; (4) execute through deterministic algorithms governed by pre-trade controls. Real-world deployments include Goldman Marquee + Marquee AI, JPMorgan IndexGPT / SpectrumGPT, BlackRock Aladdin Copilot, Morgan Stanley AI @ MS for FA-side use, and BloombergGPT productized via Bloomberg Terminal AI. The two failure modes that dominate operational design are hallucination (LLM generates a “fact” that does not appear in the source) and stale-state interpretation (LLM responds based on training data rather than current feed). Mitigations are layered: retrieval-augmented generation (RAG) against the actual feed, structured-output constraints, citation-required outputs, model-card + version stamping, prompt + output logging for compliance audit. The audit-trail requirement traces back to SR 11-7 (US Fed model risk management), MIFID-II RTS 6 (EU algo governance), FSA AI principles (Japan), and FCA AI in financial services guidance — all of which require the deployer to reproduce any AI-driven decision in post-hoc supervisory review.
Wiki route
This entry sits under agent-economy index. Read it against LLM agent finance application overview for the broader application landscape, AI-driven trading regulation Japan 2026 for the regulator overlay that constrains how this pipeline is governed, agent custody and authorization framework for the spend / authorization granularity on the execution side, agent identity DeFi and TradFi bridge for the identity attestation of pipeline outputs, and agent payment protocol four-way comparison for the settlement layer. For market-data infrastructure context see Japan market maker and liquidity provider landscape and CEX API SDK ecosystem comparison.
Four-stage pipeline · stage-by-stage
Stage 1 · Ingest
Sources (2026-05 production patterns):
- Bloomberg Terminal feed + B-PIPE (Bloomberg Professional Information Pipeline) — most G-SIBs license the feed for ~$25K/year/seat plus B-PIPE for programmatic access. The feed includes news, prices, fundamentals, regulatory disclosures, corporate-action notifications.
- Reuters Eikon / Refinitiv Workspace (now LSEG Workspace post-LSE acquisition) — peer to Bloomberg with stronger Western news + macro feed
- Nikkei feed (Nikkei QUICK) — Japan-specific corporate disclosure + JGB / equity / FX news
- Exchange direct feeds — TSE, OSE, NYSE, NASDAQ, CME, LSE direct binary feeds via ITCH / OUCH protocols; latency-sensitive HFT path
- Regulator releases — EDGAR (SEC), TDnet (JPX), Companies House (UK), AMF (FR), MAS / BaFin / ASIC releases; usually scraped or via licensed redistributor
- Corporate IR releases — corporate press release wires (BusinessWire, PR Newswire, KYODO, JIJI in Japan); LLM-friendly because text-rich
- Alternative data — satellite (Planet Labs), credit card spend (Yodlee, Plaid), web scraping (compliant), ESG (MSCI, Sustainalytics)
Why ingest is non-trivial: each source has its own licensing, latency, schema, and access pattern. A production pipeline must normalize all of them into a single structured event stream before feeding the LLM. The 2025-2026 reference architecture uses Kafka / Kinesis / Pub/Sub at the bus layer with Avro / Protobuf schemas per source type.
Stage 2 · Interpret · LLM ingestion
The LLM options (2026-05):
| LLM | Provenance | Finance-tuned? | Used by (public) |
|---|---|---|---|
| BloombergGPT | Bloomberg in-house, 50B params, trained on 40+ years Bloomberg finance corpus | Yes (finance-only training) | Bloomberg Terminal AI surfaces (productized 2024-2025); not licensed externally |
| Anthropic Claude (Opus / Sonnet) | Anthropic frontier model | No (general), but fine-tuneable | BBVA, Mizuho, Goldman dev tooling; ad-hoc HF research |
| OpenAI GPT-4o / GPT-5 | OpenAI frontier model | No (general), but fine-tuneable | Morgan Stanley AI assistant, JPM SpectrumGPT (variant), BofA pilot |
| JPM internal LLM | JPMorgan in-house, trained on internal corpus | Yes | JPM IndexGPT, SpectrumGPT |
| Google Gemini Pro / Ultra | Google frontier model | No, but Vertex AI tuning | Citi pilot disclosed |
| Cohere Command R+ | Cohere general model | Specialized for RAG / search | BlackRock Aladdin Copilot adjacent |
| NEC cotomi / NTT tsuzumi / PFN PLaMo | Japan domestic models | Some finance specialization | Mizuho / MUFG / SMBC internal pilots |
Why finance-tuned vs general matters: a general LLM trained on web data is weaker at parsing 10-Q / 有価証券報告書 / IFRS financial statements than a model fine-tuned on millions of such documents. BloombergGPT’s 2023 publication (arxiv.org/abs/2303.17564) demonstrated material outperformance on finance-specific benchmarks vs general LLMs at the same parameter count. The trade-off: BloombergGPT is not externally licensed; firms wanting equivalent capability must either license Bloomberg’s productized Terminal AI surfaces or fine-tune a general model on their own finance corpus.
Interpretation operations (what the LLM is asked to do at this stage):
- Event extraction — “extract M&A announcements from this news flow”
- Sentiment scoring — “sentiment of this earnings call transcript on a -1 to +1 scale”
- Summary — “summarize the key changes in this 10-K vs prior period”
- Q&A retrieval — “what did the CFO say about FX hedging?” against a corpus
- Scenario generation — “given this central bank statement, generate three plausible market reactions”
- Structured-data extraction — “extract net income, revenue, EPS from this earnings press release”
Stage 3 · Route · signal → trading system
The LLM output is a structured signal, not a trading order. The signal carries the LLM’s interpretation as a typed feature that feeds:
- Discretionary PMs / traders — signal surfaces in Bloomberg Terminal / internal portfolio screen as a recommendation with confidence + citations
- Systematic strategies — signal becomes a feature in a multi-factor model (e.g. “news_sentiment_score” combined with traditional momentum / value factors)
- Risk management — signal flags scenarios that change the portfolio’s tail-risk estimate
- Order routing — for execution algo, signal influences which child-order tactic to use (e.g. heightened sentiment volatility → more conservative TWAP)
Critical design choice: the LLM does NOT directly emit trading orders. It emits structured features that downstream systems consume under their own risk controls. This boundary is what keeps the pipeline within MIFID-II Article 17 + RTS 6 + FIEA Article 38-2 + Reg SCI compliance.
Stage 4 · Execute · deterministic algo + pre-trade controls
The execution layer is unchanged from non-AI algo trading. Pre-trade risk controls (price collar, size cap, repeated-order velocity, account-level exposure), kill-switch, post-trade surveillance — all apply identically. The AI signal flows in as one of many inputs to the execution algo; the algo’s decision logic is deterministic and auditable.
This is the key safety property of the production architecture: AI is on the idea-generation side; deterministic systems are on the execution side. A regulator examining a trade can trace the order back through the execution algo’s deterministic logs, and separately examine the AI’s signal-generation logs.
Real-world deployments · 2026-05 public disclosures
Goldman Marquee + Marquee AI
Goldman’s institutional client portal Marquee has integrated LLM-driven analytics layered over Goldman’s proprietary market data + research corpus. Marquee AI provides PMs with conversational interface for research retrieval, scenario simulation, and trade idea generation. Public disclosure has emphasized PM-facing use, not direct execution; trades that result still go through Goldman’s execution algo with full risk controls.
JPMorgan IndexGPT / SpectrumGPT
IndexGPT (launched 2024-2025): LLM-driven thematic basket construction. User specifies a theme in natural language (“AI infrastructure exposure with low correlation to large-cap tech”); IndexGPT proposes constituent weights based on JPM’s internal LLM + structured-feature engine. The output is a basket recommendation; users decide whether to trade it.
SpectrumGPT (compliance / research-side): LLM for compliance document review, research surfacing, and trade rationale documentation. Internal-facing; not customer-facing.
BlackRock Aladdin Copilot
Aladdin is BlackRock’s portfolio risk and analytics platform used internally and licensed to ~$21T of assets across institutional clients. Aladdin Copilot layers Cohere-powered conversational AI on top, providing PMs with natural-language access to portfolio state, risk analytics, and what-if scenarios. PM-facing; does not emit trades directly.
Morgan Stanley AI @ Morgan Stanley
OpenAI-powered FA-facing assistant for retrieving research from MS’s 100K+ document corpus, drafting client communications, and answering procedural questions. FA reviews and approves before any communication goes to client. Not a market-data interpretation pipeline per se but the largest publicly disclosed LLM-in-finance deployment by transaction count (FA queries per day).
Bloomberg Terminal AI · BloombergGPT productized
BloombergGPT was research-published 2023. By 2024-2025, Bloomberg productized LLM-driven surfaces in the Terminal: natural-language search across feed, summary of long news articles, Q&A against earnings transcripts, sentiment-tagged news flow. Terminal users access these as features rather than calling the LLM directly.
Renaissance / Two Sigma / Citadel internal AI research
Not publicly disclosed in detail. Public hiring patterns + paper publications indicate substantial in-house LLM and ML research targeting market data interpretation. Production-trading impact is opaque by design (alpha-generating firms don’t disclose).
Japan-specific · Mizuho / SMBC / MUFG internal LLMs
Mizuho M-AI Insight, SMBC GAI, MUFG internal AI — all deployed at scale for internal use across compliance, research, and customer-service-support. Public disclosures emphasize internal use; customer-facing financial decisions remain gated through human reviewers.
Hallucination control · five-layer defense
LLMs reliably hallucinate. In finance, a single hallucinated number can produce a real loss. The 2026 production defense layers:
- Retrieval-Augmented Generation (RAG) — the LLM is given the actual source documents (news article, filing, transcript) as context; output is conditioned on the retrieved text. Reduces “from-memory” hallucination but doesn’t eliminate it.
- Structured output constraints — require LLM to output JSON conforming to a schema (e.g.
{"event_type": "M&A", "acquirer": str, "target": str, "value_usd": float, "citation": str}). Format constraint reduces free-text fabrication. - Citation requirement — require LLM to include a span citation (
"citation": "Reuters article ID 12345, paragraph 3") so downstream systems can verify the claim against source. If citation doesn’t resolve, the output is rejected. - Confidence threshold — discard outputs below a calibrated confidence (using log-prob or self-consistency sampling). Routes uncertain cases to human review.
- Cross-validation — run multiple LLMs against the same input and compare; disagreement triggers human review.
No single layer is sufficient. Production systems compose 3-5 of these. The audit trail logs every layer’s verdict for the regulator to examine.
Audit trail requirement · compliance composition
The audit trail must reproduce the LLM-driven decision in post-hoc supervisory review. The 2026 reference fields:
| Field | Why |
|---|---|
| Input data hash + timestamp | Reproduce the inputs that led to the decision |
| Model identifier + version + checkpoint | Reproduce the exact model that produced the output |
| Full prompt + system prompt | Reproduce the LLM call |
| Full output text | Show what the LLM said |
| Structured-output validation result | Show whether output conformed to schema |
| Citation validation result | Show whether claims trace to source |
| Confidence score / log-prob | Show calibrated certainty |
| Downstream consumer system + decision | Show what was done with the LLM output |
| Final trade / signal / recommendation | Show end-to-end traceability |
| Human review event (if any) | Show whether HITL was triggered |
The logging cost is non-trivial: a high-volume pipeline can generate terabytes/day of audit logs. The 2026 reference architecture uses tiered storage (hot: 30 days in Postgres / TimescaleDB; warm: 1 year in S3 / GCS; cold: 7 years in Glacier / Archive Storage) to satisfy retention requirements (SR 11-7: 5-7 years typical; MIFID-II RTS 6: 5 years; FIEA: 10 years for some records).
This composition is the operational realization of the deployer-accountability principle in agent legal and tax liability framework and the regulatory frame in AI-driven trading regulation Japan 2026.
Latency tier · what LLM can / cannot do
| Latency tier | LLM applicable? | Use case |
|---|---|---|
| HFT / microsecond (binary feeds → orders) | No (LLM inference is 100ms-10s) | LLM cannot live in this loop |
| Intraday quant (seconds-to-minutes) | Yes (for signal generation, not direct execution) | Sentiment / event-extraction signals |
| Day-trading / swing (minutes-to-hours) | Yes | Earnings call interpretation, news flow analysis |
| Position / portfolio management (hours-to-days) | Yes | Research synthesis, scenario simulation, portfolio rebalancing recommendations |
| Strategy research / backtest (offline) | Yes (heavy use) | Generating hypotheses, summarizing literature, drafting strategy documentation |
The 2026 production reality: LLMs are concentrated in the research / portfolio / signal-generation tiers, with deterministic systems handling the execution tier. Crossing the boundary (LLM directly issuing orders without deterministic algo) is technically possible but regulatorily and operationally avoided.
Composition with agent-economy stack
This pipeline is the upstream signal-generation side of the agent-economy. The composition:
- Pipeline (this entry) — produces structured signals from market data
- Application — LLM agent finance application overview (c) trading and execution category consumes signals
- Identity / attestation — agent identity DeFi / TradFi bridge for attesting pipeline outputs
- Custody / authorization — agent custody and authorization framework for spend / scope on the execution side
- Regulator overlay — AI-driven trading regulation Japan 2026
- Liability — agent legal and tax liability framework
Sources
- BloombergGPT research paper (arxiv.org/abs/2303.17564)
- Bloomberg Terminal AI product announcements (bloomberg.com/professional/blog and /products/ai)
- Reuters / Refinitiv (now LSEG Workspace) product pages (reuters.com / refinitiv.com)
- Nikkei feed and IR releases (nikkei.com/info/en)
- Goldman Marquee + Marquee AI (goldmansachs.com/marquee)
- JPMorgan IndexGPT public press (jpmorgan.com/technology/news/indexgpt)
- Morgan Stanley AI @ MS press releases (morganstanley.com)
- BlackRock Aladdin (blackrock.com/aladdin)
- US Federal Reserve SR 11-7 model risk management (federalreserve.gov)
- BIS WP 1194 (bis.org)
- Japan FSA news (fsa.go.jp/en/news)
- UK FCA discussion papers on AI (fca.org.uk)
- ESMA documents (esma.europa.eu)
Related
- Wiki Index
- agent-economy index
- LLM agent finance application overview
- AI-driven trading regulation Japan 2026
- agent custody and authorization framework
- agent identity DeFi and TradFi bridge
- agent legal and tax liability framework
- agent payment protocol four-way comparison
- Claude Code extension architecture
- Japan market maker and liquidity provider landscape
- CEX API SDK ecosystem comparison
- derivatives INDEX
- fintech INDEX