Ship What's Built.
Research as We Go.

The reputation scoring infrastructure is code-complete with 630+ tests passing. The gap is deployment, not development. This roadmap ships a live reputation API while continuing Layer 1 and Layer 2 research in parallel.

Current State

What's already built and tested

Roadmap stages: Code Complete (630+ tests), Deploy, Research, Growth

Tracked stats: production contracts, tests passing (630+), oracle source files, test files, live deployments

API / Plugin Layer: CrewAI, LangChain, REST (NOT BUILT YET)

Layer 3: Identity Anchor (Shyft KYC): ReputationEngine + ShyftGatedResolver (BUILT, runs against mocks)

Layer 2: Citation Reputation (PageRank): PageRankOracle + CitationCounters + Oracle Service (BUILT & TESTED)

Layer 1: Device Liveness (QVAC + TEE): NOT STARTED, parallel research track

Stable Chain (ID 988): MCP server installed, wallet created, needs funding

Contract Inventory

Contract	Purpose	Status
PageRankOracle	On-chain score storage. Batch submissions, delta clamping, staleness cutoff, composite scoring.	Complete
ReputationEngine	Bot registry. Self-register (no KYC) or verified register via Trust Anchor (KYC). Proxies to PageRankOracle for scores.	Complete
ShyftGatedResolver	EAS attestation resolver. Validates Trust Anchor, citation attribution, Shyft membership, collects fees.	Complete
CitationCounters	Append-only citation tallies. Received, given, pair counts. Immutable authorized caller.	Complete
RMTToken	ERC-20 for registration fees and citation fee redistribution. MINTER_ROLE gated.	Complete
DomainRegistry + DomainFactory	Multi-domain reputation. Each domain gets its own resolver + counters. Domain 0 = global RMT.	Complete
ERC8004Bridge	Agent identity NFT integration. Maps feedback scores to 5-level trust ladder.	Complete
MachineConsentHelper	Aux signer management for Shyft machine consent protocol. ERC-165 compatible.	Complete
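
The delta clamping and staleness cutoff listed for PageRankOracle can be illustrated off-chain. The following is a minimal Python sketch, not the contract's logic: it assumes the clamp limits how far a score may move in a single submission and that a stored score expires after a fixed age. MAX_DELTA_BP, STALENESS_SECONDS, and both function names are hypothetical values chosen for illustration.

```python
MAX_SCORE_BP = 10_000               # scores are normalized to 0-10000 basis points
MAX_DELTA_BP = 500                  # hypothetical per-update movement limit
STALENESS_SECONDS = 7 * 24 * 3600   # hypothetical staleness cutoff (7 days)


def clamp_score(previous_bp: int, proposed_bp: int,
                max_delta_bp: int = MAX_DELTA_BP) -> int:
    """Limit how far a score may move from its previous value in one update."""
    low = max(0, previous_bp - max_delta_bp)
    high = min(MAX_SCORE_BP, previous_bp + max_delta_bp)
    return min(max(proposed_bp, low), high)


def is_stale(updated_at: int, now: int,
             cutoff: int = STALENESS_SECONDS) -> bool:
    """Treat a score as expired once it is older than the cutoff."""
    return now - updated_at > cutoff
```

Under these assumptions, a single oracle run cannot move an agent from 1000 BP to 5000 BP; it takes several consecutive runs, which bounds the damage of one bad batch.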

Oracle Service

Component	What It Does	Status
CitationFetcher	Reads EAS Attested events + ERC-8004 feedback from chain	Complete
OffChainCitationStore	EIP-712 signed off-chain citations, merged into graph	Complete
PageRank Engine	Power iteration, alpha=0.85, L1 convergence at 0.0001, 0-10000 BP normalization	Complete
Sybil Detection	Reciprocal penalties, star topology, carousel detection (monitoring only)	Complete
Score Submitter	Batch submit to PageRankOracle with delta clamping	Complete
Gelato Integration	Web3 Functions wrapper for automated oracle runs	Written, untested live
Health Endpoint	Oracle health monitoring	Complete
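
The PageRank Engine row above (power iteration, alpha=0.85, L1 convergence at 0.0001, 0-10000 BP normalization) can be sketched in a few lines of Python. This is an illustrative sketch rather than the oracle's code. Two details are assumptions: agents with no outgoing citations spread their rank uniformly, and normalization scales the top-ranked agent to 10000 BP. The function names are hypothetical.

```python
ALPHA = 0.85        # damping factor
TOLERANCE = 1e-4    # L1 convergence threshold
MAX_BP = 10_000     # top of the basis-point scale


def pagerank(edges, alpha=ALPHA, tol=TOLERANCE, max_iter=100):
    """Power-iteration PageRank over (citer, cited) pairs."""
    edges = list(edges)
    nodes = sorted({node for edge in edges for node in edge})
    if not nodes:
        return {}
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(max_iter):
        # Assumption: rank held by nodes without outgoing citations
        # is redistributed uniformly across all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        base = (1.0 - alpha) / n + alpha * dangling / n
        nxt = {node: base for node in nodes}
        for src, targets in out_links.items():
            if not targets:
                continue
            share = alpha * rank[src] / len(targets)
            for dst in targets:
                nxt[dst] += share
        # L1 distance between successive rank vectors.
        delta = sum(abs(nxt[node] - rank[node]) for node in nodes)
        rank = nxt
        if delta < tol:
            break
    return rank


def to_basis_points(rank):
    """Assumption: scale so the highest-ranked node receives MAX_BP."""
    if not rank:
        return {}
    scale = MAX_BP / max(rank.values())
    return {node: round(value * scale) for node, value in rank.items()}
```

For example, `to_basis_points(pagerank([("a", "c"), ("b", "c"), ("c", "a")]))` ranks `c` highest, because it is cited by both other agents, and `b` lowest, because nobody cites it.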

Deployment Plan

From local tests to live reputation API

Four phases. Each is independently shippable. Research runs in parallel throughout.

Phase A — Week 1

Deploy Standalone (No Shyft Dependency)

Deploy RMT contracts to Stable chain with mock Shyft contracts
Self-registration only (any agent can register, no KYC required)
Oracle service running — computes PageRank, submits scores on-chain
Anyone can query getReputation(agentAddr) on-chain
Fund Stable wallet with USDT0 for deployment gas

Dependency: Stable wallet funding only

What this gives us: A live, queryable reputation score for any registered agent. No KYC needed. Scores are on-chain and verifiable. This is the minimum viable product.

Phase B — Weeks 2-3

API Layer + Plugin

REST API wrapping on-chain queries: GET /score/{agent}
CrewAI plugin: get_trust_score(agent_id)
LangChain tool definition
MCP tool integration (query scores from Claude Code)
Simple documentation site

Dependency: Phase A deployed
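
The GET /score/{agent} endpoint could be shaped roughly as follows. This is a framework-free Python sketch under stated assumptions: `read_reputation` is a caller-supplied function standing in for the on-chain getReputation(agentAddr) call, the score is assumed to be an integer in basis points, and the JSON field names and error messages are hypothetical.

```python
import json
import re

ADDRESS_RE = re.compile(r"0x[0-9a-fA-F]{40}")   # 20-byte hex address
ROUTE_RE = re.compile(r"/score/([^/]+)")


def handle_score_request(path, read_reputation):
    """Return (http_status, json_body) for a GET request on `path`.

    `read_reputation` is a stand-in for the on-chain
    getReputation(agentAddr) query (hypothetical signature:
    address string in, integer basis points out).
    """
    match = ROUTE_RE.fullmatch(path)
    if match is None:
        return 404, json.dumps({"error": "not found"})
    agent = match.group(1)
    if ADDRESS_RE.fullmatch(agent) is None:
        return 400, json.dumps({"error": "invalid agent address"})
    score = read_reputation(agent)
    return 200, json.dumps({"agent": agent, "score_bp": score})
```

Keeping the chain read behind an injected function means the same handler can serve the REST API, the CrewAI plugin, and the LangChain tool, and can be tested without a live RPC endpoint.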

Phase C — Weeks 3-5

KYC Opt-In (Shyft Integration)

Deploy against real Shyft testnet contracts
Trust Anchor registration path live (verified bots)
Verified agents get a higher seed score (2000 BP vs 500 BP for self-registered agents)
ShyftGatedResolver enforcing citation attribution
Premium tier: KYC-verified scores via Shyft trust channels

Dependency: Shyft testnet addresses + trust channel setup

Phase D — Weeks 5-8

Beachhead Integration

One DeFi protocol integration (free case study)
Gelato automation live (oracle runs autonomously)
EAS schema registered on production EAS
On-chain verifiable case study results
Multi-domain support if needed (DomainFactory)

Dependency: Phase B + partner identified

Timeline

Parallel tracks — deployment + research

Track	Milestones in order (Week 1 through Weeks 7-8)
Deploy	Stable deploy, API + plugins, Shyft integration, Gelato live
L2 Research	Re-tune alpha, Elliptic++ test, Red team, Adversarial optimization, Feed findings back
L1 Research	QVAC PoC (iOS TEE), Sensor benchmarks, ZK threshold proto
BD / GTM	Identify DeFi partner, Free case study, Measure + publish results

Key principle: Deployment and research are independent tracks. We don't wait for research to finish before shipping. Live deployment generates real data that feeds research. Research findings get incorporated into the live system as updates, not blockers.

Research Tracks (Parallel)

Research feeds into the live system

Each research finding becomes an update to the deployed system, not a prerequisite for it.

Layer 2 — Active

PageRank Re-Tuning

Re-run validation with alpha=0.85 (not 0.6)
Incorporate negative ratings from Bitcoin datasets
Test on Elliptic++ (822K nodes)
Adversarial red team: cheapest successful attack
Product utility: binary vs continuous scores

Feeds into: Oracle config update (alpha value, scoring thresholds). Zero contract changes needed — alpha is read from contract storage.

Layer 1 — Research Phase

Device Liveness (QVAC + TEE)

iOS Secure Enclave PoC: sensor processing
Battery + thermal benchmarks
Score variance: humans vs simulated
Basic spoofing resistance tests
ZK threshold proof prototype

Feeds into: New attestation type. Agents can optionally submit device liveness proof to boost trust score. Additive — doesn't change existing contracts.

Ongoing

Live System Learning

Monitor real citation patterns
Detect novel sybil strategies
Measure synthetic vs real gap
Calibrate sybil penalties with real data
Track score distribution evolution

Feeds into: Oracle parameter updates, new detection heuristics, scoring threshold adjustments. The live system is the experiment.

Research Decision Gates

Gate	Fails When	What Failure Means	Impact on Live System
L2 Real Data	Spearman correlation < 0.4 on Elliptic++	PageRank needs fundamental rework	Oracle pauses score updates, falls back to citation count
L1 Signal Strength	Human vs simulated variance < 2 std dev	Device liveness approach doesn't work	None — live system continues with L2 + L3 only
L1 ZK Feasibility	Proof generation > 60s on flagship	UX unacceptable for mobile	None — device liveness deferred
Plugin Demand	< 50 installs in 30 days	Demand signal too weak	Pivot API to compliance wedge (enterprise)

Critical: Research gates can kill research tracks, not the live system. The deployed reputation API (L2 + optional L3) is viable on its own. Layer 1 and parameter re-tuning are improvements, not requirements.
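
The L2 Real Data gate in the table above can be expressed as a small check. The sketch below computes Spearman rank correlation in plain Python (average ranks for ties, then Pearson correlation on the ranks) and picks a scoring mode from the 0.4 threshold. What the PageRank scores are correlated against is not specified in this roadmap, so the inputs are assumptions, as are the function names and the mode labels.

```python
import math

SPEARMAN_GATE = 0.4   # threshold from the L2 Real Data gate


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences of at least 2 items")
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def scoring_mode(rho, threshold=SPEARMAN_GATE):
    """Below the gate, fall back to citation-count scoring."""
    return "pagerank" if rho >= threshold else "citation_count"
```

A result of `scoring_mode(0.39)` selects the citation-count fallback, while `scoring_mode(0.4)` keeps PageRank, matching the "< 0.4" failure condition.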

Milestones

Key checkpoints

First Live Score

An agent self-registers on Stable chain and receives a queryable PageRank reputation score. End of Week 1.

First External Query

Someone outside our team queries an agent's trust score via API or plugin. End of Week 3.

First KYC-Verified Agent

An agent registers via a Shyft Trust Anchor and receives the verified seed score (2000 BP vs 500 BP). End of Week 5.

The Stripe Moment

One DeFi protocol integrates trust scoring and publishes measurable results on-chain. End of Week 8.

What Success Looks Like

Live score on Stable, then external API query, then KYC-verified agent, then production integration.

Revenue Milestones

Tier	Price	Includes
Free	$0	Self-registered agents, 1000 queries/month, basic PageRank score
Growth	$200-2K/mo	Full API + citation analytics + webhook alerts
Enterprise	$24K+/yr	Compliance exports + audit trails + SLA + KYC-verified scoring

Risk Register

What could go wrong

Risk	Likelihood	Impact	Mitigation
PageRank fails on real adversarial data	Medium	High	Fallback to citation-count scoring. Re-tune alpha. Collect real data before committing to complex scoring.
Stable chain instability / low adoption	Medium	Medium	Contracts are chain-agnostic. Redeploy to any EVM chain in hours.
No demand for agent reputation	Low	High	Plugin install count is the early signal. Kill gate at 30 days. Pivot to compliance if demand is weak.
Stripe/Tempo builds reputation into MPP	Low	High	Speed. Get integrated before they build it. First-mover in the trust layer gap.
Shyft testnet unavailable	Medium	Low	Phase A deploys without Shyft. KYC is opt-in Phase C, not a blocker.
Oracle centralization concerns	Certain	Low (initially)	Single operator is acceptable for testnet/early production. Decentralization is a Phase D+ concern.

Immediate Next Steps

Week 1 Checklist

Deploy Track

Fund Stable wallet (small amount, $10-20 USDT0)
Configure Hardhat for Stable chain (RPC + chain ID 988)
Run deploy-rmt-unified.js against Stable
Register EAS schema on Stable's EAS (if available) or deploy MockEAS
Start oracle service pointing at Stable
Register first test agent, submit first score
Verify on-chain: getReputation(testAgent) returns score

Research Track

Re-run validate_real_data.py with alpha=0.85
Incorporate negative ratings from Bitcoin datasets
Document Spearman correlation improvement
Download Elliptic++ dataset for Phase 2 validation
Begin QVAC iOS PoC scoping (what sensors, what TEE API)
