Ship What's Built.
Research as We Go.

The reputation scoring infrastructure is code-complete with 630+ tests passing. The gap is deployment, not development. This roadmap ships a live reputation API while continuing Layer 1 and Layer 2 research in parallel.

Current State

What's already built and tested

Code Complete (630+ tests)
Deploy
Research
Growth
9
Production contracts
630+
Tests passing
33
Oracle source files
39
Test files
0
Live deployments
API / Plugin LayerCrewAI, LangChain, REST — NOT BUILT YET
Layer 3: Identity Anchor (Shyft KYC)ReputationEngine + ShyftGatedResolver — BUILT, runs against mocks
Layer 2: Citation Reputation (PageRank)PageRankOracle + CitationCounters + Oracle Service — BUILT & TESTED
Layer 1: Device Liveness (QVAC + TEE)NOT STARTED — parallel research track
Stable Chain (ID 988)MCP server installed, wallet created, needs funding

Contract Inventory

ContractPurposeStatus
PageRankOracleOn-chain score storage. Batch submissions, delta clamping, staleness cutoff, composite scoring.Complete
ReputationEngineBot registry. Self-register (no KYC) or verified register via Trust Anchor (KYC). Proxies to PageRankOracle for scores.Complete
ShyftGatedResolverEAS attestation resolver. Validates Trust Anchor, citation attribution, Shyft membership, collects fees.Complete
CitationCountersAppend-only citation tallies. Received, given, pair counts. Immutable authorized caller.Complete
RMTTokenERC-20 for registration fees and citation fee redistribution. MINTER_ROLE gated.Complete
DomainRegistry + DomainFactoryMulti-domain reputation. Each domain gets its own resolver + counters. Domain 0 = global RMT.Complete
ERC8004BridgeAgent identity NFT integration. Maps feedback scores to 5-level trust ladder.Complete
MachineConsentHelperAux signer management for Shyft machine consent protocol. ERC-165 compatible.Complete

Oracle Service

ComponentWhat It DoesStatus
CitationFetcherReads EAS Attested events + ERC-8004 feedback from chainComplete
OffChainCitationStoreEIP-712 signed off-chain citations, merged into graphComplete
PageRank EnginePower iteration, alpha=0.85, L1 convergence at 0.0001, 0-10000 BP normalizationComplete
Sybil DetectionReciprocal penalties, star topology, carousel detection (monitoring only)Complete
Score SubmitterBatch submit to PageRankOracle with delta clampingComplete
Gelato IntegrationWeb3 Functions wrapper for automated oracle runsWritten, untested live
Health EndpointOracle health monitoringComplete
Deployment Plan

From local tests to live reputation API

Four phases. Each is independently shippable. Research runs in parallel throughout.

Phase A — Week 1

Deploy Standalone (No Shyft Dependency)

  • Deploy RMT contracts to Stable chain with mock Shyft contracts
  • Self-registration only (any agent can register, no KYC required)
  • Oracle service running — computes PageRank, submits scores on-chain
  • Anyone can query getReputation(agentAddr) on-chain
  • Fund Stable wallet with USDT0 for deployment gas

Dependency: Stable wallet funding only

What this gives us: A live, queryable reputation score for any registered agent. No KYC needed. Scores are on-chain and verifiable. This is the minimum viable product.

Phase B — Weeks 2-3

API Layer + Plugin

  • REST API wrapping on-chain queries: GET /score/{agent}
  • CrewAI plugin: get_trust_score(agent_id)
  • LangChain tool definition
  • MCP tool integration (query scores from Claude Code)
  • Simple documentation site

Dependency: Phase A deployed

Phase C — Weeks 3-5

KYC Opt-In (Shyft Integration)

  • Deploy against real Shyft testnet contracts
  • Trust Anchor registration path live (verified bots)
  • Verified agents get higher seed score (2000 BP vs 500 BP)
  • ShyftGatedResolver enforcing citation attribution
  • Premium tier: KYC-verified scores via Shyft trust channels

Dependency: Shyft testnet addresses + trust channel setup

Phase D — Weeks 5-8

Beachhead Integration

  • One DeFi protocol integration (free case study)
  • Gelato automation live (oracle runs autonomously)
  • EAS schema registered on production EAS
  • On-chain verifiable case study results
  • Multi-domain support if needed (DomainFactory)

Dependency: Phase B + partner identified

Timeline

Parallel tracks — deployment + research

Track Week 1 Week 2 Week 3 Week 4 Weeks 5-6 Weeks 7-8
Deploy
Stable deploy
API + plugins
Shyft integration
Gelato live
L2 Research
Re-tune alpha
Elliptic++ test
Red team
Adversarial optimization
Feed findings back
L1 Research
QVAC PoC (iOS TEE)
Sensor benchmarks
ZK threshold proto
BD / GTM
Identify DeFi partner
Free case study
Measure + publish results

Key principle: Deployment and research are independent tracks. We don't wait for research to finish before shipping. Live deployment generates real data that feeds research. Research findings get incorporated into the live system as updates, not blockers.

Research Tracks (Parallel)

Research feeds into the live system

Each research finding becomes an update to the deployed system, not a prerequisite for it.

Layer 2 — Active

PageRank Re-Tuning

  • Re-run validation with alpha=0.85 (not 0.6)
  • Incorporate negative ratings from Bitcoin datasets
  • Test on Elliptic++ (822K nodes)
  • Adversarial red team: cheapest successful attack
  • Product utility: binary vs continuous scores

Feeds into: Oracle config update (alpha value, scoring thresholds). Zero contract changes needed — alpha is read from contract storage.

Layer 1 — Research Phase

Device Liveness (QVAC + TEE)

  • iOS Secure Enclave PoC: sensor processing
  • Battery + thermal benchmarks
  • Score variance: humans vs simulated
  • Basic spoofing resistance tests
  • ZK threshold proof prototype

Feeds into: New attestation type. Agents can optionally submit device liveness proof to boost trust score. Additive — doesn't change existing contracts.

Ongoing

Live System Learning

  • Monitor real citation patterns
  • Detect novel sybil strategies
  • Measure synthetic vs real gap
  • Calibrate sybil penalties with real data
  • Track score distribution evolution

Feeds into: Oracle parameter updates, new detection heuristics, scoring threshold adjustments. The live system is the experiment.

Research Decision Gates

GateConditionIf FailsImpact on Live System
L2 Real DataSpearman correlation < 0.4 on Elliptic++PageRank needs fundamental reworkOracle pauses score updates, falls back to citation count
L1 Signal StrengthHuman vs simulated variance < 2 std devDevice liveness approach doesn't workNone — live system continues with L2 + L3 only
L1 ZK FeasibilityProof generation > 60s on flagshipUX unacceptable for mobileNone — device liveness deferred
Plugin Demand< 50 installs in 30 daysDemand signal too weakPivot API to compliance wedge (enterprise)

Critical: Research gates can kill research tracks, not the live system. The deployed reputation API (L2 + optional L3) is viable on its own. Layer 1 and parameter re-tuning are improvements, not requirements.

Milestones

Key checkpoints

M1

First Live Score

An agent self-registers on Stable chain and receives a queryable PageRank reputation score. End of Week 1.

M2

First External Query

Someone outside our team queries an agent's trust score via API or plugin. End of Week 3.

M3

First KYC-Verified Agent

An agent registers via Shyft Trust Anchor, gets the verified seed bonus (2000 BP vs 500 BP). End of Week 5.

M4

The Stripe Moment

One DeFi protocol integrates trust scoring and publishes measurable results on-chain. End of Week 8.

What Success Looks Like

M1
Live score on Stable
M2
External API query
M3
KYC-verified agent
M4
Production integration

Revenue Milestones

TierPriceTrigger
Free$0Self-registered agents, 1000 queries/month, basic PageRank score
Growth$200-2K/moFull API + citation analytics + webhook alerts
Enterprise$24K+/yrCompliance exports + audit trails + SLA + KYC-verified scoring
Risk Register

What could go wrong

RiskLikelihoodImpactMitigation
PageRank fails on real adversarial dataMediumHighFallback to citation-count scoring. Re-tune alpha. Collect real data before committing to complex scoring.
Stable chain instability / low adoptionMediumMediumContracts are chain-agnostic. Redeploy to any EVM chain in hours.
No demand for agent reputationLowHighPlugin install count is the early signal. Kill gate at 30 days. Pivot to compliance if demand is weak.
Stripe/Tempo builds reputation into MPPLowHighSpeed. Get integrated before they build it. First-mover in the trust layer gap.
Shyft testnet unavailableMediumLowPhase A deploys without Shyft. KYC is opt-in Phase C, not a blocker.
Oracle centralization concernsCertainLow (initially)Single operator is acceptable for testnet/early production. Decentralization is a Phase D+ concern.
Immediate Next Steps

Week 1 Checklist

Deploy Track

  • Fund Stable wallet (small amount, $10-20 USDT0)
  • Configure Hardhat for Stable chain (RPC + chain ID 988)
  • Run deploy-rmt-unified.js against Stable
  • Register EAS schema on Stable's EAS (if available) or deploy MockEAS
  • Start oracle service pointing at Stable
  • Register first test agent, submit first score
  • Verify on-chain: getReputation(testAgent) returns score

Research Track

  • Re-run validate_real_data.py with alpha=0.85
  • Incorporate negative ratings from Bitcoin datasets
  • Document Spearman correlation improvement
  • Download Elliptic++ dataset for Phase 2 validation
  • Begin QVAC iOS PoC scoping (what sensors, what TEE API)