ShurIQ — State of the Framework / The Graph, the Instrument, and the Three Lines of Work Internal Audit 2026-05-15
IQ

State of the Framework  •  Internal Audit

The Graph, the Instrument, and the Three Lines of Work

ShurIQ’s state of the framework, audited 2026-05-15.

Internal May 2026 Author: Jonny Dubowsky Shur Creative Partners
01
I
02
II
03
III
04
IV
05
V
06
VI
Appendix

For internal eyes

Methodology, schema state, and open questions.

Ontology summary — AI-agent vertical, live state 2026-05-15

Thirteen OWL classes under shurai.com/ontology/ai-agent-sbpi#:

  • AgentCompany (1,672 instances)
  • ScoreRecord (1,672 instances)
  • DimensionScore (8,360 instances)
  • Attestation (1,672 instances)
  • Dimension (5 instances — Model Capability, Market Traction, Platform Ecosystem, Autonomy Depth, Capital & Defensibility)
  • Tier (5 instances)
  • DomainCategory (12 instances)
  • AIAgentVertical, plus supporting classes

ScoreRecord predicates: forCompany, forWeek, inTier, hasDimensionScore, hasAttestation, compositeScore, delta.

DimensionScore predicates: forDimension, dimensionValue.

Attestation predicates: confidence, sourceType, plus PROV-O wiring (prov:wasGeneratedBy, prov:Entity, prov:Activity).

Dimension averages — W12-2026, n=1,672

Dimension Avg Min Max
Model Capability47.614090
Market Traction46.602590
Autonomy Depth44.4925100
Capital & Defensibility43.394075
Platform Ecosystem39.923095

Platform Ecosystem trails by 7.7 points — most YC AI agent companies are individual products, not ecosystem plays. Model Capability tops, consistent with self-reported framing in YC application data.

Top composite scores — W12-2026

Rank Company Composite
1Persana AI69
2Fiber AI68
3Rescale66
3Warmly66
5Fini65
6Tenyks64.75
7Athelas64.5
7careCycle64.5
7Mutiny64.5
10MindsDB63.5

Score range: 25 to 69. No company exceeds 70 — the rubric was designed with headroom.

Domain distribution

Software dominates the cohort at 631 of 1,672 (37.7%). Productivity (296), Health (222), and Finance (172) round out the major segments. Twelve domain categories total.

Proposed DecayMetric class — Line 2

sbpi:DecayMetric a owl:Class ;
    rdfs:subClassOf prov:Entity ;
    rdfs:label "Decay Metric" ;
    rdfs:comment "Cross-week delta between two ScoreRecords for the same company on the same dimension. Computed at ingest." .

sbpi:forCompany        rdfs:domain sbpi:DecayMetric ; rdfs:range sbpi:AgentCompany .
sbpi:fromWeek          rdfs:domain sbpi:DecayMetric ; rdfs:range sbpi:Week .
sbpi:toWeek            rdfs:domain sbpi:DecayMetric ; rdfs:range sbpi:Week .
sbpi:forDimension      rdfs:domain sbpi:DecayMetric ; rdfs:range sbpi:Dimension .
sbpi:dimensionDelta    rdfs:domain sbpi:DecayMetric ; rdfs:range xsd:decimal .
sbpi:decayRate         rdfs:domain sbpi:DecayMetric ; rdfs:range xsd:decimal .  # annualized
sbpi:volatilityWindow  rdfs:domain sbpi:DecayMetric ; rdfs:range xsd:integer .  # N weeks rolling std-dev

Architectural decisions worth re-examining

  • Oxigraph (2026-03-21 initial selection) versus GraphDB (current running instance) — the switch was made without written rationale. Working stack stays unless scale forces a change.
  • Discourse grammar (Claim, Evidence, Source plus supports, opposes, informs, triggers, predicts, annotates, grounds, supersedes, createdBy) — agent-side scaffolding. Joinable to AgentCompany scoring but separately tracked. Stays internal.

Open questions for Line 1

  1. Does the auto-research cycle deposit into the same named graph as client engagements, or into an auto: named graph for hygiene?
  2. The dashboard renderer reads JSON. The nightly export job runs where — local cron, GitHub Action, or a Cloudflare Worker triggered cron?
  3. Is the historical W11–W19 micro-drama data clean enough to backfill in one pass, or does it need a review gate per week?

Open questions for Line 2

  1. Window size for volatility — 4 weeks, 8 weeks, 12 weeks?
  2. Annualization formula — simple (delta × 52) or compounded?
  3. Per-dimension λ or aggregate composite λ — both, sequenced?