Features & Guardrails

Feature status, proof coverage, guardrail details, and generation modes.

Feature Status

Launch labels are governed by feature-claim-ledger-2026-05-06.md. A feature can be implemented while still needing live launch proof.

| Feature | Status | Details |
| --- | --- | --- |
| 12 Creative Domains | Live-covered; release-rerun required | Current final QA proof covers p5, SVG, GLSL, Three.js, Hydra, Strudel, Tone.js, Revideo, HyperFrames, ASCII, Kinetic, and TextGen. Public launch copy must use a fresh, current-commit live receipt before calling release output proven. |
| Multi-Agent Critique | Implemented | 3-agent board: Minimalist, Expressionist, Technician. Product launch claims require the claim ledger proof row. |
| Compost Mill | Experimental | Digest → Shred → Collide → Score → Promote pipeline; public value claims stay experimental until live proof is current. |
| Self-Improving Harness | Manual-memory mode | Failure logging and pattern detection record manual adaptation advice; automatic HarnessUpdater runtime fixes are not active launch behavior. |
| Music Theory Engine | Implemented | Euclidean rhythms, Markov chains, scales, chord progressions; claim ledger maps the current proof boundary. |
| Audio Pipeline | Implemented | Audio analysis, pitch detection, visual parameter mapping; launch proof is feature-specific. |
| Aesthetic Guardrails | Experimental | Color harmony, layout, typography, sound quality critics; not a blanket guarantee of visual quality. |
| Model-Agnostic LLM | Setup caveated | ProviderFactory supports multiple providers, but setup diagnostics and live proof are provider-specific. |
| Circuit Breaker | Stream-failure guarded | Fallback stream errors now surface as failures instead of empty successful assistant responses. Broad automatic failover reliability claims still need provider-specific live proof. |
| Bubble Tea TUI | Smoke-proven | Go-based TUI with HTTP/SSE bridge, provider switching, and deterministic user-surface proofs. |
| Task Ledger | Implemented | Self-hosting corpus, runner, verifier (src/ledger/); examples and provenance proof remain tracked separately. |
| LiminalFS | Implemented | ProjectStore, EventStore, AssetStore (src/fs/). |
| Guardrails M1-M18 | Proof-limited | Core, extended, and compliance layers exist; launch wording must reference the current command proof per guardrail surface. |
| Security Hardening | Audit pending | SSRF protection, rate limiting, sandbox, CSRF, and helmet exist, but release security claims remain gated by final QA findings. |
| Thinking-Trace Loop | Active | Meta-learning from LLM reasoning traces. |
| Plugin System | Active | PluginManager, extensible generator registry. |

Natural Language Interface

Both chat and the TUI interpret natural-language input and route it to the matching handler.

Intent Detection

| Pattern | Routes To | Example |
| --- | --- | --- |
| Generation request | RalphLoop | "Create a particle system" |
| Evaluation request | Evaluator | "Rate this sketch" |
| Fix request | Auto-fix (LLM) | "Fix the validation in main.ts" |
| Status query | Status reporter | "What's the test status?" |
| Knowledge query | Knowledge base | "Tell me about p5.js noise" |
| Slash command | Direct handler | /fix, /status, /provider |

Implemented in src/tui/NaturalInterface.ts. Works in both liminal chat and the Bubble Tea TUI.
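
The routing in the table above can be pictured as an ordered list of pattern rules. The sketch below is illustrative only: the `Route` names, the regular expressions, and the fallback are assumptions made for this example and are not taken from src/tui/NaturalInterface.ts.

```typescript
// Hypothetical route names mirroring the "Routes To" column above.
type Route =
  | "RalphLoop"
  | "Evaluator"
  | "AutoFix"
  | "StatusReporter"
  | "KnowledgeBase"
  | "DirectHandler";

interface IntentRule {
  route: Route;
  pattern: RegExp; // illustrative heuristic, not the project's real matcher
}

// Order matters: earlier rules win, so slash commands are checked first.
const RULES: IntentRule[] = [
  { route: "DirectHandler", pattern: /^\/\w+/ },
  { route: "AutoFix", pattern: /\b(fix|repair|debug)\b/i },
  { route: "Evaluator", pattern: /\b(rate|evaluate|score|critique)\b/i },
  { route: "StatusReporter", pattern: /\bstatus\b/i },
  { route: "KnowledgeBase", pattern: /\b(tell me about|what is|explain)\b/i },
  { route: "RalphLoop", pattern: /\b(create|generate|make|draw|compose)\b/i },
];

function detectIntent(input: string): Route {
  const text = input.trim();
  for (const rule of RULES) {
    if (rule.pattern.test(text)) return rule.route;
  }
  // Assumption: unmatched input falls back to the knowledge base.
  return "KnowledgeBase";
}
```

With these rules, `detectIntent("Rate this sketch")` yields `"Evaluator"` and `detectIntent("/status")` yields `"DirectHandler"`. Checking the slash-command rule first keeps `/fix` from being treated as a free-text fix request.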

LLM Mode Agent

An agent that can use tools to read, write, build, and test code autonomously.

Capabilities

Source: src/harness/agent/LLMModeAgent.ts. Configurable max steps (default: 15).
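
A bounded tool-use loop of this kind might look like the following sketch. Only the default of 15 steps comes from the text above; the `Planner`, `ToolBox`, and `runAgent` names are hypothetical, and the loop is kept synchronous for brevity even though a real agent would await LLM and tool calls.

```typescript
// Hypothetical tool names taken from the four verbs in the description above.
type ToolName = "read" | "write" | "build" | "test";

interface ToolCall {
  tool: ToolName;
  input: string;
}

interface AgentStep {
  call: ToolCall;
  output: string;
}

// A planner returns the next tool call, or null when it considers the task done.
type Planner = (history: AgentStep[]) => ToolCall | null;
type ToolBox = Record<ToolName, (input: string) => string>;

interface AgentResult {
  steps: AgentStep[];
  finished: boolean;
}

function runAgent(plan: Planner, tools: ToolBox, maxSteps: number = 15): AgentResult {
  const steps: AgentStep[] = [];
  while (steps.length < maxSteps) {
    const call = plan(steps);
    if (call === null) return { steps, finished: true };
    steps.push({ call, output: tools[call.tool](call.input) });
  }
  // Step budget exhausted before the planner signalled completion.
  return { steps, finished: false };
}
```

The step cap is what stops a planner that never signals completion: such a run returns with `finished: false` after `maxSteps` tool calls.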

Ralph-Wiggum Loop

The core generation cycle.

1. Prompt Enhancement
   Brain injects artistic knowledge, compost seeds, archive examples
        ↓
2. Generation
   LLM generates creative code in the target domain
        ↓
3. Evaluation
   Multi-dimensional scoring (technical + aesthetic)
   3-agent critique board deliberates
        ↓
4. Stagnation Detection
   Compares against recent iterations, detects loops
        ↓
5. Strategy Adaptation
   Adjusts temperature, prompt framing, or switches approach
        ↓
6. Quality Gate
   Stops when threshold met or max iterations reached

Aesthetic gate: When useAestheticGuardrails: true is set, the RalphLoop runs its output through the AestheticCritic pipeline (color harmony, layout, typography, sound quality) before accepting it.
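
The six steps above can be sketched as a single loop. This is a simplified illustration under stated assumptions: `LoopDeps`, `LoopConfig`, `isStagnant`, the starting temperature of 0.7, and the 0.2 bump are all hypothetical; the critique board is collapsed into one `evaluate` hook; and strategy adaptation is reduced to a temperature change.

```typescript
interface LoopConfig {
  threshold: number;        // quality gate score
  maxIterations: number;
  stagnationWindow: number; // how many recent scores to compare
  epsilon: number;          // minimum spread that counts as progress
}

interface LoopDeps {
  enhance: (prompt: string) => string;                        // step 1
  generate: (prompt: string, temperature: number) => string;  // step 2
  evaluate: (code: string) => number;                         // step 3
}

interface LoopResult {
  code: string;
  score: number;
  iterations: number;
  passed: boolean;
}

// Step 4: the loop is stagnant when the last `window` scores barely differ.
function isStagnant(scores: number[], window: number, epsilon: number): boolean {
  if (scores.length < window) return false;
  const recent = scores.slice(-window);
  return Math.max(...recent) - Math.min(...recent) < epsilon;
}

function ralphLoop(prompt: string, deps: LoopDeps, cfg: LoopConfig): LoopResult {
  const enhanced = deps.enhance(prompt);
  const scores: number[] = [];
  let temperature = 0.7; // assumed starting value
  let best = { code: "", score: -Infinity };

  for (let i = 1; i <= cfg.maxIterations; i++) {
    const code = deps.generate(enhanced, temperature);
    const score = deps.evaluate(code);
    scores.push(score);
    if (score > best.score) best = { code, score };

    // Step 6: quality gate.
    if (score >= cfg.threshold) return { ...best, iterations: i, passed: true };

    // Step 5: adapt strategy when stuck (here: raise temperature only).
    if (isStagnant(scores, cfg.stagnationWindow, cfg.epsilon)) {
      temperature = Math.min(1.5, temperature + 0.2);
      scores.length = 0; // restart the stagnation window after adapting
    }
  }
  return { ...best, iterations: cfg.maxIterations, passed: false };
}
```

Returning the best candidate seen, rather than the last one, means a run that hits the iteration cap still hands back its strongest attempt along with `passed: false`.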

Guardrail Details (M1-M18)

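All 18 guardrails are implemented and active in the codebase. As noted under Feature Status, launch wording must still reference the current command proof for each guardrail surface.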

Core (M1-M8)

| ID | Name | Source |
| --- | --- | --- |
| M1 | Code Validation | correctness/ |
| M2 | Type Safety | correctness/TypeCheckGuardrail |
| M3 | Test Verification | correctness/TestVerificationGuardrail |
| M4 | Schema Validation | validation/SchemaValidator |
| M5 | Resource Limits | core/ResourceLimiter |
| M6 | Code Style | hygiene/CodeStyleGuardrail |
| M7 | Catastrophic Failure | rules/CatastrophicGuardrails |
| M8 | Registry | core/GuardrailRegistry |

Extended (M9-M11)

| ID | Name | Source |
| --- | --- | --- |
| M9 | Semantic Validator | SemanticValidator.ts |
| M10 | Runtime Health | RuntimeHealthMonitor.ts |
| M11 | Accessibility | AccessibilityGuardrails.ts |

Compliance (M12-M18)

| ID | Name | Source |
| --- | --- | --- |
| M12 | Privacy | compliance/PrivacyGuardrail |
| M13 | Fairness | compliance/FairnessGuardrail |
| M14 | Supply Chain | compliance/SupplyChainGuardrail |
| M15 | Resilience | compliance/ResilienceGuardrail |
| M16 | Audit Trail | compliance/AuditGuardrail |
| M17 | Explainability | compliance/ExplainabilityGuardrail |
| M18 | Injection Prevention | compliance/InjectionGuardrail |
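
One way to picture how guardrails with IDs like these could be collected and run together is a small registry. The sketch below is an assumption-laden illustration: `SketchRegistry`, the `Guardrail` interface, and both toy checks are invented for this example and do not reflect the actual code in core/GuardrailRegistry or the compliance layer.

```typescript
interface GuardrailResult {
  id: string;
  passed: boolean;
  message: string;
}

interface Guardrail {
  id: string; // e.g. "M5"
  name: string;
  check: (code: string) => GuardrailResult;
}

class SketchRegistry {
  private readonly guardrails: Guardrail[] = [];

  register(guardrail: Guardrail): void {
    if (this.guardrails.some((g) => g.id === guardrail.id)) {
      throw new Error(`guardrail ${guardrail.id} is already registered`);
    }
    this.guardrails.push(guardrail);
  }

  // Runs every guardrail and returns only the failures.
  failures(code: string): GuardrailResult[] {
    return this.guardrails.map((g) => g.check(code)).filter((r) => !r.passed);
  }
}

// Toy stand-ins for two rows of the tables above; the limits and patterns are made up.
const resourceLimit: Guardrail = {
  id: "M5",
  name: "Resource Limits",
  check: (code) => ({
    id: "M5",
    passed: code.length <= 10000,
    message: "generated code must stay under 10,000 characters",
  }),
};

const injectionPrevention: Guardrail = {
  id: "M18",
  name: "Injection Prevention",
  check: (code) => ({
    id: "M18",
    passed: !/\beval\s*\(/.test(code),
    message: "generated code must not call eval()",
  }),
};

const registry = new SketchRegistry();
registry.register(resourceLimit);
registry.register(injectionPrevention);
```

Calling `registry.failures(code)` returns an empty array when every registered check passes, so a caller can gate acceptance on `failures(code).length === 0`.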

Harness Task System

Structured self-improvement via task files in src/ledger/.

Task Corpus

Structured task files defining bug fixes and improvements. Each task specifies inputs, expected outputs, and verification criteria.
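
As a rough illustration, a task entry with those three parts could be typed as shown below. The `LedgerTask` shape, its field names, and `validateTask` are hypothetical; the actual file format in src/ledger/ is not described here.

```typescript
// Hypothetical shape; field names are illustrative only.
interface LedgerTask {
  id: string;
  kind: "bug-fix" | "improvement";
  inputs: string[];          // files or prompts the task starts from
  expectedOutputs: string[]; // artifacts that must exist afterwards
  verification: string[];    // commands a verifier would run
}

// Returns a list of problems; an empty list means the task is well-formed.
function validateTask(task: LedgerTask): string[] {
  const problems: string[] = [];
  if (task.id.trim() === "") problems.push("id is empty");
  if (task.inputs.length === 0) problems.push("no inputs");
  if (task.expectedOutputs.length === 0) problems.push("no expected outputs");
  if (task.verification.length === 0) problems.push("no verification criteria");
  return problems;
}
```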

TaskRunner

Executes tasks with sandbox safety. Applies rate limits and validates tool usage patterns.

TaskVerifier

Validates results using shell-free execFileSync with metacharacter guards and a prefix whitelist for allowed commands.
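
The combination of a metacharacter guard, a prefix whitelist, and a shell-free call can be sketched as follows. `execFileSync` is Node's real `child_process` API and spawns the program directly with an argument array, without a shell. Everything else (the whitelist entries, the rejected character set, the 60-second timeout, and the function names) is an assumption for illustration, not the project's actual configuration.

```typescript
import { execFileSync } from "node:child_process";

// Hypothetical whitelist: each entry is an argv prefix a command must start with.
const ALLOWED_PREFIXES: string[][] = [
  ["npm", "test"],
  ["npx", "tsc"],
  ["node", "--version"],
];

// Characters a shell would interpret; rejected even though no shell is spawned.
const SHELL_METACHARACTERS = /[;&|`$<>(){}\\!*?'"\n\r]/;

// Splits a command into argv, throwing if it fails either guard.
function parseCommand(command: string): string[] {
  if (SHELL_METACHARACTERS.test(command)) {
    throw new Error("command contains shell metacharacters");
  }
  const argv = command.trim().split(/\s+/).filter((part) => part.length > 0);
  const allowed = ALLOWED_PREFIXES.some((prefix) =>
    prefix.every((token, i) => argv[i] === token)
  );
  if (!allowed) throw new Error("command is not on the prefix whitelist");
  return argv;
}

// Returns true when the command exits with status 0, false otherwise.
function verify(command: string, cwd: string): boolean {
  const argv = parseCommand(command);
  const file = argv[0];
  if (file === undefined) return false;
  try {
    execFileSync(file, argv.slice(1), { cwd, stdio: "pipe", timeout: 60000 });
    return true;
  } catch {
    return false; // non-zero exit, timeout, or missing executable
  }
}
```

Because arguments are passed as an array, a string such as `npm test; rm -rf /` would never be interpreted by a shell anyway; the metacharacter guard rejects it up front so that a malformed task fails loudly instead of running with odd arguments.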

Task Milestones

| Milestone | Focus | Status |
| --- | --- | --- |
| M1-M8 | Core correctness and safety | Complete |
| M9-M11 | Semantic validation, runtime health, accessibility | Complete |
| M12-M18 | Compliance: privacy, fairness, supply chain, resilience, audit, explainability, injection | Complete |

CLI: liminal ledger list, liminal ledger run <id>, liminal ledger verify <id>

Generation Modes

| Mode | Flag | Description |
| --- | --- | --- |
| Single | default | One model generates, evaluates, iterates |
| Swarm | --use-swarm | Five default personas (Kai, Nova, Rex, Sam, Max) generate in parallel and vote on the best result |
| Deep Collab | --routing-mode | Dual-model routing (fast + powerful) |
| Live AV | --mode live-music | Generates Strudel music + Hydra video-synth code |

Swarm strategies: competitive, hybrid, ring, mesh. Select one with the --swarm-mode flag.
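
The voting step of Swarm mode can be illustrated with a simple plurality tally. This sketch covers only the vote, not the parallel generation, and rests on assumptions: the `Scorer` hook, the rule that a persona cannot vote for its own candidate, and the tie-break are all hypothetical. Only the five persona names come from the table above.

```typescript
const PERSONAS = ["Kai", "Nova", "Rex", "Sam", "Max"] as const;
type Persona = (typeof PERSONAS)[number];

interface Candidate {
  author: Persona;
  code: string;
}

// Hypothetical scoring hook: how much `voter` likes `candidate`.
type Scorer = (voter: Persona, candidate: Candidate) => number;

function pickWinner(candidates: Candidate[], score: Scorer): Candidate {
  if (candidates.length === 0) throw new Error("no candidates to vote on");
  const votes: Record<string, number> = {};

  for (const voter of PERSONAS) {
    // Assumption: personas may not vote for their own candidate.
    const others = candidates.filter((c) => c.author !== voter);
    if (others.length === 0) continue;
    const favourite = others.reduce((a, b) =>
      score(voter, b) > score(voter, a) ? b : a
    );
    votes[favourite.author] = (votes[favourite.author] ?? 0) + 1;
  }

  // Ties resolve to the earliest candidate in the input order.
  return candidates.reduce((a, b) =>
    (votes[b.author] ?? 0) > (votes[a.author] ?? 0) ? b : a
  );
}
```

Excluding self-votes avoids the degenerate case where every persona backs its own output and the tally ends in a five-way tie.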