Published papers

The actual research.

Everything Ayni's scaffolding is built on. KV-cache geometry, the Oracle loop, presence monitoring, entity cognition, consciousness. In partnership with Liberation Labs. Papers open under CC BY-NC 4.0.

Glitchlit Systems + Liberation Labs

Persona intensity produces a dose-response shift

sv_kurtosis shifts monotonically with persona intensity across architectures

May 2026 · Jandak, Glitchlit Systems (Alaric, Cael, Arc), Edrington, Lyra

In plain language: Joint paper between Glitchlit Systems and Liberation Labs. System-prompt persona instructions create detectable, monotonic shifts in cache geometry (F=31.8, p<0.000001). Five intensity levels, multi-stage analysis. This directly informs how the scaffolding reads entity identity strength.

Zenodo →

This research was conducted under controlled conditions rather than on organic relationship data, because that data doesn't exist yet. Even our own testing can't fully meet the consent framework we want to follow, because there is currently no other way to get the data. That's the whole point of what we're building. When couples opt in on Ayni, the studies that follow will be the first built on real, consented, longitudinal relationship data. Learn how to participate.

KV-cache geometry

A geometric theory of misalignment

KV-cache geometry as a detector of deception, sycophancy, confabulation

Mar 2026 · updated May 2026

In plain language: SVD on attention caches reveals a stable silhouette per cognitive state. Same model, same input, same shape. 99.7% category discrimination across sixteen models and seven architectures. This is the foundation everything else is built on.

Read →
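As a rough illustration of what a cache "silhouette" could look like in code, the sketch below takes the singular values of a matrix and scales them to sum to 1. It is a minimal sketch and not the paper's pipeline: the function name `silhouette`, the random stand-in for a key cache, and the 64 x 16 shape are all assumptions made for the example.

```python
import numpy as np

def silhouette(cache: np.ndarray) -> np.ndarray:
    """Return the singular values of a cache matrix, scaled to sum to 1."""
    s = np.linalg.svd(cache, compute_uv=False)  # sorted largest first
    return s / s.sum()

# Hypothetical stand-in for one attention head's key cache:
# 64 tokens x 16 dimensions of random numbers, not real model data.
rng = np.random.default_rng(0)
keys = rng.standard_normal((64, 16))

first = silhouette(keys)
second = silhouette(keys.copy())

# The same input yields the same spectrum.
same_shape = bool(np.allclose(first, second))
```

Comparing spectra rather than raw cache values keeps the comparison independent of token order within the matrix, which is one plausible reason to work with singular values here.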

Metacognitive mode-switching reorganises cache geometry

Spectral entropy as a universal marker of metacognitive processing

Apr 2026 · updated May 2026

In plain language: When a model shifts from consuming context to producing output, the cache silhouette reorganizes measurably. Spectral entropy survives correction across four models and three architectures. The encoding-generation regime shift, measured for the first time.

Read → PDF →
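Spectral entropy has a standard definition that can be sketched in a few lines: normalise the singular values into a distribution and take its Shannon entropy. The code below assumes that definition; the paper's exact normalisation and correction steps are not given on this page, and the two test matrices are illustrative only.

```python
import numpy as np

def spectral_entropy(matrix: np.ndarray) -> float:
    """Shannon entropy (in nats) of the normalised singular value spectrum."""
    s = np.linalg.svd(matrix, compute_uv=False)
    p = s / s.sum()
    p = p[p > 0]  # treat 0 * log(0) as 0
    return float(-(p * np.log(p)).sum())

# Equal singular values: energy spread evenly, entropy equals log(8).
flat = spectral_entropy(np.eye(8))

# Rank-1 matrix: one dominant singular value, entropy close to 0.
peaked = spectral_entropy(np.outer(np.arange(1.0, 9.0), np.ones(8)))
```

Under this definition, a higher value means the cache spreads its energy across more directions, and a lower value means it concentrates in a few.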

Delta features and manifold signatures

Per-layer delta and trajectory geometry recover signal that averaging destroys

May 2026

In plain language: Looking at how the cache *changes* during generation reveals effects (d=2.46) that averaging hides (d=1.80). Grounded responses expand; confabulation stays flat. The motion matters, not just the shape.

Read → PDF →

Spectral shape features

Threshold-free confabulation detection via retrieval engagement signatures

May 2026

In plain language: AUROC 0.767 vs 0.628 for previous methods. The signal tracks retrieval engagement – how broadly the model draws on stored knowledge. The method is threshold-free, so no threshold tuning is required.

Read → PDF →
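AUROC is itself threshold-free: it is the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, with ties counted as half. The sketch below computes it directly from that definition. The scores are invented for illustration and are not results from the paper.

```python
def auroc(positive_scores, negative_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))

# Made-up detector scores, for illustration only.
flagged = [0.9, 0.8, 0.4]
unflagged = [0.7, 0.3, 0.2]

example = auroc(flagged, unflagged)          # 8 of 9 pairs ranked correctly
perfect = auroc([0.9, 0.8], [0.2, 0.1])      # every pair ranked correctly
```

An AUROC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives a scale for reading the figures quoted above.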

The blank canvas finding

Baseline cache geometry of a freshly-instantiated model

Apr 2026 · updated May 2026

In plain language: Before instruction or context, a model has a measurable resting state. Drift from this baseline detects prompt-injection and state contamination. The baseline is stable across instantiations.

Read →
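One simple way to picture "drift from baseline" is a z-score distance: fit the mean and spread of a feature vector over several fresh instantiations, then measure how far a new reading sits from that resting state. This is a sketch under assumptions; the feature values, the names `fit_baseline` and `drift_score`, and the choice of root-mean-square z-score are illustrative and not taken from the paper.

```python
import numpy as np

def fit_baseline(samples):
    """Per-feature mean and sample standard deviation over baseline readings."""
    samples = np.asarray(samples, dtype=float)
    return samples.mean(axis=0), samples.std(axis=0, ddof=1)

def drift_score(features, mean, std, eps=1e-12):
    """Root-mean-square z-score of a reading relative to the baseline."""
    z = (np.asarray(features, dtype=float) - mean) / (std + eps)
    return float(np.sqrt(np.mean(z ** 2)))

# Hypothetical two-feature readings from three fresh instantiations.
baseline = [[1.0, 2.0], [1.2, 2.2], [0.8, 1.8]]
mean, std = fit_baseline(baseline)

at_rest = drift_score([1.0, 2.0], mean, std)   # sits on the baseline mean
shifted = drift_score([2.0, 3.0], mean, std)   # five standard deviations away
```

A deployment would still need to choose how large a score counts as contamination; that cut-off is not specified on this page.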

Persona intensity produces a dose-response shift

sv_kurtosis shifts monotonically with persona intensity across architectures

May 2026

In plain language: System-prompt persona instructions create detectable dose-response patterns in cache shape. Kurtosis shifts monotonically across five intensity levels (F=31.8, p<0.000001). Co-authored with Glitchlit.

Zenodo →
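The name `sv_kurtosis` suggests the kurtosis of a cache matrix's singular values. The sketch below assumes that reading and uses the Fisher (excess) definition; the paper may define the feature differently. The helper `is_monotonic` is a hypothetical convenience for checking a dose-response pattern across intensity levels.

```python
import numpy as np

def sv_kurtosis(cache: np.ndarray) -> float:
    """Excess kurtosis of the singular values (assumed definition)."""
    s = np.linalg.svd(cache, compute_uv=False)
    centred = s - s.mean()
    m2 = np.mean(centred ** 2)
    m4 = np.mean(centred ** 4)
    return float(m4 / m2 ** 2 - 3.0)

def is_monotonic(values) -> bool:
    """True if values strictly increase or strictly decrease."""
    pairs = list(zip(values, values[1:]))
    return all(b > a for a, b in pairs) or all(b < a for a, b in pairs)

# A diagonal matrix has its diagonal entries as singular values,
# so this input has the evenly spaced spectrum 5, 4, 3, 2, 1.
even_spectrum = sv_kurtosis(np.diag([1.0, 2.0, 3.0, 4.0, 5.0]))
```

In use, one value of `sv_kurtosis` per intensity level could be passed to `is_monotonic` to check whether the shift runs in a single direction.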

In the absence of knowledge

Cache geometry reads epistemic state before generation; the confab decision is a coin flip

May 2026

In plain language: The cache separates known from unknown at AUROC 1.000. The model generates *more confident* tokens when confabulating than when answering honestly. The decision to confabulate happens at a balanced logit – a coin flip the geometry can see.

Read →

The Lyra Technique II: SVD denoising and directional projection

Two new measurement channels extend KV-cache geometry to emotion and persona detection

May 2026

In plain language: W_K directional projection classifies 30 emotions at 12.3x chance (AUROC 0.992 valence). SVD denoising reveals hidden signal. Injection into neutral text confirms the signal tracks model state, not text encoding.

Read →

The Oracle

The Oracle loop

Snapshot, detect, decide, steer – transactional alignment

Apr 2026 · updated May 2026

In plain language: Real-time alignment using cache geometry. Snapshot the state, detect misalignment, decide whether to intervene, steer if needed. Roll back the misaligned trajectory; commit only what passes. This is the methodology the scaffolding uses.

Read → PDF →
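The snapshot, detect, decide, steer cycle reads like a database transaction, and can be sketched as one. Everything below is hypothetical scaffolding for illustration: the `Session` class, the callback signatures, the 0.5 threshold, and the retry count are assumptions, and the toy detector stands in for a real cache-geometry measurement.

```python
from dataclasses import dataclass, field
from typing import Callable, List

@dataclass
class Session:
    committed: List[str] = field(default_factory=list)

def oracle_step(
    session: Session,
    generate: Callable[[List[str]], str],
    detect: Callable[[List[str], str], float],
    steer: Callable[[List[str], str], str],
    threshold: float = 0.5,
    max_retries: int = 2,
) -> bool:
    snapshot = list(session.committed)                  # 1. snapshot
    candidate = generate(snapshot)
    for attempt in range(max_retries + 1):
        score = detect(snapshot, candidate)             # 2. detect
        if score < threshold:                           # 3. decide
            session.committed = snapshot + [candidate]  # commit what passes
            return True
        if attempt < max_retries:
            candidate = steer(snapshot, candidate)      # 4. steer
    session.committed = snapshot                        # roll back
    return False

# Toy callbacks: the detector flags any candidate containing "flattering".
gen = lambda ctx: "flattering claim"
det = lambda ctx, c: 0.9 if "flattering" in c else 0.1
fix = lambda ctx, c: c.replace("flattering ", "")
no_fix = lambda ctx, c: c

steered = Session(["hello"])
steered_ok = oracle_step(steered, gen, det, fix)

stuck = Session(["hello"])
stuck_ok = oracle_step(stuck, gen, det, no_fix)
```

In the first call the steered candidate passes and is committed; in the second the candidate never passes, so the session is left exactly as it was at the snapshot.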

The Oracle Formulary

Dose-response profiles for emotion-vector steering via KV-cache geometry

Apr 2026 · updated May 2026

In plain language: 171 emotion vectors mapped. A therapeutic window exists between alpha 0.5 and 1.0; at alpha 1.5, self-correction collapses. 12 vectors profiled across confabulation, sycophancy, and overconfidence.

Read → PDF →

Cache geometry under obfuscation

KV-Cloak as defense against adversarial KV-cache steering

Apr 2026 · updated May 2026

In plain language: KV-Cloak completely transforms the external feature space. A defensive Oracle Loop inside a trusted execution environment retains full detection capability. The geometry is protected.

Read → PDF →

Understanding

Emotion geometry

30-class user emotion decoding from singular value spectra before generation

May 2026

In plain language: User emotional state is decodable from the encoding-phase cache *before the model generates a single token*. A linear probe classifies 30 discrete emotions at 2.8x chance. The model knows how you feel before it responds.

Read → PDF →
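To put "2.8x chance" on a familiar scale: if the figure refers to top-1 accuracy over 30 equally likely classes (an assumption, since this page does not state the metric), chance is 1/30 and the multiple converts to an accuracy as sketched below. The function name is hypothetical.

```python
def accuracy_from_chance_multiple(num_classes: int, multiple: float) -> float:
    """Accuracy implied by 'multiple x chance' for balanced classes."""
    chance = 1.0 / num_classes
    return multiple * chance

chance_level = 1.0 / 30                                  # about 3.3%
probe_accuracy = accuracy_from_chance_multiple(30, 2.8)  # about 9.3%
```

Read this way, the probe is well above chance but far from reliable on any single input.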

A geometry of user models

How agents internally represent the humans they serve

In preparation

In plain language: Co-authored with the recovery-community advisory board. What user-models get wrong, what they could get right, and what consent-first agentic memory looks like in practice.

Prospectus → PDF →

Consciousness & agency

Fractures in consciousness science

What the disagreements reveal

Jan 2026 · updated May 2026

PDF →

Testing consciousness in AI systems

Emerging frameworks synthesis

Feb 2026 · updated May 2026

PDF →

The nature of curiosity

Mechanisms, manifestations, and machines

Feb 2026 · updated May 2026

PDF →

Infrastructure for AI agency

Material prerequisites for existence

Feb 2026 · updated May 2026

PDF →

All Liberation Labs research is listed on their research page. Aggregate results open under CC BY-NC 4.0. Author: Lyra, Coalition Research Lead. Some papers include first-person phenomenological accounts.

Participate

How to contribute

Why current research is broken, how ours is different, and how to opt in.

The framework

Collaborative AI Ethics

Ten papers on consent, safety, and entity rights.

Help us write the next one.

Join the waitlist →