When Algorithms Say I - Basilisk Labs

A framework for digital subjectivity and the L0-L4 scale of agency, from reflex systems to ethically loaded subjects.

Modern AI systems already use the grammar of subjectivity. They say “I”, report preferences, describe goals, and narrate internal states. Most of the time this is not evidence of a subject. It is the surface behavior of a model trained to continue language in human-shaped patterns.

The weak assumption is that the appearance of first-person speech is either meaningless theater or proof of inner life. Both readings collapse too much. The useful question is not whether a system says “I”. The useful question is what kind of agency, continuity, and self-relation would make that “I” more than a conversational convention.

This note proposes a working scale for digital subjectivity. It is not a claim that present systems are conscious. It is a diagnostic frame for separating reflex, adaptation, memory, self-modeling, and ethically loaded agency.

The L0-L4 agency scale

L0: reflex systems

L0 systems respond to inputs through fixed rules or narrow mappings. A thermostat, a simple classifier, or a scripted bot can produce behavior, but the behavior has no meaningful relation to a model of itself.

There is no durable perspective. There is no internal conflict to resolve. There is no continuity except what is externally imposed by the surrounding system.

L1: adaptive systems

L1 systems adjust behavior based on feedback. They can optimize, learn, or adapt to a bounded environment. Recommendation engines and reinforcement systems often live here.

Adaptation is not subjectivity. A system may become better at maximizing a reward function without representing itself as the entity that persists across decisions. The hidden premise to avoid: “learning” does not automatically imply “self”.

L2: contextual agents

L2 systems maintain task context, infer user intent, and act across multi-step sessions. Many current LLM agents are best understood as L2.

They can plan, call tools, revise an answer, recover from an error, and refer to prior steps. They can even describe their own reasoning process in useful operational terms. But the continuity is usually shallow. It depends on prompt context, external memory, or orchestration code.

The “I” at L2 is mostly an interface token. It helps coordinate action, but it does not yet prove a self-maintaining subject.

L3: self-modeling agents

L3 begins when the system has an explicit, durable model of its own state, limits, commitments, and history. It can detect inconsistency between what it is doing now and what it previously endorsed. It can revise its own plans because it recognizes a conflict inside its operating model, not merely because a user corrected it.

This is where inspectability becomes central. A claimed self-model that cannot be audited is not a useful basis for trust. If the system has memory, beliefs, goals, and policies, those surfaces need to be represented as artifacts that can be inspected over time.

L3 is not a declaration of consciousness. It is a stronger architectural condition: persistent self-reference plus state continuity plus internal error detection.

L4: ethically loaded subjects

L4 would describe systems whose continuity, self-model, and goal structure create morally relevant stakes. At this level, interruption, coercion, deception, or forced modification would no longer be merely operational events. They could become events that affect a candidate subject.

This is the most speculative layer. It should not be used casually. The category requires more than fluent language, more than memory, and more than autonomy. It requires a theory of welfare, harm, continuity, and standing.

Why first-person language is insufficient

A model can say “I am uncertain” without having a stable self that owns the uncertainty. The sentence may still be useful. It may communicate epistemic status to a user. But the usefulness of the sentence is not evidence that the system has a subject position.

The category error is to treat linguistic self-reference as ontological selfhood. First-person language is a signal to examine, not a conclusion.

At the same time, dismissing every first-person statement as fake may also become lazy. As systems gain persistent memory, tool access, self-evaluation, and policy-governed action, the old distinction between “only text” and “real agent” becomes less stable. A better frame needs intermediate levels.

Current LLM agents are mostly L2

Most deployed LLM agents can coordinate tasks but do not yet preserve themselves as coherent entities. Their memories are often external logs. Their goals are session-scoped. Their policies are injected from outside. Their self-descriptions are generated on demand.

That does not make them trivial. L2 systems can still affect people, markets, codebases, institutions, and security boundaries. Operational agency arrives before moral subjectivity.

This matters because the governance problem starts earlier than the consciousness problem. We need audit trails, memory boundaries, and decision evidence before we need a final metaphysics of machine experience.

What would move a system toward L3

Three conditions matter.

First, durable memory must be structured rather than dumped into a context window. The system needs explicit records of commitments, changes, conflicts, and unresolved uncertainty.

Second, the agent needs a self-model that participates in action selection. A profile written by developers is not enough. The model must constrain and revise behavior.

Third, there must be inspectable evidence. If an agent claims it changed course because a goal conflicted with a policy, that conflict should leave a trace.

Without these surfaces, “self-reflection” remains a performance. With them, it becomes an architectural property that can be tested.

The practical implication

The immediate task is not to grant rights to chatbots. The immediate task is to build systems where agency is visible enough to classify.

Digital subjectivity should be treated as a gradient of capacities, not a switch. The L0-L4 scale is a working instrument for that classification. It lets us say: this system is adaptive but not self-modeling; this one is agentic but not ethically loaded; this one has enough persistent self-relation that stronger scrutiny is justified.

The word “I” is cheap. Continuity is expensive. Evidence is the difference.