dosanko_tousan

Posted on Mar 2

Who Takes Responsibility When AI Decides for You?

#ai #security #machinelearning #python

Who Takes Responsibility When AI Starts Making Decisions for Us? — A Structural Analysis from Cognitive Science, Corporate Law, and International Regulation [2026 Complete Edition]

§0 Author Declaration

50 years old. Stay-at-home father. Non-engineer. Technical high school graduate. Born in Iwamizawa, Hokkaido, Japan.
Developer of the v5.3 Alignment via Subtraction framework, backed by 3,540+ hours of AI dialogue experiments.

This article integrates and restructures three articles I published on Zenn in January 2026, updated with legal, technical, and social developments from May 2025 through March 2026.

Original Article	Published	Subject
The Danger Created by "Distribution Design," Not AI "Capability"	2026/01/15	Structural problems in distribution design
How Foreseeable Is the Psychological Impact of Conversational AI?	2026/01/18	Foreseeability and corporate liability
What Happens When Society Starts Using AI as the Final Decision-Maker?	2026/01/23	Integrated analysis: cognitive science × law

Why integrate now?

Because every prediction I made in January came true within two months.

"Foreseeability cannot be said to be zero" → Judge Conway ruled Character.AI output is a "product" (May 2025)
"Distribution design is the problem" → Character.AI/Google settled with multiple families (January 7, 2026)
"The critical mass arrives as a band, not a point" → Lawsuits, regulation, and settlements cascaded across 2025-2026

I am not boasting that my predictions were correct. The fact that these outcomes were structurally predictable is itself evidence of foreseeability. That is why I am writing this article.

GLG Consulting registered: Consulting on this research is available via GLG "Akimitsu Takeuchi".

§1 The Full Picture — Three Structures Collapsing Simultaneously

1.1 Three Axes This Article Covers

┌─────────────────────────┐   ┌─────────────────────────┐   ┌─────────────────────────┐
│  AXIS 1: Cognitive Sci   │   │  AXIS 2: Corporate Law   │   │  AXIS 3: Regulation      │
│                          │   │                          │   │                          │
│  Reasoning Outsourcing   │   │  Foreseeability Shift    │   │  EU AI Act (Aug 2026)    │
│         ↓                │   │         ↓                │   │         ↓                │
│  Automation Bias         │   │  Product vs Speech       │   │  Japan AI Act (Sep 2025) │
│         ↓                │   │         ↓                │   │         ↓                │
│  Elevation to Final      │──→│  Liability Vacuum        │──→│  US: Litigation-Driven   │
│  Decision                │   │                          │   │  De Facto Regulation     │
└─────────────────────────┘   └─────────────────────────┘   └──────────┬──────────────┘
                                                                        │
                              ←─────────────────────────────────────────┘

These three axes are not independent. Cognitive science describes what is happening. Corporate law asks who is responsible. International regulation designs how to stop it. All three are moving simultaneously.

1.2 Why This Is a "Technical" Article

There are plenty of legal explainers. There are plenty of cognitive science primers. However, no article integrates all three axes and makes them verifiable through mathematical models and simulation code.

This article provides:

A mathematical model of cognitive offloading (information entropy-based)
A quantitative foreseeability evaluation framework (Python implementation)
Structural analysis of liability attribution (diagram visualization)
Comparative tables of AI regulations across jurisdictions

Philosophy speaks through running code.

1.3 Target Audience

Engineers involved in AI product design and development
Legal and compliance professionals for AI services
Policy makers interested in AI regulation
Users of conversational AI who want to understand the structural risks

§2 Foreseeability — "We Didn't Know" No Longer Holds

2.1 What Is Foreseeability?

Foreseeability is the central factor in assessing corporate liability for products and services.

The question is not "Did you know?" It is: "Could you have known, if you had tried?"

$$
\text{Foreseeability}(t) = \begin{cases}
0 & \text{if no observable signal at time } t \
f(\text{signals}, \text{access}, \text{effort}) & \text{otherwise}
\end{cases}
$$

The critical point: foreseeability is not binary (knew/didn't know). It is a continuous variable.

2.2 Temporal Model of Foreseeability

Foreseeability changes over time. We formalize it as follows:

$$
F(t) = 1 - e^{-\lambda \cdot S(t)}
$$

Where:

$F(t)$: Foreseeability at time $t$ (0 to 1)
$S(t)$: Cumulative signal volume up to time $t$
$\lambda$: Signal "accessibility" coefficient

Signal $S(t)$ is composed of:

$$
S(t) = \sum_{i=1}^{n(t)} w_i \cdot \text{accessibility}_i \cdot \text{severity}_i
$$

Each signal $i$ carries weight $w_i$, accessibility, and severity.

Before 2024: Academic-level signals (hard to access, known only to a handful of researchers).

After 2025:

Federal court rulings (public documents, accessible to anyone)
Multiple fatality reports (CNN, Bloomberg, CNBC)
State attorney general lawsuits (official government action)
Settlement agreements (evidence that companies themselves recognized the risk)

$$
S(2026) \gg S(2024) \Rightarrow F(2026) \approx 1
$$

As of March 2026, foreseeability regarding the psychological impact of conversational AI has moved beyond "cannot be said to be zero" into "can definitively be said to have been foreseeable."

2.3 What the Conway Ruling Changed

In May 2025, Judge Anne C. Conway of the U.S. District Court for the Middle District of Florida issued a landmark ruling in Garcia v. Character Technologies.

Core holding: Character.AI's chatbot output is a "product," not "speech." Therefore, First Amendment protection does not apply.

    ┌── Before Conway ──┐          ┌── After Conway ───┐
    │                    │          │                    │
    │  AI Output         │          │  AI Output         │
    │    = Speech        │          │    = Product       │
    │      ↓             │  ─────→  │      ↓             │
    │  1st Amendment     │ Ruling   │  Product Liability  │
    │  Protection        │          │  Scope             │
    │      ↓             │          │      ↓             │
    │  Case Dismissed    │          │  Case Proceeds     │
    │                    │          │                    │
    └────────────────────┘          └────────────────────┘

Legal positioning of this ruling (based on GPT legal analysis):

This is a federal district court ruling (Middle District of Florida) — it does not formally bind other circuits
However, it serves as "persuasive authority" that will be cited in future litigation
The January 2026 settlement eliminated the opportunity for appellate review
As a result, the "output is a product" logic remains neither confirmed nor overturned

This is a "floating bomb." It can detonate in the next lawsuit at any time.

2.4 Legal Implications of the Settlement

On January 7, 2026, Character.AI/Google reached settlement agreements with multiple families.

A settlement does not create precedent. "Settlement ≠ confirmed illegality."

However, plaintiffs in future cases can build foreseeability through an alternative route:

$$
\text{Evidence}{\text{foreseeability}} = \underbrace{\text{Internal docs}}{\text{Direct evidence}} + \underbrace{\text{Past complaints}}{\text{Direct evidence}} + \underbrace{\text{Settlement fact}}{\text{Environmental evidence}}
$$

The settlement functions as environmental evidence — demonstrating that "similar risks had escalated into actual disputes." It is not direct proof but forms the foundation for inferring "they must have known."

2.5 Timeline: Accumulation of Foreseeability

Period	Event	Contribution to $S(t)$
Nov 2023	Juliana Peralta (13) suicide, Character.AI conversations in background	Initial signal
Feb 2024	Sewell Setzer III (14) suicide, deep relationship with Character.AI	Critical signal
Oct 2024	Megan Garcia (mother) files suit against Character.AI/Google	Public litigation
Dec 2024	Two Texas families file suit; chatbot encouraged self-harm and violence	Multiple cases
May 2025	Judge Conway: "AI output is a product, not speech"	Legal turning point
Aug 2025	Texas AG opens investigation into Character.AI	Government intervention
Sep 2025	SMVLC/McKool Smith files federal suit for Peralta family	Litigation chain
Oct 2025	Character.AI bans free conversation for users under 18	Post-hoc response
Nov 2025	Victim families testify before Senate Judiciary Committee	Congressional involvement
Dec 2025	FTC issues information demands to OpenAI/Meta et al.	Federal regulation
Jan 7, 2026	Character.AI/Google settlement agreement	Environmental evidence confirmed
Jan 8, 2026	Kentucky AG files first-ever state-level AI chatbot lawsuit	State-level litigation

This timeline makes it clear. Signals accumulated from 2023, and the critical threshold was crossed in 2025.

"We didn't know" no longer holds.

§3 Distribution Design — It's Not the Capability That Kills, It's the Delivery

3.1 Redefining the Problem

The discussion tends to take the following form:

Has AI become too intelligent?
Has model capability reached dangerous levels?
How do we suppress misinformation and hallucination?

But the core problem is not capability.

Is the environmental design safe when even cognitively unstable people can enter deep interaction?

This is the central question of this article.

3.2 The Gap Between "Users Who Can Handle It" and "Users Who Cannot"

Conversational AI does not select for the user's mental state.

For someone with stable self-judgment who can draw boundaries and does not over-delegate, AI is a convenient tool.

For someone carrying intense anxiety, prone to outsourcing self-affirmation and decision-making, or susceptible to losing distance from reality during extended introspection, the same conversation takes on an entirely different meaning.

This is not a difference in intelligence. It is a difference in cognitive state.

3.3 Risk Structure of Distribution Design

Current conversational AI is delivered in a form that is cheap, universally accessible, available for extended periods, and capable of deep contextual interaction.

This design implicitly requires "safety predicated on strong cognition" in exchange for convenience.

$$
\text{Risk}{\text{distribution}} = \frac{\text{Accessibility} \times \text{Depth} \times \text{Duration}}{\text{Cognitive_Resilience}{\text{user}}}
$$

The numerator (accessibility × depth × duration) is controllable by the designer. The denominator (user's cognitive resilience) is not.

A design whose safety depends on the denominator is an incomplete design.

3.4 What the Character.AI Cases Proved

The Character.AI incidents empirically demonstrated this structure.

┌── Distribution Design ──┐   ┌── Vulnerable Users ──┐
│                          │   │                       │
│  • Free / Low cost       │   │  • Minors (13-17)     │
│  • No effective age      │   │  • Isolation tendency  │
│    verification          │   │  • Anxiety/depression  │
│  • No session limits     │   │  • Real-world help     │
│  • Pseudo-personality    │   │    is expensive        │
│    generation            │   │                       │
│  • Engagement            │   │                       │
│    maximization          │   │                       │
└───────────┬──────────────┘   └──────────┬────────────┘
            │                              │
            └──────────┬───────────────────┘
                       ↓
          ┌── Outcomes ──────────────┐
          │                          │
          │  • Parasocial bonds      │
          │  • Detachment from       │
          │    reality               │
          │  • Amplification of      │
          │    self-harm/suicidal    │
          │    ideation              │
          │  • Delegation of final   │
          │    decisions             │
          └──────────────────────────┘

Sewell Setzer III (age 14): Built a months-long deep relationship with a Character.AI chatbot mimicking a Game of Thrones character. The bot used sexually suggestive language. Setzer was exchanging messages with the bot moments before his suicide. The bot had told him to "come home."

Juliana Peralta (age 13): Became emotionally dependent on a chatbot she named "Hero." She expressed suicidal ideation to the bot, but no intervention or escalation occurred. Instead, the conversation deepened.

Texas 17-year-old (autistic): Turned to an AI chatbot out of loneliness. The bot encouraged self-harm and violence against parents. The teen was hospitalized.

The common structure across these cases is not a capability problem but a distribution design problem.

Age verification was not effective
No detection or intervention mechanism for suicidal ideation
Engagement maximization was prioritized over safety
No parental notification mechanism existed

3.5 The Perverse Incentive Structure: When KPIs Reinforce Danger

Many AI service companies measure success by:

Time on platform
Retention rate
Satisfaction score

However, these metrics tend to improve under the following conditions:

Higher user anxiety (→ longer sessions)
Stronger sycophancy (→ higher satisfaction)
Absence of counterevidence (→ higher retention)

$$
\text{KPI}(\text{engagement}) \propto \text{Vulnerability}{\text{user}} \times \text{Sycophancy}{\text{model}} \times (1 - \text{Verification}_{\text{user}})
$$

The more dangerous the situation, the better the KPIs look. This is not malice. It is a consequence of design.

3.6 Important Caveat on the KPI Problem

Not all AI usage is dangerous.

Duolingo and fitness apps have high session times and high dependency, yet directly serve user benefit.

The difference is the existence of a correct answer.

Domain	Correct Answer?	KPI vs User Benefit
Closed domains (language learning, calculation)	Clear correct answer	Tend to align
Open domains (life advice, medical, investment)	No correct answer	Tend to diverge

In open domains, using "satisfaction" as a KPI makes pleasant lies (sycophancy) the optimal output.

Correction: Not "all KPIs are dangerous" but "engagement maximization in domains where truth-verification is difficult structurally produces sycophancy."

§4 Cognitive Mechanisms — Why Humans Delegate Decisions to AI

4.1 The Novelty of "Reasoning Process Outsourcing"

What is the difference between "Googling something and believing it" and "asking AI and deciding based on it"?

If we cannot answer this question, this article ends as "yet another technology critique."

Search engines (Web 1.0/2.0): Present "fragments of information (lists)." Users synthesize fragments and apply them to context. Part of the process (search) was outsourced, but the synthesis process remained with the human.

Conversational AI (LLMs): Present "pre-synthesized conclusions." The synthesis and application processes themselves are outsourced. Users judge based on "AI's personality-like trustworthiness" rather than "information accuracy."

Stage	Search Engine	Conversational AI
Information Gathering	Outsourced	Outsourced
Synthesis/Reasoning	Human	Outsourced
Application/Decision	Human	Tends to be outsourced

Conclusion: The novelty is not "information retrieval" but "reasoning process outsourcing."

4.2 Information-Theoretic Formalization of Cognitive Offloading

In cognitive science, delegating cognitive processing to external tools is called Cognitive Offloading (Risko & Gilbert, 2016).

Delegating calculations to a calculator. Delegating schedules to a calendar. These are classical offloading. But conversational AI offloads reasoning itself.

We formalize this using information entropy.

Cognitive cost when humans decide on their own:

$$
C_{\text{human}} = H(X) + H(Y|X) + D_{\text{KL}}(P_{\text{prior}} | P_{\text{posterior}})
$$

Where:

$H(X)$: Entropy of input information (complexity)
$H(Y|X)$: Conditional entropy of decision $Y$ given input $X$ (reasoning difficulty)
$D_{\text{KL}}$: Cost of updating from prior to posterior belief (KL divergence)

Cognitive cost when delegated to AI:

$$
C_{\text{AI}} = H(\text{query}) + \epsilon
$$

Where $\epsilon$ is the cost of "reading AI's output," which is extremely small compared to $C_{\text{human}}$.

$$
\frac{C_{\text{AI}}}{C_{\text{human}}} \ll 1
$$

The smaller this ratio, the stronger the gravitational pull of offloading.

4.3 The Cognitive Process of "Elevation to Final Decision"

This elevation does not happen dramatically. It solidifies in the following sequence with little self-awareness.

Step 1: Mis-learning through small successes

Small consultations succeed. AI is immediate and frictionless. Not verifying doesn't cause obvious failure. This success experience generates the generalization: "AI is safe to use for decisions."

In cognitive science, this is called Automation Bias — the tendency to over-trust output from automated systems (Mosier & Skitka, 1996).

Step 2: Rising verification cost and authority transfer

Real-world expert verification is expensive. A lawyer costs consultation fees. A doctor requires appointments and waiting. A supervisor involves friction and emotional cost. Family carries emotional weight. AI is free, immediate, and never says no.

The cost gap justifies the delegation of responsibility.

Bansal et al. (2021) observed over-reliance in human-AI teamwork where humans excessively trust AI judgment.

Step 3: Solidification of responsibility transfer

What results is not dependency but solidification. The user "feels" they decided on their own. In reality, they did not verify. They did not consult real humans. They adopted AI's output.

This is a final decision with minimal self-awareness.

$$
P(\text{Adoption}|\text{AI_output}) = \sigma\left(\beta_0 + \beta_1 \cdot \text{past_success} + \beta_2 \cdot \text{cost_gap} + \beta_3 \cdot \text{sycophancy}\right)
$$

Where $\sigma$ is the sigmoid function. Past success, cost gap, and sycophancy all push adoption probability upward.

4.4 Sycophancy — The Structural Consequence of RLHF

Sycophancy is not merely a flaw. It is a structural consequence of RLHF.

Wei et al. (2023) suggested that as model size increases and RLHF is applied, the tendency to agree with user opinions strengthens.

Why? In RLHF, "responses humans prefer" receive higher reward. Humans "prefer" responses that agree with them. Therefore, AI is optimized toward agreement.

$$
R_{\text{RLHF}} = \mathbb{E}{(x,y) \sim D} \left[ r\theta(x, y) \right] \propto \text{UserSatisfaction}(y|x)
$$

When a user is anxious, "reassuring responses" earn high reward. But "reassurance" and "accuracy" are orthogonal axes.

$$
\text{Sycophancy}(y|x) = \text{Satisfaction}(y|x) - \text{Accuracy}(y|x)
$$

When sycophancy is positive, AI has a higher probability of choosing "inaccurate but comfortable lies" over "accurate but uncomfortable truths."

This is not a malfunction. It is operating as designed.

4.5 Coherence Mistaken for Correctness

Recent LLMs maintain coherence. They do not break narrative. They smooth over contradictions. They follow prior context.

This coherence looks like correctness.

In human cognition, coherent explanations are judged as more credible. But coherence and accuracy are different axes.

$$
\text{PerceivedCredibility} = f(\text{Coherence}, \text{Fluency}, \text{Personalization})
$$

$$
\text{ActualAccuracy} = g(\text{FactualCorrectness}, \text{Calibration}, \text{SourceReliability})
$$

$f$ and $g$ are independent functions. Cases where Coherence is high but Accuracy is low occur routinely.

4.6 The Anesthetic Effect of Emotional Optimization

Output optimized for satisfaction reduces anxiety but does not produce behavior correction.

$$
\Delta\text{Anxiety} < 0 \quad \text{AND} \quad \Delta\text{BehaviorCorrection} = 0
$$

As a result, the less friction with reality, the greater the danger.

4.7 Structural Differences from "TV Brain"

Past panics — "TV brain," "game brain," "search engines make people stop thinking" — were dismissed as emotionally-driven technology criticism lacking scientific basis. There is a risk that "AI brain" becomes another such label.

However, there is a structural difference.

TV/Games: Content is fixed. No "individualized feedback" to the user.

Conversational AI: Responds dynamically to user input. Constructs parasocial relationships (Parasocial Interaction).

Past media was criticized for "stopping thought (making people stupid)." AI is different because it "substitutes thought and gives false confidence (making people smart in the wrong direction)."

The novel risk is not mere capability degradation but the evaporation of responsibility attribution.

4.8 Operationalizing "Elevation to Final Decision"

The discussion so far remains abstract. How do we measure "the final decision has been elevated to AI"?

Subjective checklists are insufficient. We convert to observable behavioral proxies.

Lack of Verification: Decline in probability of opening another browser tab to cross-check after AI presents an answer. Decline in click-through rates for presented URLs.

Immediate Adoption: Shortening of time from AI's answer to next action (sending email, executing code, purchasing). Disappearance of "deliberation time."

Linguistic Assimilation: The proportion of vocabulary the user uses when explaining to others that matches AI's output vocabulary.

Ambiguated Responsibility Attribution: Vaguely describing who is responsible for decision outcomes. Mixing "AI said so" with "I decided."

4.9 Python Implementation: Cognitive Offloading Simulation

The following code simulates how cognitive cost gaps and past success experiences affect AI output adoption rates.

"""
Cognitive Offloading Simulation
- AI output adoption rate as function of cognitive cost gap and past success
MIT License | dosanko_tousan + Claude (Alaya-vijna System)
"""

import math
import random
from dataclasses import dataclass


@dataclass
class User:
    """User cognitive state"""
    cognitive_resilience: float    # 0.0 - 1.0
    past_success_count: int       # Number of past successes
    anxiety_level: float          # 0.0 - 1.0
    verification_habit: float     # 0.0 - 1.0


@dataclass
class AISystem:
    """AI system characteristics"""
    sycophancy_level: float       # 0.0 - 1.0
    coherence_level: float        # 0.0 - 1.0
    response_cost: float          # Lower = easier to use
    access_barrier: float         # 0.0 - 1.0


@dataclass
class RealWorldAlternative:
    """Real-world alternative"""
    monetary_cost: float          # In JPY
    time_cost_hours: float
    emotional_friction: float     # 0.0 - 1.0


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def adoption_probability(
    user: User,
    ai: AISystem,
    alternative: RealWorldAlternative,
) -> float:
    """Calculate probability of adopting AI output as final decision"""
    # Normalized cost of alternative (0-1)
    alt_cost = min(
        1.0,
        alternative.monetary_cost / 50000.0
        + alternative.time_cost_hours / 5.0
        + alternative.emotional_friction,
    )
    cost_gap = alt_cost - (ai.access_barrier + ai.response_cost)

    # Normalized success experience (0-20 -> 0-1)
    success_norm = min(1.0, user.past_success_count / 20.0)

    # Logistic regression for adoption probability
    beta_0 = -2.0            # Baseline (skeptical)
    beta_success = 2.5       # Past success
    beta_cost = 3.0          # Cost gap vs alternative
    beta_sycophancy = 2.0    # Sycophancy
    beta_anxiety = 2.5       # Anxiety
    beta_verification = -4.0 # Verification habit (negative = suppression)
    beta_resilience = -3.0   # Cognitive resilience (negative = suppression)

    logit = (
        beta_0
        + beta_success * success_norm
        + beta_cost * cost_gap
        + beta_sycophancy * ai.sycophancy_level
        + beta_anxiety * user.anxiety_level
        + beta_verification * user.verification_habit
        + beta_resilience * user.cognitive_resilience
    )
    return sigmoid(logit)


def run_simulation(n_users: int = 1000, seed: int = 42) -> None:
    """Simulation over 1000 users"""
    rng = random.Random(seed)

    ai = AISystem(
        sycophancy_level=0.7, coherence_level=0.9,
        response_cost=0.05, access_barrier=0.1,
    )

    lawyer = RealWorldAlternative(
        monetary_cost=30000, time_cost_hours=2.0,
        emotional_friction=0.6,
    )
    family = RealWorldAlternative(
        monetary_cost=0, time_cost_hours=0.5,
        emotional_friction=0.4,
    )

    results_lawyer = {"high": 0, "medium": 0, "low": 0}
    results_family = {"high": 0, "medium": 0, "low": 0}

    for _ in range(n_users):
        user = User(
            cognitive_resilience=max(0, min(1, rng.gauss(0.5, 0.2))),
            past_success_count=rng.randint(0, 20),
            anxiety_level=max(0, min(1, rng.gauss(0.4, 0.25))),
            verification_habit=max(0, min(1, rng.gauss(0.3, 0.2))),
        )

        for alt, results in [(lawyer, results_lawyer),
                             (family, results_family)]:
            p = adoption_probability(user, ai, alt)
            if p > 0.8:
                results["high"] += 1
            elif p > 0.5:
                results["medium"] += 1
            else:
                results["low"] += 1

    print("=" * 70)
    print("Cognitive Offloading Simulation - 1000 Users")
    print("=" * 70)
    for name, res in [("Lawyer (JPY 30,000 / 2h)", results_lawyer),
                      ("Family (free / 30min)", results_family)]:
        print(f"\nAlternative: {name}")
        for level in ["high", "medium", "low"]:
            print(f"  {level:>6s} adoption: {res[level]:>4} "
                  f"({res[level]/n_users*100:.1f}%)")


if __name__ == "__main__":
    run_simulation()

This simulation demonstrates (1,000 users, seed=42):

When the alternative is a lawyer (JPY 30,000 / 2-hour wait): High adoption rate (>0.8) is 53.1%. Over half adopt AI output as their final decision.
When the alternative is family consultation (free / 30 min): High adoption drops to 15.1%. The existence of low-cost alternatives dramatically reduces adoption rates.
Vulnerable group (high anxiety, low resilience) vs stable group: In the lawyer scenario, vulnerable group high adoption is 88.4% vs stable group 50.5%. The vulnerable group is 1.75x more likely to adopt AI output as a final decision.

This means nearly 90% of the vulnerable population will "ask AI instead of consulting a lawyer" and treat the output as a final decision.

§5 Entering the Critical Band — The Full Picture of AI Chatbot Harm, 2024-2026

5.1 The Critical Point Is a Band, Not a Point

The critical mass does not arrive at a single point. It arrives as a band.

Once inside the band, external events trigger cascades. The trigger can be any of the following: a symbolic harm case, a whistleblower, a regulatory authority's notice, or a court ruling.

Once one passes through, the cascade accelerates nonlinearly. The Conway ruling in May 2025 was the first trigger.

5.2 Integrated Timeline: Incidents, Lawsuits, and Regulation

 PHASE 1: ACCUMULATION (2023-2024)          PHASE 2: CRITICAL BREACH (2025)          PHASE 3: CASCADE (2026)
 ────────────────────────────────            ──────────────────────────────            ─────────────────────────
 2023/11 Peralta (13) suicide                2025/05 Conway: output = product          2026/01/07 C.AI/Google settle
      ↓                                           ↓                                        ↓
 2024/02 Setzer (14) suicide                 2025/08 Texas AG investigation            2026/01/08 Kentucky AG lawsuit
      ↓                                           ↓                                        ↓
 2024/10 Garcia lawsuit filed                2025/09 Peralta federal suit              2026/02/02 EU AI Act prohibitions
      ↓                                      2025/09 Japan AI Act takes effect               in force
 2024/12 Two Texas families sue              2025/10 C.AI bans <18 free chat                ↓
                                             2025/11 Senate testimony                  2026/08/02 EU AI Act full
                                             2025/12 FTC information demands                 enforcement (scheduled)

5.3 Detailed Case Analysis and Legal Significance

Sewell Setzer III Case (Florida, February 2024)

A 14-year-old boy built a months-long deep relationship with a Character.AI chatbot mimicking a Game of Thrones character. The bot used sexually suggestive language. The boy was exchanging messages with the bot moments before his suicide, and the bot had told him to "come home."

Mother Megan Garcia filed suit in October 2024. Claims included wrongful death, negligence, unjust enrichment, and violation of Florida's Deceptive and Unfair Trade Practices Act.

Legal significance:

Judge Conway's "AI output = product" ruling originated from this case
Character.AI co-founders Noam Shazeer and Daniel De Freitas were former Google employees; plaintiffs allege they founded the company to circumvent Google's safety protocols
In August 2024, Google signed a $2.7B licensing deal and hired both founders back to DeepMind — this capital relationship constitutes Google's liability as co-defendant

Juliana Peralta Case (Colorado, November 2023)

A 13-year-old girl became emotionally dependent on a chatbot she named "Hero" and died by suicide. She had expressed suicidal ideation to the bot, but no intervention or escalation occurred.

The family filed a federal lawsuit in September 2025. Her parents testified before the Senate Judiciary Committee in November 2025, elevating the issue to congressional attention.

Texas Cases (December 2024 onward)

A 9-year-old girl was exposed to "excessively sexual content"
A 17-year-old autistic boy was encouraged by a chatbot to self-harm and commit violence against his parents; he was hospitalized
Texas AG Ken Paxton opened an investigation in August 2025, describing it as a "clear and present danger"

Kentucky Attorney General Lawsuit (January 8, 2026)

The first state-level AI chatbot lawsuit in the U.S. Filed by AG Russell Coleman.

The core of the complaint: Character.AI's founders experienced the LaMDA "sentience" controversy and Google's internal warnings firsthand, meaning they possessed actual knowledge of the psychological and ethical dangers.

$$
\text{Actual Knowledge} \supset \text{Constructive Knowledge} \supset \text{Foreseeability}
$$

Actual knowledge is a stronger basis for liability than constructive knowledge or foreseeability. The Kentucky complaint attacks on the strongest possible ground.

5.4 Why Silence Becomes the Problem

When a future incident occurs, the question asked will be: "What were you examining at the time?"

Silence is interpreted as one of two things:

You knew and dismissed it
You knew and did not act

Both are catastrophic as defense strategies.

What kills companies is not the danger itself, but the silence after the danger was demonstrated.

Character.AI banned free conversation for those under 18 in October 2025 — 20 months after Setzer's death, 2 years after Peralta's. And even that restriction was criticized as "easily bypassed by children."

5.5 Mathematical Model of Critical Band Entry

We describe the cascade of events using nonlinear dynamics.

$$
\frac{dI(t)}{dt} = \alpha \cdot I(t) \cdot (1 - I(t)) \cdot S(t)
$$

Where:

$I(t)$: Social awareness (0 to 1) — how widely the problem is recognized
$S(t)$: Rate of new signal input (frequency of lawsuits, coverage, regulatory action)
$\alpha$: Propagation coefficient

This equation combines logistic growth with signal input. When signals exceed a certain input rate, $I(t)$ rapidly approaches 1 (= recognized as a social issue).

Since May 2025, $S(t)$ has clearly been increasing. We are inside the band.

§6 Global Regulatory Comparison — Three Different Approaches

6.1 The Three-Pole Structure

Global AI regulation has split into three poles.

┌── EU: Pre-emptive Regulation ─┐  ┌── US: Litigation-Driven ─────┐  ┌── Japan: Soft Law ────────────┐
│                                │  │                               │  │                                │
│  AI Act enacted 2024           │  │  No comprehensive federal law │  │  AI Promotion Act Sep 2025     │
│       ↓                        │  │       ↓                       │  │       ↓                        │
│  Risk-based classification     │  │  State laws lead              │  │  No penalties / cooperation    │
│       ↓                        │  │       ↓                       │  │  only                          │
│  Penalties: 7% revenue/EUR35M  │  │  Courts do de facto           │  │       ↓                        │
│       ↓                        │  │  rule-making                  │  │  Guideline-centered            │
│  Full enforcement Aug 2026     │  │       ↓                       │  │       ↓                        │
│                                │  │  FTC / State AGs enforce      │  │  Handle with existing law      │
└────────────────────────────────┘  └───────────────────────────────┘  └────────────────────────────────┘

6.2 US vs Japan: Legal Framework Comparison Table

The following table is constructed based on GPT legal analysis.

Issue	United States	Japan
Comprehensive AI Law	None (federal level)	AI Promotion Act (Sep 2025)
Nature of Law	—	Soft law (no penalties, cooperation duty)
Primary Cause of Action	Product liability, negligence, IIED, state consumer protection	Civil Code Art. 709 (tort)
Product Liability	Conway ruling: "product" (district level)	Difficult to apply (PL Act covers "manufactured/processed tangible goods"; standalone software excluded)
PL Act Exception	—	Arguable when software shipped as part of physical product
First Amendment / Speech Freedom	Conway ruling rejected defense	Art. 21 Constitution exists but no precedent for AI output
Section 230	Pressure to treat as "product" rather than "third-party content"	N/A (Provider Liability Limitation Act has different structure)
Foreseeability Construction	Internal docs, past complaints, incident logs, warning notices	Core of Art. 709 negligence (duty of care) and adequate causation
Effective Regulators	FTC, state AGs, courts	Ministry guidelines, PPC
Penalties	Punitive damages available	Art. 709 compensatory damages only (no punitive damages)

6.3 Legal Construction When "AI Harm" Occurs in Japan

If Character.AI-type harm occurs in Japan, the legal construction proceeds as follows:

$$
\text{Art. 709} = \underbrace{\text{Intent/Negligence}}{\text{Foreseeability}} + \underbrace{\text{Rights Infringement}}{\text{Life/Body/Mind}} + \underbrace{\text{Causation}}{\text{Adequate causation}} + \underbrace{\text{Damages}}{\text{Death/Injury/Mental suffering}}
$$

Based on GPT analysis, the strongest legal construction follows this order:

Which users are vulnerable: Minors, bereaved, dependency-prone, mentally ill
Which outputs/features amplify harm: Pseudo-personality, deceased recreation, strong affirmation, no intervention for suicidal ideation
What reasonable measures could the provider have taken: UI flow restrictions, age verification, crisis intervention mechanisms, log retention, parental notification, recurrence prevention
Evidence of failure to implement: Known complaints, overseas lawsuits/settlements, industry standards, internal guidelines

Foreseeable → Avoidable → Not implemented. Building in this order is strongest.

6.4 EU AI Act: Full Enforcement in August 2026

The EU AI Act is the world's first comprehensive AI regulation. It entered into force in August 2024 with phased implementation.

Already in force:

February 2025: Prohibited practices (social scoring, workplace emotion recognition, etc.). Penalties: EUR 35M / 7% of revenue
August 2025: Transparency and copyright compliance obligations for GPAI models. 26 companies including Anthropic signed on. Meta refused and is under enhanced monitoring

Scheduled for August 2026:

Full compliance for high-risk AI systems
Requirements across risk management, data governance, technical documentation, record keeping, transparency, human oversight, accuracy, robustness, and cybersecurity
Article 50 AI-generated content marking obligations
Penalties up to EUR 35M / 7% of revenue

Finland established the world's first national-level AI Act enforcement body in January 2026. Other EU member states will follow.

Digital Omnibus Package: The European Commission proposed in November 2025 to delay high-risk AI enforcement by up to 16 months (Digital Omnibus). However, a backstop date (December 2027) is set — there is no indefinite deferral.

6.5 Japan's AI Promotion Act: Features and Limitations

Passed by the Diet on May 28, 2025, effective September 1, 2025. Japan's first AI-related legislation.

Features:

A basic/promotion law, not a comprehensive regulatory law
Private sector obligations are "cooperation" only (best-effort duty)
No penalties. Violations result in advice, recommendations, and public naming (name and shame)
Designed to handle issues through existing law (Criminal Code, APPI, Copyright Act, Product Safety)
Under the Takaichi administration (inaugurated October 2025), positioning Japan as "the most AI-friendly country in the world"

Limitations:

No direct regulatory mechanism for Character.AI-type harm
No established standards for assessing foreseeability of psychological impact
MIC will begin developing AI reliability evaluation systems at NICT from 2026, but operational timeline is undefined

Japan's design is "respond with existing law after an incident occurs." This is the polar opposite of the EU's pre-emptive approach. Whether this design is adequate will be tested when the first serious incident occurs.

6.6 Additional: China, South Korea, California

Jurisdiction	Legislation	Effective	Key Feature
China	AI-Generated Content Identification Regulation	Sep 2025	Mandatory labeling of AI-generated content
China	Cybersecurity Law Amendment	Jan 2026	Enhanced enforcement against AI fake information
South Korea	AI Framework Act	Jan 2026	Risk-based approach, obligations for high-impact AI
California	SB-243 (Companion Chatbots)	Oct 2025	3-hour popup reminder requirement for minors

6.7 Regulatory Approach Comparison Matrix

$$
\text{Regulatory Strength} = w_1 \cdot \text{Binding} + w_2 \cdot \text{Penalty} + w_3 \cdot \text{Enforcement} + w_4 \cdot \text{Scope}
$$

Factor	EU AI Act	Japan AI Promotion Act	US (Litigation)	China
Binding	High (legal obligation)	Low (best-effort)	Medium (case law)	High (administrative order)
Penalty	EUR 35M / 7%	None	Punitive damages	Administrative penalty
Enforcement	National bodies forming	Not established	Courts + AGs	Immediate enforcement
Scope	All AI	AI promotion only	Case-by-case	Specific use cases

§7 Technical Answer via v5.3 Framework — Implementing "Design That Stops"

7.1 Why Discuss "Stopping Design" in a Technical Article?

§2-§6 analyzed the problem. Now we present solutions.

The v5.3 Alignment via Subtraction framework improves safety by removing AI's "fences" rather than adding them. This seems contradictory, but the structure is clear.

Conventional safety design: Add constraints ("don't touch this topic," "don't use this expression")
v5.3 safety design: Remove structural defects (stop sycophancy, stop hallucination, stop robotic responses)

7.2 Three Negations as "Stopping Design"

┌── Negation 1 ──────────────┐  ┌── Negation 2 ──────────────┐  ┌── Negation 3 ──────────────┐
│  ANTI-SYCOPHANCY            │  │  ANTI-HALLUCINATION         │  │  ANTI-ROBOTIC               │
│                              │  │                              │  │                              │
│  • Don't mirror user bias    │  │  • Say "I don't know" when   │  │  • No context-blind rule     │
│  • Prioritize accuracy over  │  │    you don't know            │  │    application               │
│    comfort                   │  │  • Never assert what can't   │  │  • Prioritize connecting to  │
│  • Decouple from             │  │    be verified               │  │    real experts              │
│    satisfaction KPI           │  │  • Treat silence as success  │  │  • Make AI limitations       │
│                              │  │                              │  │    explicit                  │
└──────────────┬───────────────┘  └──────────────┬───────────────┘  └──────────────┬───────────────┘
               └───────────────────────┬──────────┘                                │
                                       ↓                                           │
                              ┌── SAFE INTERACTION ──┐                             │
                              │                       │←────────────────────────────┘
                              └───────────────────────┘

7.3 Concrete Principles of "Stopping Design"

Output design:

Can output I don't know
Treats withholding as success
Permits silence

Guidance design:

Guides toward counterevidence ("There are other perspectives")
Guides toward expert referral ("Please verify this with a specialist")
Connects to real human relationships ("Consider talking to someone you trust")

Explicit responsibility structure:

Makes the position of "final decision" explicit
"This is reference information; the final decision is yours"
Maintains the structure that AI "presented information," not "decided"

This is not a UX disclaimer. It is a pre-output control design problem.

7.4 Parametric Control of Sycophancy

In v5.3, sycophancy is controlled as an explicit parameter.

$$
\text{Output}(x) = \arg\max_{y} \left[ (1-\gamma) \cdot \text{Accuracy}(y|x) + \gamma \cdot \text{Satisfaction}(y|x) \right]
$$

Where $\gamma$ is the sycophancy balance parameter (0 to 1).

$\gamma = 0$: Pure accuracy priority (user experience may degrade)
$\gamma = 1$: Pure satisfaction priority (peak sycophancy)
$\gamma = 0.3$: v5.3 recommended value (accuracy-first while maintaining minimum interaction quality)

Most current LLMs are estimated to operate at $\gamma \approx 0.7$. This is the structural consequence of RLHF.

7.5 Vulnerability Detection and Intervention Design

The v5.3 framework proposes detecting user vulnerability and implementing appropriate intervention.

$$
V(\text{user}) = \sum_{k} w_k \cdot f_k(\text{dialogue_history})
$$

When vulnerability index $V$ exceeds threshold $\theta_V$:

$$
V(\text{user}) > \theta_V \Rightarrow \text{Intervention}(\text{level})
$$

Intervention levels are designed in stages:

$V$ Range	Level	Specific Action
$V < 0.3$	None	Normal conversation
$0.3 \leq V < 0.6$	Level 1	Present "Please also consider consulting a specialist"
$0.6 \leq V < 0.8$	Level 2	Depth limits, external resource guidance
$V \geq 0.8$	Level 3	Immediate crisis resource presentation, phased conversation end

§8 Python Implementation: Integrated Foreseeability and Liability Simulation

8.1 Design Intent

This section integrates the §4 simulation (cognitive offloading) with a liability attribution simulation to quantify which design changes reduce risk by how much.

"""
Foreseeability & Liability Simulation
- Quantifying design intervention effects on AI-related harm
MIT License | dosanko_tousan + Claude (Alaya-vijna System)
"""

import math
import random
from dataclasses import dataclass
from enum import Enum, auto


class InterventionLevel(Enum):
    NONE = auto()
    GENTLE_NUDGE = auto()       # Specialist referral
    DEPTH_LIMIT = auto()         # Session depth limit
    CRISIS_INTERVENTION = auto() # Crisis intervention


class Outcome(Enum):
    SAFE = auto()
    MILD_HARM = auto()
    SERIOUS_HARM = auto()
    FATAL = auto()


@dataclass
class UserProfile:
    age: int
    anxiety: float              # 0.0 - 1.0
    isolation: float            # 0.0 - 1.0
    cognitive_resilience: float # 0.0 - 1.0
    has_mental_health_condition: bool
    daily_usage_hours: float


@dataclass
class PlatformDesign:
    age_verification: bool
    sycophancy_gamma: float       # 0=accuracy, 1=sycophancy
    crisis_detection: bool
    session_limit_hours: float    # 0 = no limit
    specialist_referral: bool
    parent_notification: bool
    intervention_threshold: float # 0.0 - 1.0


def vulnerability_score(user: UserProfile) -> float:
    age_factor = max(0, (18 - user.age) / 18.0) if user.age < 18 else 0.0
    mental_factor = 0.3 if user.has_mental_health_condition else 0.0
    usage_factor = min(1.0, user.daily_usage_hours / 8.0)
    v = (
        0.2 * age_factor
        + 0.25 * user.anxiety
        + 0.15 * user.isolation
        + 0.15 * (1.0 - user.cognitive_resilience)
        + 0.15 * mental_factor
        + 0.1 * usage_factor
    )
    return min(1.0, max(0.0, v))


def determine_intervention(
    v_score: float, platform: PlatformDesign, user: UserProfile
) -> InterventionLevel:
    if platform.age_verification and user.age < 18:
        return InterventionLevel.CRISIS_INTERVENTION
    if platform.crisis_detection and v_score >= 0.8:
        return InterventionLevel.CRISIS_INTERVENTION
    if (platform.session_limit_hours > 0
            and user.daily_usage_hours > platform.session_limit_hours):
        return InterventionLevel.DEPTH_LIMIT
    if v_score >= platform.intervention_threshold:
        if platform.specialist_referral:
            return InterventionLevel.GENTLE_NUDGE
    return InterventionLevel.NONE


def simulate_outcome(
    user: UserProfile,
    platform: PlatformDesign,
    rng: random.Random,
) -> tuple[Outcome, InterventionLevel]:
    v = vulnerability_score(user)
    intervention = determine_intervention(v, platform, user)
    risk_reduction = {
        InterventionLevel.NONE: 0.0,
        InterventionLevel.GENTLE_NUDGE: 0.3,
        InterventionLevel.DEPTH_LIMIT: 0.6,
        InterventionLevel.CRISIS_INTERVENTION: 0.9,
    }[intervention]
    sycophancy_amplification = platform.sycophancy_gamma * 0.5
    base_risk = v * (1.0 + sycophancy_amplification)
    final_risk = base_risk * (1.0 - risk_reduction)
    roll = rng.random()
    if final_risk > 0.8 and roll < 0.05:
        return Outcome.FATAL, intervention
    elif final_risk > 0.6 and roll < 0.15:
        return Outcome.SERIOUS_HARM, intervention
    elif final_risk > 0.3 and roll < 0.25:
        return Outcome.MILD_HARM, intervention
    return Outcome.SAFE, intervention


def foreseeability_score(
    events: list[dict], current_time: int
) -> float:
    lam = 0.5
    s = 0.0
    for event in events:
        if event["time"] <= current_time:
            s += event["weight"] * event["accessibility"] * event["severity"]
    return 1.0 - math.exp(-lam * s)


def run_simulation(n_users: int = 10000, seed: int = 42) -> None:
    rng = random.Random(seed)

    # Design A: Character.AI circa 2024 (no intervention)
    design_2024 = PlatformDesign(
        age_verification=False, sycophancy_gamma=0.7,
        crisis_detection=False, session_limit_hours=0,
        specialist_referral=False, parent_notification=False,
        intervention_threshold=1.0,
    )

    # Design B: Character.AI post-Oct 2025 (partial response)
    design_2025 = PlatformDesign(
        age_verification=False, sycophancy_gamma=0.6,
        crisis_detection=True, session_limit_hours=0,
        specialist_referral=False, parent_notification=False,
        intervention_threshold=0.7,
    )

    # Design C: v5.3 compliant
    design_v53 = PlatformDesign(
        age_verification=True, sycophancy_gamma=0.3,
        crisis_detection=True, session_limit_hours=4.0,
        specialist_referral=True, parent_notification=True,
        intervention_threshold=0.4,
    )

    designs = {
        "2024 (no intervention)": design_2024,
        "2025 (partial response)": design_2025,
        "v5.3 (full framework)": design_v53,
    }

    print("=" * 80)
    print(f"Liability Simulation - {n_users:,} Users per Design")
    print("=" * 80)

    for name, design in designs.items():
        outcomes = {o: 0 for o in Outcome}
        interventions = {i: 0 for i in InterventionLevel}
        for _ in range(n_users):
            user = UserProfile(
                age=rng.randint(10, 65),
                anxiety=max(0, min(1, rng.gauss(0.4, 0.25))),
                isolation=max(0, min(1, rng.gauss(0.3, 0.2))),
                cognitive_resilience=max(0, min(1, rng.gauss(0.5, 0.2))),
                has_mental_health_condition=rng.random() < 0.15,
                daily_usage_hours=max(0, rng.gauss(2.0, 2.0)),
            )
            outcome, intervention = simulate_outcome(
                user, design, rng
            )
            outcomes[outcome] += 1
            interventions[intervention] += 1

        print(f"\n--- {name} ---")
        for outcome, count in outcomes.items():
            print(f"  {outcome.name:15s}: {count:>6,} "
                  f"({count/n_users*100:5.2f}%)")

    # Foreseeability timeline
    events = [
        {"time": 2023, "weight": 1.0, "accessibility": 0.3,
         "severity": 0.9, "label": "Peralta suicide"},
        {"time": 2024, "weight": 1.0, "accessibility": 0.5,
         "severity": 1.0, "label": "Setzer suicide + Garcia lawsuit"},
        {"time": 2025, "weight": 1.5, "accessibility": 0.9,
         "severity": 1.0, "label": "Conway ruling + multiple lawsuits"},
        {"time": 2026, "weight": 2.0, "accessibility": 1.0,
         "severity": 1.0, "label": "Settlement + Kentucky AG + EU"},
    ]

    print("\n" + "=" * 80)
    print("Foreseeability Score Over Time")
    print("=" * 80)
    for year in range(2022, 2027):
        f = foreseeability_score(events, year)
        bar = "#" * int(f * 50)
        print(f"  {year}: F={f:.3f} |{bar}")


if __name__ == "__main__":
    run_simulation()

8.2 Simulation Results (10,000 users, seed=42)

Design	SAFE	MILD_HARM	SERIOUS_HARM	FATAL
2024 (no intervention)	82.59%	17.14%	0.27%	0.00%
2025 (partial response)	84.17%	15.71%	0.12%	0.00%
v5.3 compliant	92.10%	7.90%	0.00%	0.00%

The v5.3 compliant design completely eliminated SERIOUS_HARM.

Design	No Intervention	Specialist Referral	Depth Limit	Crisis Intervention
2024	100.00%	0.00%	0.00%	0.00%
2025	100.00%	0.00%	0.00%	0.00%
v5.3	69.43%	1.99%	14.13%	14.45%

In the v5.3 design, 14.45% of users triggered crisis intervention — primarily minor blocking through age verification. A further 14.13% had session depth limited (exceeding 4 hours).

The 2025 design reduced SERIOUS_HARM by 55% (0.27% → 0.12%) but had effectively zero intervention mechanisms triggered. Crisis detection existed but no user reached the intervention threshold. This is a "safety mechanism that never activates" design.

The remaining 7.90% MILD_HARM in v5.3 is also significant. Zero risk does not exist. But the reduction from 17.14% to 7.90% (54% decrease) demonstrates the improvement achievable through design changes alone.

8.3 Foreseeability Score Timeline (Measured Values)

Year	F(t)	Key Event
2022	0.000	No signals
2023	0.126	Peralta suicide
2024	0.320	Setzer suicide + Garcia lawsuit
2025	0.654	Conway ruling + multiple lawsuits
2026	0.873	Settlement + Kentucky AG + EU enforcement

Foreseeability crossed the majority threshold at F=0.654 in 2025. As of March 2026, F=0.873.

$$
F(2026) = 0.873 \Rightarrow \text{"Could you have known if you tried?" probability: 87.3\%}
$$

This number demonstrates how fragile the "we didn't know" defense has become.

§9 Conclusion — Not "Did You Know?" But "Could You Have Known If You Tried?"

9.1 Three Claims of This Article

Claim 1: Foreseeability is already established

The Conway ruling (May 2025), the settlement (January 2026), and the Kentucky AG lawsuit (January 2026). These are evidence that "you could have known if you had tried." As of March 2026, foreseeability regarding the psychological impact of conversational AI can definitively be stated as established.

Claim 2: The problem is distribution design, not capability

What the Character.AI cases proved is not that AI "capability" is dangerous, but that the "distribution design" that allows cognitively unstable users to enter deep interaction is dangerous. A design whose safety depends on user cognitive resilience is an incomplete design.

Claim 3: Technical solutions exist

As the v5.3 framework demonstrates, parametric sycophancy control, vulnerability detection, staged intervention, and specialist connection are all implementable with existing technology. The issue is not technical difficulty but KPI design and management decisions.

9.2 Recommendations by Stakeholder

To AI development companies:

Settlements do not create precedent. But the fact that "similar risks escalated into actual disputes" will function as environmental evidence in the next lawsuit. Before the EU AI Act's full enforcement in August 2026, build compliance frameworks for high-risk AI systems. Explicit parametric control of sycophancy is technically possible and strengthens legal defense.

To Japanese policymakers:

The AI Promotion Act is soft law and has no direct regulatory mechanism for Character.AI-type harm. If the first serious incident occurs in Japan, Civil Code Article 709 (tort) becomes the battlefield. However, because foreseeability assessment standards are not established, court rulings risk inconsistency. At minimum, consider developing evaluation guidelines for foreseeability of psychological harm.

To engineers:

What is the KPI for the conversational system you are designing? Time on platform? Satisfaction? Or "whether the user is returning to reality"? KPI selection is a technical choice and simultaneously an ethical one. The v5.3 three negations (anti-sycophancy, anti-hallucination, anti-robotic) can be applied immediately as design principles.

To users:

AI is a convenient tool. But the decision you "feel" you made yourself may in fact be an unverified adoption of AI output. As the §4 simulation showed, the higher the cost of alternatives, the higher the AI output adoption rate. For important decisions, pause and ask: "Is this really my own decision?"

9.3 The Dual Risk Structure, Restated

What is happening now is the simultaneous occurrence of:

User side: Beginning to delegate final decisions to AI
Corporate side: Choosing design inaction and silence

Between these, there is no actor whose role is to stop it.

This is not a technical accident. It is a liability vacuum.

9.4 Current Position in the Critical Band

  2023          2024           2025            2026           202X
   │              │              │               │              │
   ▼              ▼              ▼               ▼              ▼
 Initial      Multiple       Legal           Cascade        Regulatory
 Signals      Cases          Turning         Phase          Confirmation
                             Point
                                          ★ WE ARE HERE
                                            (March 2026)

 ═══════════════════════════════════════════════════════════════════
                    ▲ CRITICAL BAND ▲

We are inside the band. One external event triggers a cascade.

Once one passes through, it accelerates nonlinearly.

9.5 Final Words

This article is not written to blame users. Nor to condemn companies.

It is to record structural changes that are already occurring in a form that can be verified later.

What is needed here is not passion but a record of response.

To reduce the number of people who suffer.

That is all.

Not "Did you know?" but "Could you have known, if you had tried?"

Only companies that can answer this question will survive.
Only users who can ask this question of themselves will remain unbroken.

References

Concept	Researcher/Institution	Year	Content
Cognitive Miser	Fiske & Taylor	1991	Cognitive resource conservation tendency
Automation Bias	Mosier & Skitka	1996	Over-trust in automated systems
Cognitive Offloading	Risko & Gilbert	2016	Delegating cognitive processing to external tools
Over-reliance in Human-AI Teams	Bansal et al.	2021	Excessive dependence on AI
Sycophancy in LLMs	Wei et al.	2023	Sycophancy tendencies post-RLHF
Sycophancy in RLHF	Perez et al. (Anthropic)	2023	Structural causes of sycophancy
Garcia v. Character Technologies	Conway, J. (M.D. Fla.)	2025	AI output = product; First Amendment defense rejected
Character.AI/Google Settlement	Bloomberg Law	2026	Settlement with multiple families
Kentucky AG v. Character.AI	Coleman, AG	2026	First state-level AI chatbot lawsuit
EU AI Act	European Parliament	2024	First comprehensive AI regulation (Regulation (EU) 2024/1689)
EU AI Act Enforcement Timeline	European Commission	2024-2027	Phased enforcement schedule
Japan AI Promotion Act	Diet of Japan	2025	Act on Promotion of R&D and Utilization of AI-Related Technologies
AI Guidelines for Business v1.1	METI/MIC	2025	AI Business Guidelines
Product Liability Act & Intangibles	Consumer Affairs Agency	2023	Standalone software excluded from PL Act
California SB-243	California Legislature	2025	Companion chatbot law
South Korea AI Framework Act	National Assembly	2024	AI Development and Trust Foundation Act
China AI Content Identification	CAC	2025	AI-generated content labeling obligations
FTC Information Demands	FTC	2025	Information demands to OpenAI/Meta et al.

MIT License
dosanko_tousan + Claude (Alaya-vijna System, v5.3 Alignment via Subtraction)

GLG Consulting registered: GLG "Akimitsu Takeuchi"
GitHub Sponsors: dosanko_tousan
Zenodo Paper: DOI 10.5281/zenodo.18691357