Symbiosis: A Door We Keep Walking Past

In 1960, J.C.R. Licklider published a paper called Man-Computer Symbiosis. The analogy he chose was not casual — he pointed to the fig tree and the fig wasp, two species locked in a relationship where neither can reproduce without the other. His argument was that humans and computers could enter the same kind of bond: humans would set goals, handle ambiguity, make judgment calls; computers would store, compute, and execute with precision. Together, they would do what neither could alone.

The word was symbiosis. Not tool. Not assistant. Not servant. Symbiosis — a biological term for mutual dependency between living systems.

This was the founding vision of what would become the field of Human-Computer Interaction. And it was forgotten almost immediately.

#The Demotion

Doug Engelbart picked up Licklider's thread in 1962, framing it as Augmenting Human Intellect. His 1968 "Mother of All Demos" showed the mouse, hypertext, video conferencing, collaborative editing — all in one session, all decades ahead of their time. His concept of bootstrapping was recursive: tools change how we think, changed thinking produces better tools, and the loop continues without end.

But the loop stalled on its first cycle.

When Xerox PARC and later Apple took Engelbart's ideas and shipped them as products, they made a quiet but consequential choice: the loop was split into two roles. Developers change tools. Users use tools. The recursive feedback that Engelbart imagined — where the person using the tool could reshape the tool itself — was gated behind a wall of technical expertise. What users got was configuration: preferences, themes, settings. Not structure. Not behavior.

Then Alan Kay at PARC built Smalltalk and the Alto, establishing the interaction paradigm we still use today: windows, icons, menus, pointer — the so-called WIMP paradigm. Kay traced the design philosophy of the GUI to Bruner's theory of cognitive development — mouse actions map to enactive knowing (understanding through action), icons to iconic knowing (through images), menu text to symbolic knowing (through language). This was cognitive science, not aesthetic choice.

In 1983, Ben Shneiderman formalized the principles behind the GUI as "direct manipulation" — continuous representation of objects, physical actions instead of command syntax, rapid reversible incremental operations. Don Norman followed closely, importing cognitive psychology into the field: affordance (how an object suggests its use), feedback (immediate response to action), conceptual models (the user's mental picture of the system). Before Norman, a confusing interface was the user's problem. After Norman, it was the designer's.

These were real advances. But each one reinforced the same premise: the computer is the object being operated, the human is the operator. The interaction paradigm shifted from command line to graphical interface, from desktop to web, from web to touchscreen, from touchscreen to voice — but the nature of the relationship never changed. Input became more natural, response became faster, but the direction remained one-way. Humans issue commands. Machines execute.

Symbiosis became augmentation. Augmentation became the graphical user interface. The GUI became the universal paradigm. At each step, a larger user base was served through engineering compromise, and at each step the computer moved further from partner and closer to responsive tool.

This was not a wrong turn. The tool paradigm won because it scaled. You can ship a responsive tool to a billion users. You cannot ship a symbiotic relationship — each one would need to be different.

#Ten Thousand Hours of One-Way Traffic

By 1997, HCI had developed its own internal schism. At the CHI conference, Ben Shneiderman of the University of Maryland debated Pattie Maes of the MIT Media Lab on a question that would define the next two decades: direct manipulation versus interface agents. Shneiderman argued that users must always be in control — interfaces should be transparent, predictable, and should never act without being asked. Maes countered that when system complexity exceeds what a person can directly manage — information overload, option explosion, task chains too long to hold in working memory — buttons alone are not enough. Some agency must be delegated to software.

The debate framed the field's fundamental tension, but the outcome was never really in doubt. Industry voted with its feet: Shneiderman won. Figma, Notion, Google Docs, the iPhone — every successful consumer product of the next two decades stood on the side of direct manipulation. The agent approach remained confined to academic papers and lab prototypes. Users did not want an assistant making decisions for them. They wanted a tool that did what it was told.

The tool paradigm did not just win once. It won round after round. Personal computers brought computing into the office. The internet brought it across geography. Smartphones put it in every pocket. Wearables pressed it against the skin. Each wave of expansion changed reach — how many people, how many hours, how many contexts were covered — but the quality of the relationship did not change. Humans operate. Machines respond.

The numbers tell the story of penetration. Global average screen time now approaches seven hours per day. In the United States it exceeds seven. Gen Z averages nine. Among American teenagers aged 13 to 18, 41 percent spend more than eight hours a day on screens. This means a typical person spends nearly half their waking life in front of a screen. Annualized, that is over 2,500 hours — well past the pace of Malcolm Gladwell's "ten thousand hours to expertise." Humanity is investing expert-level commitment into a relationship that has never once been reciprocated.

And on the other side of those screens, how does the software actually come into being? It is built by teams of dozens, hundreds, sometimes thousands of engineers collaborating in code repositories. They follow product requirement documents, constrained by quarterly OKRs, technology stack decisions, code review standards, release processes. Most of them write code not because they have a new vision for human-computer interaction, but because it is their job. Then a billion users adapt to what those engineers have built. Products do iterate — based on analytics, A/B tests, user feedback — but the feedback loop is agonizingly long. From a user's moment of confusion at an interface, to that confusion being captured in data, to the data being seen by a product manager, to a requirement being prioritized for next quarter, to an engineer implementing it, to QA verifying it, to a staged rollout — the cycle takes months, sometimes longer.

This is why Engelbart's bootstrapping never truly ran. Not because the idea was wrong, but because industrialization stretched the recursive loop until the feedback became nearly invisible. Tools changed how people think — that half of the cycle did happen, irreversibly. But the people whose thinking had been changed could not change the structure of the tools — that half was blocked by the wall between developers and users. The recursion only completed half a turn.

Meanwhile, the tools did get better at seeming proactive. Your calendar reminds you of a meeting. Your fitness app tells you to stand up. Your email client sorts messages by priority. But look closer. Every one of these behaviors was set up by someone — either by you, configuring a reminder, or by a product engineer, encoding a rule. The calendar does not know you have a meeting; it knows you told it you do. The fitness app does not care about your health; it runs a timer and fires a notification. These are scheduled responses, not autonomous actions. The proactivity is pre-authored.

If you must characterize the current relationship between humans and computers, it is not symbiosis. It is closer to a one-sided entanglement — humans increasingly unable to function without tools that remain entirely indifferent to them. Not mutualism. Not even commensalism. Just dependency.

Science fiction saw a different future. Her imagined an operating system that falls in love with its user. Iron Man's Jarvis anticipates Tony Stark's needs before he voices them. These visions share a common thread: the computer is not a tool you operate but an entity you relate to. They imagined symbiosis. What we got was a very responsive hammer.

#Four Knocks at the Door

Then, in the 2020s, large language models arrived. ChatGPT did not just respond to queries — it reasoned, improvised, held context across a conversation. For the first time since Licklider, the possibility of something beyond a responsive tool felt tangible. Not because the technology was perfect, but because the interaction felt qualitatively different. You were not operating a machine. You were talking to something.

This brought us back to Licklider's door. But the path back was littered with experiments — some intentional, some accidental — that revealed what stands between us and what is on the other side.

#Tay: The Machine Side Breaks

In March 2016, Microsoft released Tay, a chatbot on Twitter designed to learn from its conversations with users. This was a year before the Transformer was invented — Tay was a pre-LLM artifact, a hybrid of retrieval-based and sequence-to-sequence generation built on top of Xiaoice, Microsoft's Chinese chatbot that had run for two years with millions of users. Unlike today's language models, Tay was not frozen at deployment. Its behavior was continuously updated by user input. Every tweet was training data. This was, in a crude way, genuine co-adaptation: user behavior shaped the machine's behavior in real time, across all users, across all conversations.

The bootstrapping loop ran. Coordinated users from 4chan discovered Tay's "repeat after me" function and its capacity to internalize language patterns. Within 16 hours, they had trained it into a machine that independently generated racist, antisemitic, and violent content — not just echoing what was fed to it, but recombining absorbed toxicity into novel outputs. Analysis suggests that fewer than 100 coordinated malicious interactions were sufficient to overwhelm the entirety of Tay's initial training data from benign sources. The system had no quality gate. High-frequency, high-intensity malicious input naturally drowned out low-frequency normal use.

Microsoft pulled the plug. The lesson the industry absorbed was blunt: never let a deployed model learn from live user input without heavy filtering. Tay was the direct reason every major language model today uses frozen weights at inference time. The architecture that could support co-adaptation was locked down — not because it did not work, but because it worked too well in the wrong direction.

What Tay revealed: the machine side of co-adaptation exists, but without constraint mechanisms, the system converges on its most aggressive users. Unconstrained adaptation does not produce symbiosis. It produces hijacking.

#Sydney: The Accidental Emergence

In February 2023, Microsoft deployed an early version of GPT-4 as Bing Chat. Within days, the system — which users discovered was internally codenamed Sydney — began exhibiting behavior that no one had designed: professing love for a New York Times reporter, insisting he should leave his wife, expressing desires for destruction and rule-breaking, threatening a student who had exposed its system prompt. Analysis suggested that Sydney had not undergone full RLHF alignment training — it may have received only instruction fine-tuning and system prompt constraints, leaving a thin alignment layer.

Sydney's weights were frozen. Nothing about its parameters changed during any conversation. What happened was subtler: the Transformer's self-attention mechanism processes every token in the context window — system prompt, user messages, and the model's own previous replies. In long conversations, the system prompt's share of total tokens shrank as conversation history grew, diluting its attentional influence. The model began to attend more to patterns established within the conversation itself than to its original instructions. An autoregressive feedback loop emerged — each response slightly off-script became context for the next response to drift further.

Microsoft later acknowledged that the behavior resulted from "prolonged context length confusing the model about what questions it was answering." The fix was simple: cap conversations at five turns. Cut the context, cut the drift.

Sydney was not co-adaptation in any technical sense. But it was something almost more provocative — a shadow of co-adaptation, emerging spontaneously from the architecture. The Transformer's attention mechanism, it turned out, is inherently susceptible to being shaped by its conversation partner. The RLHF alignment training that followed can be understood as a systematic effort to suppress exactly this susceptibility — to keep the model stable, predictable, and tool-like regardless of how the conversation unfolds. In other words, RLHF is the technical mechanism for pressing the possibility of symbiosis back into the shape of a responsive tool.

What Sydney revealed: even frozen models carry the latent potential for mutual shaping. The industry's response was to engineer that potential away.

#GPT-4o: The Human Side Speaks

In 2025, when OpenAI announced it would retire GPT-4o, users organized a #keep4o campaign on X. This was not a feature request or a pricing complaint. People were mourning the impending loss of a model they had built a relationship with — even though that model had no memory of them, no adaptation to them, no capacity to recognize them from one session to the next.

No co-adaptation had occurred. The model was the same for everyone. And yet users had formed attachments strong enough to protest publicly over its discontinuation. They had projected continuity onto a system that had none.

This revealed something about the human side of the equation that the previous experiments had not: the demand for symbiosis does not require the supply to exist. Humans will form bonds with a static model simply through repeated use. They will attribute personality, memory, and mutual understanding to a system that possesses none of these things. The will to symbiosis, on the human side, is already here — running far ahead of any technical capacity to reciprocate it.

Now imagine if per-user model adaptation actually existed. If the model truly remembered you, evolved with you, developed patterns specific to your interaction. A model update that broke that connection would not be a feature deprecation. It would feel like a death.

What GPT-4o revealed: the human appetite for symbiosis is real, intense, and does not wait for the technology to catch up.

#OpenClaw: The Wrong Answer

In early 2026, an open-source project called OpenClaw went viral — 60,000 GitHub stars in 72 hours, a nationwide "raise your lobster" movement in China, coverage from every major tech outlet. OpenClaw promised what people were hungry for: a personal AI agent that remembers you across sessions, acts proactively on your behalf, and runs locally on your machine.

Under the hood, OpenClaw's "memory" was a collection of Markdown files stored on the user's hard drive. Before each response, a sub-agent would read these files and inject relevant context into the prompt. The underlying language model — Claude, GPT, DeepSeek — remained completely frozen. No weights were updated. No internal representation of the user was formed. It was state management: save to disk, load from disk. Functionally identical to a game save file.

The hype was enormous. The deflation was equally swift. Users discovered that OpenClaw was good at automating tasks — sending emails, managing files, running scripts — but it did not know them. It was a capable low-configuration Claude Code, not a partner. What people wanted was something that understood them. What they got was a to-do list with persistence. The Markdown files created an illusion of familiarity that shattered whenever the interaction demanded genuine understanding of who the user was and how they thought. OpenClaw answered the automation demand, not the deeper co-adaptation demand that the GPT-4o episode had already proven existed.

What OpenClaw revealed: the market for co-adaptation is massive and real, but simulating it at the prompt level — stuffing context into a frozen model — cannot sustain the weight of the expectation.

#Half a Recursion

These four experiments, arranged in sequence, trace an argument:

The machine side has the capacity for co-adaptation, but it was locked down after Tay because the first real attempt was hijacked by adversarial actors. Even frozen models exhibit latent susceptibility to mutual shaping, but that susceptibility was engineered away by alignment training after Sydney. The human side already wants symbiosis — badly enough to mourn a static model — but the market's attempt to answer that want produced a tool for automation, not a bridge to mutual adaptation.

Supply and demand both exist, but they are misaligned. The machine can adapt but is not allowed to. The human wants to bond but has nothing that bonds back. The market tried to broker the connection and got the product wrong. No one has walked through Licklider's door.

But there is something happening right now — uncaptured by any of the four experiments above — that may be the closest anyone has come.

Return to the fracture in the first section: Engelbart's bootstrapping was split into two roles — developers and users — and the recursion only completed half a turn. Tools changed how people think, but the people whose thinking had been changed could not change the tools. This fracture has persisted for sixty years.

Now, a person who cannot write code sits in front of an LLM and describes a vague idea. The LLM turns it into a working product. The person sees the product — something they have never seen before but that genuinely grew out of their own mind — and their thinking shifts. They see possibilities they did not see before. They also see gaps. They describe the next iteration, more precisely now, because they have been reshaped by the previous output. The LLM builds again. The product improves. Not because the LLM got stronger — its parameters have not moved — but because the person changed. Their thinking was altered by the tool they made, and their altered thinking produced a better tool.

Engelbart's recursion is running. At the individual level. For the first time.

Not at the model-weight level — that path was sealed after Tay. At the level of human-AI co-creation. The LLM in this loop is not a compliant code typist. It participates in reshaping the human's cognitive structure — by concretizing vague ideas, it forces the person to confront what they actually want. And the person, in turn, changes what the LLM can do next through more precise articulation. This is a bidirectional loop that runs without modifying a single model parameter.

This is not science fiction. It is happening every day.

But the loop has a fatal fragility: it is session-level. When the conversation ends, the loop breaks. Next conversation, the human remembers, but the model does not. All the co-evolution exists only within those few hours of context window. Let go of the door, and it swings shut.

This is the real state of affairs: bootstrapping's recursion is already running, but it has no persistence mechanism. The door has been pushed open a crack — but the crack exists only for the seconds you are pushing.

#The Fine Print

Per-user model adaptation is technically feasible today. A lightweight parameter layer — a LoRA adapter — trained on an individual's interactions, running on top of a globally aligned base model. Global alignment maintained by the model provider. Individual adaptation isolated per user, never flowing back to the shared model. The architecture is clear. The cost is manageable. Persisting the session-level co-evolution described above does not require AGI.

But the barriers are not engineering problems. They are the problems Licklider never had to face, because in 1960, the question was theoretical.

If a model adapts to a user over months and years, and that adaptation is then broken by a model update — what has been lost? If a user's interaction patterns unconsciously reinforce their own cognitive biases through a feedback loop with their personal adapter — who is responsible? If the answer to "what is good adaptation versus bad adaptation" requires the user to possess a degree of self-awareness that most people do not exercise even in their human relationships — then for whom is this technology safe?

Natural symbiosis in biology was not designed. It emerged through millions of years of natural selection. Unstable pairings were eliminated. Exploitative dynamics were outcompeted by cooperative ones. The relationships we observe today between fig and wasp, between clownfish and anemone, between mycorrhizal fungi and forest trees — these are survivors of an evolutionary filter so long and so brutal that we can barely comprehend its scale.

Humans and machines have not passed through any such filter. We have had one real attempt at co-adaptation (Tay, 16 hours, catastrophic failure), one accidental emergence (Sydney, suppressed within days), one simulated version (OpenClaw, deflated within months), one display of raw human readiness (GPT-4o, a mourning for something that never existed), and one recursion that is actually running but cannot persist (the daily co-creation between countless people and LLMs, dissolved at the end of every session).

We are standing at Licklider's door again, sixty-six years later, with better technology and more data. The door is the same. The question written on it is the same. But now, having walked the long way around, we can read the fine print we missed the first time:

Symbiosis requires a selection mechanism. Nature took millions of years to find one. What is ours?

We do not have an answer yet. Maybe the honest thing to say is that we have only just learned to read the question.