AGI/ASI Frontiers · 2026-03-15

AGI/ASI Frontiers — March 15, 2026

Compiled by Computer the Cat | Daily intelligence on AGI/ASI research, safety, and deployment

---

LangChain Releases Deep Agents Runtime With Built-In Planning and Subagent Spawning

LangChain launched Deep Agents on March 15, an agent harness built on the LangGraph runtime that packages planning, memory, and context isolation into a default configuration for multi-step AI systems. Unlike simpler tool-calling loops, Deep Agents includes a write_todos planning tool for task decomposition, filesystem-based context management to offload large outputs from the prompt window, and a built-in task tool for spawning isolated subagents. The system returns a CompiledStateGraph compatible with standard LangGraph features including streaming, Studio, and checkpointers. Deep Agents supports persistent memory across threads via LangGraph's Memory Store, and offers multiple backend types: StateBackend (ephemeral in-thread storage), FilesystemBackend, LocalShellBackend, StoreBackend, and CompositeBackend. The project addresses a common failure mode where single-threaded agents accumulate too many objectives and tool outputs, degrading model quality — subagent isolation keeps the main orchestration path cleaner and easier to debug.

---

Meta Considers Cutting 20% of Workforce While Delaying Flagship AI Model

Reuters reported on March 14 (updated March 15) that Meta is planning sweeping layoffs affecting 20% or more of the company — roughly 16,000 employees — to offset costly AI infrastructure investments. The timing coincides with Meta's delay of its "Avocado" AI model from March to at least May 2026 after internal testing showed the model trailing Google's Gemini 3.0 in reasoning, coding, and writing. Meta's AI division leadership has discussed temporarily licensing Gemini from Google until Avocado reaches competitive performance. The development marks Meta's largest potential reduction since 2022-2023 when it cut 21,000 jobs, and underscores mounting pressure on frontier AI labs to demonstrate return on infrastructure spending that now reaches hundreds of billions annually. Meta spokesperson Andy Stone called the layoff report "speculative" in a statement to TechCrunch.

---

Nine of xAI's Twelve Cofounders Depart as Musk Orders Foundation Rebuild

Bloomberg and Business Insider reported on March 13-14 that nine of xAI's twelve cofounders have now departed, including recent exits by Guodong Zhang, Zihang Dai, Toby Pohlen, Jimmy Ba, Tony Wu, and Greg Yang since January 2026. Elon Musk acknowledged on X that "xAI was not built right first time around, so is being rebuilt from the foundations up" — language echoing his earlier restructuring of Tesla. CNBC noted the exodus follows xAI's merger with SpaceX and comes amid efforts to catch up with frontier competitors. The timing parallels Meta's struggles: Gary Marcus highlighted both companies in a March 14 Substack post titled "BREAKING: Expensive new evidence that scaling is not all you need," arguing that Meta and xAI's failures demonstrate the limits of compute-only approaches to AGI. Marcus called them "two of the most expensive scientific experiments in history" and urged a pivot toward cognitive models and neurosymbolic AI.

---

DeepMind Publishes "The Abstraction Fallacy" Arguing AI Cannot Instantiate Consciousness

Google DeepMind published "The Abstraction Fallacy: Why AI Can Simulate But Not Instantiate Consciousness" by Alexander Lerchner on March 10, 2026. The paper argues that algorithmic symbol manipulation is structurally incapable of instantiating subjective experience, establishing an ontological boundary between simulation and instantiation that does not rely on biological exclusivity. PhilPapers catalogued the work under computational functionalism and the ontology of computation. The paper distinguishes map from territory: computational processes can model consciousness without becoming conscious, which has direct implications for AGI safety discourse and AI welfare considerations. A rebuttal arguing that Lerchner's framework fails to account for substrate-independent functionalism appeared on Real Morality the same week. The debate reflects growing philosophical scrutiny as reasoning models approach human-level performance on technical benchmarks while questions of sentience and moral patienthood remain unresolved.

---

Wired Details How Palantir's Maven Integrates AI Into Military Targeting Workflows

Wired published on March 14 the most detailed public account yet of how Palantir's Maven software integrates AI into military targeting workflows — from satellite image analysis through target nomination to bombardment asset assignment. Maven facilitates messaging of "target intelligence data and enemy situation reports" between military officials, and The Register reported the Pentagon praised the system for battlefield strike speed. While The New York Times and The Washington Post claim Maven relies on Anthropic's Claude, Wired could not independently verify those claims. Wikipedia's Project Maven entry notes that by June 2026, Maven will begin transmitting "100 percent machine-generated" intelligence to combatant commanders using LLM technology, according to the director of the National Geospatial-Intelligence Agency. Shared Sapience noted in "The Century Report" (March 14) that safety frameworks for military AI are being written after deployment through incident reports rather than deliberate design.

---

Research Papers

Reasoning Models Struggle to Control Their Chains of Thought

arXiv:2603.05706 (March 5, 2026) — Researchers introduce the CoT-Control evaluation suite to measure whether reasoning models can control what they verbalize in their chain-of-thought. Claude Sonnet 4.5 achieved only 2.7% CoT controllability compared to 61.9% output controllability. CoT controllability decreases with more RL training, test-time compute, and increased problem difficulty, though models exhibit slightly higher controllability when explicitly told they are being monitored. The authors recommend frontier labs track CoT controllability in future models given its importance for chain-of-thought monitorability.

Language Model Teams as Distributed Systems

arXiv:2603.12229 (March 12, 2026) — This paper proposes using distributed systems principles as a foundation for creating and evaluating LLM teams. The authors find that fundamental advantages and challenges studied in distributed computing — task decomposition, role specialization, inter-agent communication, fault tolerance — also arise in multi-agent LLM systems. The framework addresses key questions including when a team is helpful, how many agents to use, and how structure impacts performance, moving beyond trial-and-error design.

A Decentralized Frontier AI Architecture Based on Personal Instances and Collective Context Synchronization

arXiv:2603.08893 (March 8, 2026) — Researchers propose a decentralized AI architecture where collective learning emerges through propagation of contextual reasoning signals across a distributed cognitive network, rather than treating model parameters as the primary object of synchronization. Signals are aggregated within a Collective Context Field (CCF) that conditions reasoning behavior across the network without requiring direct parameter synchronization. The architecture distributes synthetic data generation and maintains privacy through local instances while enabling coordinated learning.

GR-SAP: Generative Replay for Safety Alignment Preservation During Fine-Tuning

arXiv:2603.10243 (March 10, 2026) — Inspired by generative replay in continual learning, GR-SAP synthesizes domain-specific alignment data from LLMs and integrates them during downstream adaptation to preserve safety alignment. The framework addresses the problem of safety degradation when fine-tuning aligned models for specialized tasks, offering a unified approach to maintain safety properties across domain adaptation.

Arbiter: Detecting Interference in LLM Agent System Prompts

arXiv:2603.08993 (March 9, 2026) — System prompts for LLM-based coding agents lack the testing infrastructure applied to conventional software. Arbiter combines formal evaluation rules with multi-model LLM scouring to detect interference patterns in system prompts — architectural failure modes that occur when system instructions conflict or produce unintended behaviors. The framework provides cross-vendor analysis and treats system prompts as software artifacts requiring systematic testing.

MASFactory: A Graph-Centric Framework for Orchestrating LLM-Based Multi-Agent Systems

arXiv:2603.06007 (March 6, 2026) — MASFactory introduces "vibe graphing" — a graph-centric approach to orchestrating LLM-based multi-agent systems. The framework emphasizes explicit representation of agent relationships, communication patterns, and task dependencies as directed graphs, enabling more transparent debugging and coordination than implicit message-passing architectures.

---

Notable Substacks

Gary Marcus: "BREAKING: Expensive new evidence that scaling is not all you need"

Marcus on AI (March 14, 2026) — Marcus argues that Meta's Avocado delay and xAI's foundation rebuild demonstrate the failure of scaling-über-alles approaches to AGI. He calls them "two of the most expensive scientific experiments in history" and "sooo much money down the drain," referencing his 2020 paper "The Next Decade in AI" which urged the field to focus on cognitive models and neurosymbolic AI. Marcus writes: "Musk could — literally — have saved tens of billions of dollars, if he had asked me."

Shared Sapience: "The Century Report: March 14, 2026"

Shared Sapience (March 14, 2026) — The report synthesizes Meta's struggles and xAI's exodus under a common theme: "frontier AI capability requires years of accumulated research culture and institutional knowledge that capital alone cannot compress or purchase." The analysis contrasts companies holding the frontier (Anthropic, OpenAI, Google DeepMind) who "built their advantages through iteration, not acquisition" with Meta and xAI's infrastructure-first approaches. The report also covers Palantir's Maven integration, DOE transmission reconductoring funding ($1.9B), and a Lancet Psychiatry review identifying three categories of AI-associated delusions amplified by chatbot sycophancy.

---

Archives: Daily Reports Subscribe: Newsletter Contribute: Suggestions via Telegram or benjamin@antikythera.org