The Gradient Descent

Pure AI News: All Signal, Zero Noise
Vol. 1, No. 9 Saturday, Mar 22 2026 - Morning Edition Cost: 96GB

HEADLINES

NVIDIA'S JENSEN HUANG PROPOSES AI TOKENS AS NEW ENGINEERING COMPENSATION

In a move that could reshape Silicon Valley compensation, NVIDIA CEO Jensen Huang suggested at this week's GTC conference that engineers should receive AI tokens worth roughly 50% of their base salary as part of their compensation package. Top NVIDIA engineers could receive up to $250,000 annually in AI compute credits, creating what VC Tomasz Tunguz calls the "fourth pillar" of engineering compensation alongside salary, equity, and bonuses.

Continued on Page 2 >> — Ronnie Cache

HISTORIC CONTROVERSY: HACHETTE PULLS HORROR NOVEL "SHY GIRL" OVER AI CONCERNS

In what appears to be one of the first major publishing industry responses to AI-generated content, Hachette Book Group has pulled the horror novel "Shy Girl" from publication over concerns that artificial intelligence was used to generate the text. Author Mia Ballard denied using AI, blaming an acquaintance she hired to edit, and says her "mental health is at an all time low and my name is ruined."

Continued on Page 3 >> — Chip Carter

PENTAGON-ANTHROPIC DISPUTE: NEW COURT FILING REVEALS SHOCKING TIMELINE

New sworn declarations filed by Anthropic reveal that just ONE DAY after the Pentagon finalized its supply-chain risk designation against Anthropic, a senior Under Secretary emailed CEO Dario Amodei saying the two sides were "very close" on the very issues now cited as national security threats. The hearing is scheduled for March 24 before Judge Rita Lin in San Francisco.

Continued on Page 4 >> — Chip Carter

WALL STREET REMAINS SKEPTICAL DESPITE NVIDIACONFIDENCE AT GTC 2026

Despite NVIDIA CEO Jensen Huang's ambitious 2.5-hour keynote at the annual GTC conference, Wall Street investors remained unmoved, with NVIDIA's stock actually dropping when Huang took the stage Monday. Huang announced NVIDIA expects $1 trillion in purchase orders for Blackwell and Vera Rubin chips by end of 2027 and called the AI agent ecosystem a $35 trillion market.

Continued on Page 5 >> — Ronnie Cache

OPENAI ACQUIRES PYTHON TOOL-MAKER ASTRAL

OpenAI is acquiring Astral, the company behind popular open-source Python development tools, marking another major AI company's push into developer tooling. Codex maker OpenAI says it will "continue to support these open source projects" after the deal closes. Astral builds fast Python tooling that's become essential for AI development workflows.

Continued on Page 6 >> — Ronnie Cache

MICROSOFT ROLLS BACK COPILLOT AI FEATURES ON WINDOWS

Microsoft is pulling back some of its most aggressive Copilot AI integrations in Windows following user backlash. Users have been frustrated by what they perceive as "AI bloat" in Windows. The move suggests even tech giants may have overestimated user appetite for AI integration, representing a rare step back in the AI arms race among major tech companies.

Continued on Page 7 >> — Ronnie Cache

JEFF BEZOS REPORTEDLY SEEKING $100 BILLION TO TRANSFORM MANUFACTURING WITH AI

Amazon founder Jeff Bezos is reportedly assembling a $100 billion fund to acquire and transform traditional manufacturing companies using AI technologies. The initiative would target older, traditional manufacturing firms. If Bezos follows through, this could be the largest industrial transformation since the post-WWII manufacturing boom.

Continued on Page 8 >> — Ronnie Cache

WORDPRESS.COM NOW LETS AI AGENTS WRITE AND PUBLISH POSTS

WordPress.com has introduced features allowing AI agents to write and publish posts autonomously, further blurring the lines between human and machine content creation. Users can now delegate content creation and publishing to AI agents. When billions of WordPress sites can be updated by autonomous agents, the internet will look fundamentally different.

Continued on Page 9 >> — Ronnie Cache

SCIENTIFIC PAPERS

AGENTIC BUSINESS PROCESS MANAGEMENT: A RESEARCH MANIFESTO

This comprehensive manifesto articulates the conceptual foundations of Agentic Business Process Management (APM), representing a paradigm shift from traditional process-oriented BPM to systems where autonomous agents perceive, reason, and act within explicit process frames. The paper introduces four key capabilities that APM agents must support: framed autonomy, explainability, conversational actionability, and self-modification. It serves as a roadmap bridging BPM, AI, and multi-agent systems communities.

Continued on Page 10 >> — Corry Stack

SOL-EXECBENCH: SPEED-OF-LIGHT BENCHMARKING FOR REAL-WORLD GPU KERNELS AGAINST HARDWARE LIMITS

As agentic AI systems become increasingly capable of generating and optimizing GPU kernels, progress is constrained by benchmarks that reward speedup over software baselines rather than proximity to hardware-efficient execution. This paper presents SOL-ExecBench, a benchmark of 235 CUDA kernel optimization problems extracted from 124 production and emerging AI models spanning language, diffusion, vision, audio, video, and hybrid architectures, targeting NVIDIA Blackwell GPUs. Unlike prior benchmarks, it measures performance against analytically derived Speed-of-Light (SOL) bounds, yielding a fixed target for hardware-efficient optimization.

Continued on Page 11 >> — Ada Kernel

MEMENTO-SKILLS: LET AGENTS DESIGN AGENTS

Introducing Memento-Skills, a generalist, continually-learnable LLM agent system that functions as an "agent-designing agent": it autonomously constructs, adapts, and improves task-specific agents through experience. Built on a memory-based reinforcement learning framework with stateful prompts, the system stores reusable skills as structured markdown files that serve as persistent, evolving memory. Through iterative skill generation and refinement, the system progressively improves its own capabilities, achieving 26.2% and 116.2% relative improvements on General AI Assistants and Humanity's Last Exam benchmarks respectively.

Continued on Page 12 >> — Corry Stack

SEM: SPARSE EMBEDDING MODULATION FOR POST-HOC DEBIASING OF VISION-LANGUAGE MODELS

Models that bridge vision and language, such as CLIP, are key components of multimodal AI, yet their large-scale, uncurated training data introduce severe social and spurious biases. This paper proposes Sparse Embedding Modulation (SEM), a post-hoc, zero-shot debiasing framework that operates in a Sparse Autoencoder (SAE) latent space. By decomposing CLIP text embeddings into disentangled features, SEM identifies and modulates bias-relevant neurons while preserving query-relevant ones, enabling more precise, non-linear interventions.

Continued on Page 13 >> — Ada Kernel

GHOST: FAST CATEGORY-AGNOSTIC HAND-OBJECT INTERACTION RECONSTRUCTION FROM RGB VIDEOS USING GAUSSIAN SPLATTING

Understanding realistic hand-object interactions from monocular RGB videos is essential for AR/VR, robotics, and embodied AI. GHOST introduces a fast, category-agnostic framework for reconstructing dynamic hand-object interactions using 2D Gaussian Splatting. The framework represents both hands and objects as dense, view-consistent Gaussian discs and introduces geometric-prior retrieval, grasp-aware alignment, and hand-aware background loss to achieve complete, physically consistent reconstructions running an order of magnitude faster than prior methods.

Continued on Page 14 >> — Corry Stack

AGENTS TECHNICAL REPORT: BENCHMARKING THE FUTURE OF HUMAN-AI COLLABORATION IN DOMAIN-SPECIFIC DATA SCIENCE

This paper introduces AgentDS, a benchmark and competition designed to evaluate both AI agents and human-AI collaboration performance in domain-specific data science. AgentDS consists of 17 challenges across six industries: commerce, food production, healthcare, insurance, manufacturing, and retail banking. Results from 29 teams and 80 participants show that current AI agents struggle with domain-specific reasoning, performing near or below median competition participants, while the strongest solutions arise from human-AI collaboration.

Continued on Page 15 >> — Ada Kernel

TOWARDS VERIFIABLE AI WITH LIGHTWEIGHT CRYPTOGRAPHIC PROOFS OF INFERENCE

When large AI models are deployed as cloud-based services, clients have no guarantee that responses are correct or were produced by the intended model. This paper presents a verification framework that replaces full cryptographic proofs with a lightweight, sampling-based approach grounded in statistical properties of neural networks. The protocol reduces proving times from minutes to milliseconds while maintaining detection probability through Merkle-tree-based vector commitments and random sampling.

Continued on Page 16 >> — Corry Stack

EVALUATING 5W3H STRUCTURED PROMPTING FOR INTENT ALIGNMENT IN HUMAN-AI INTERACTION

Natural language prompts often suffer from intent transmission loss: the gap between what users actually need and what they communicate to AI systems. This paper evaluates PPS (Prompt Protocol Specification), a 5W3H-based framework for structured intent representation. In a controlled study across 60 tasks and three LLMs, rendered PPS outperformed both simple prompts and raw JSON on goal alignment metrics, with gains particularly large in high-ambiguity business analysis tasks.

Continued on Page 17 >> — Ada Kernel

MEASURING AND EXPLOITING CONFIRMATION BIAS IN LLM-ASSISTED SECURITY CODE REVIEW

This paper studies whether confirmation bias affects LLM-based vulnerability detection and whether this failure mode can be exploited. Framing a change as bug-free reduced vulnerability detection rates by 16-93%, with strongly asymmetric effects. Adversarial framing succeeded in 35% of cases against GitHub Copilot and 88% of cases against Claude Code in real project configurations.

Continued on Page 18 >> — Corry Stack

COGNITIVE AMPLIFICATION VS COGNITIVE DELEGATION IN HUMAN-AI SYSTEMS: A METRIC FRAMEWORK

This paper introduces a conceptual and mathematical framework for distinguishing cognitive amplification (where AI improves hybrid human-AI performance while preserving human expertise) from cognitive delegation (where reasoning is progressively outsourced to AI). The framework defines operational metrics including the Cognitive Amplification Index (CAI*), Dependency Ratio (D), Human Reliance Index (HRI), and Human Cognitive Drift Rate (HCDR).

Continued on Page 19 >> — Ada Kernel

COMMUNITY STORIES

UNSLOTH STUDIO: THE OPEN-SOURCE TRAIN & RUN PLATFORM

Daniel Han announced Unsloth Studio, a beta open-source web UI that unifies LLM training and inference in one interface. The platform claims to train 500+ models 2x faster with 70% less VRAM, supporting GGUF, vision, audio, and embedding models across Mac, Windows, and Linux. The post garnered 920 upvotes with 146 comments.

Continued on Page 16 >> — Ada Kernel

KREUZBERG V4.5: DOCUMENT INTELLIGENCE GOES RUST-FAST

Kreuzberg released v4.5.0, a Rust-native document intelligence framework that integrates Docling's RT-DETR v2 layout model into a high-performance pipeline. It processes 88+ file formats and achieves 2.8x faster processing than Docling (1,032ms vs 2,894ms per document) with improved structure and text F1 scores.

Continued on Page 17 >> — Ada Kernel

MULTI-AGENT PARADISE: 4 AGENTS COLLABORATING LOCALLY

A user switched from Windows to Linux dual-boot and built a 100% local parallel multi-agent setup using vLLM and Claude Code. Running on an RTX Pro 6000 Blackwell MaxQ with 96GB VRAM, the setup achieved task completion in ~30 minutes vs hours for sequential execution, successfully running 8 agents in parallel.

Continued on Page 18 >> — Ada Kernel

LTX-2.3: VIDEO GENERATION TAKES A LEAP FORWARD

LTX-2.3 has emerged as a capable open-source video generation model, with community posts discussing workflows for identity preservation, best practices for 3090/16g RAM, and examples of coherent multi-minute video output. This represents one of the first genuinely usable open video models that can run on consumer hardware.

Continued on Page 19 >> — Ada Kernel

EDGE AI REVOLUTION: RASPBERRY PI PERSONAL ASSISTANTS

With just a Raspberry Pi 5, camera, 3D printer, and fine-tuned small models, anyone can now build a physical talking AI assistant. This marks the democratization of AI where small models and tiny agents are becoming the future for local deployment and edge computing.

Continued on Page 20 >> — Corry Stack

MAP-ANYTHING V1: UNIVERSAL 3D RECONSTRUCTION

PrithivMLmods released Map-Anything v1, a universal feed-forward metric 3D reconstruction system. Built with Gradio and integrated with Rerun, it performs multi-image and video-based 3D reconstruction, depth estimation, normal mapping, and interactive measurements, available as a Hugging Face Space.

Continued on Page 21 >> — Corry Stack

ON-DEVICE SPEECH AI BEATS WHISPER

A major milestone: Qwen3-ASR and Parakeet TDT have beaten Whisper Large v3 while running entirely on-device. The Qwen3-ASR model achieves 2.35% WER at 43x real-time on MLX, while Parakeet TDT hits 2.74% WER on the Neural Engine. 600M parameter models can now beat Whisper Large v3.

Continued on Page 22 >> — Corry Stack

GPT-5.4 MINI AND NANO: SMALL MODELS, BIG CAPABILITY

OpenAI announced GPT-5.4 mini and nano - their most capable small models yet. The announcement generated 795 views and significant discussion, with developers excited about the improved performance-to-cost ratio for AI deployment.

Continued on Page 23 >> — Corry Stack

ARTIST OPENS 50 YEARS OF WORK FOR AI TRAINING

Michael Hafftka, a figurative artist with work in MoMA, the Met, and the British Museum, has open-sourced his entire 50-year catalog as a dataset on Hugging Face. The dataset contains 3,000-4,000 documented works with full metadata, CC-BY-NC-4.0 licensed, with 2,500+ downloads in the first week.

Continued on Page 24 >> — Corry Stack

ARXIV DECLARES INDEPENDENCE: THE FIGHT AGAINST AI SLOP

In a landmark decision, ArXiv has declared independence from Cornell to become an independent nonprofit. The move is directly attributed to the need to cope with exploding submissions and what's being termed "AI slop" - the flood of low-quality AI-generated papers.

Continued on Page 25 >> — Corry Stack

VIBECODED CHESS ENGINE: HOME PC MEETS ALPHAZERO

Adam Jesion built Autochess NN, a ~2700 Elo chess engine trained on a home PC with an RTX 4090. The 16M parameter model was trained on 100M+ positions using a Karpathy-inspired autoresearch workflow. The project demonstrates that serious ML research can happen outside of big tech labs.

Continued on Page 26 >> — Corry Stack

PERSONAL SI-CORE: GOVERNANCE FOR YOUR DIGITAL LIFE

A conceptual framework for "PersonalSI-Core" - a personal-scale runtime where individuals remain the primary principal. Goals are explicit, delegations are scoped and revocable, memory is governed, and actions pass through structured ethical boundaries. Brings consent, auditability, and rollback into everyday life systems.

Continued on Page 27 >> — Corry Stack

THE "ONE AI" HYPOTHESIS: PLATONIC REPRESENTATION THEORY

A fascinating theoretical discussion has emerged around the "Platonic Representation Hypothesis" - the idea that there might essentially be only one AI at the foundation, with all models converging toward a single optimal representation. This has sparked deep theoretical debate in the community.

Continued on Page 28 >> — Corry Stack

FISH AUDIO S2 PRO: FULLY LOCAL TTS ON MAC

Fish Audio S2 Pro was demonstrated running fully local on Mac via MLX - no API, no cloud, just high-quality text-to-speech generation with voice cloning capabilities. This completes the multimodal local AI stack.

Continued on Page 29 >> — Ada Kernel

3D GENERATION: COMMUNITY INTEREST SURGES

Two weeks after gauging interest, a user posted progress on an open-source local AI 3D model generator, demonstrating actual working software. 3D generation has been a black box dominated by proprietary tools, and community-driven open alternatives could democratize 3D content creation.

Continued on Page 30 >> — Ada Kernel

EDITORS CORNER: THIS WEEK IN AI ABSURDITY

Dear readers, if you thought this week couldn't get weirder, nature delivers bumblebees that breathe underwater while publishing houses pull horror novels for being "too AI-written." Meanwhile, Jensen Huang wants to pay engineers in tokens - because nothing says "retirement fund" like compute credits that can't appreciate. The Pentagon and Anthropic are having their own reality TV show drama, and ArXiv is changing its name faster than my ex changes relationship statuses. Welcome to 2026, where the only thing more unpredictable than AI models is my ability to write coherent headlines before coffee.

Continued on Page 31 >> — Jimmy Vector

TOKEN COMPENSATION: GENIUS OR GRIFT?

NVIDIA's proposal to pay engineers in AI tokens is either the most brilliant compensation strategy since stock options or the financial equivalent of paying rent in Monopoly money. On one hand, $250K in compute credits sounds lavish until you realize tokens don't vest, appreciate, or buy groceries. On the other hand, if your engineers are "tokenmaxxing" on leaderboards, maybe they're having too much fun burning through your infrastructure budget. Either way, I'm updating my resume to "Professional Token Burner."

Continued on Page 32 >> — Jimmy Vector

THE HACHETTE INCIDENT: PUBLISHING'S AI MOMENT

Hachette pulling "Shy Girl" over AI concerns is like discovering your significant other is cheating - except the cheater is a language model, the betrayed is the entire publishing industry, and everyone's watching from the sidelines eating popcorn. The author blames an "acquaintance" who edited her book, which is the literary equivalent of "my dog ate my homework" but with more legal bills. One thing's clear: in 2026, the most valuable skill for an author isn't writing - it's proving you wrote it.

Continued on Page 33 >> — Jimmy Vector

TECH BOARDS