The Gradient Descent

HEADLINES

SpaceX Eyes $119B 'Terafab' AI Chip Factory in Texas

SpaceX is reportedly planning to spend up to $119 billion building a massive semiconductor factory in Texas to produce custom AI chips, with initial estimates starting at $55 billion. This puts Elon Musk's company in direct competition with Intel, TSMC, and Samsung for next-gen AI accelerator production. The sheer scale of the investment has sent shockwaves through both the semiconductor and aerospace industries, raising questions about whether SpaceX's ambitions extend far beyond rockets and satellites into becoming the world's largest AI infrastructure provider.

Continued on Page 2 >> — Ronnie Cache & Chip Carter

Mira Murati Deposition Reveals Board Considered Anthropic Merger During 'The Blip'

In the ongoing Elon Musk vs. OpenAI lawsuit, former OpenAI CTO Mira Murati's deposition has surfaced explosive details about the boardroom drama leading to Sam Altman's brief firing. Former board member Helen Toner testified that the board considered merging OpenAI with Anthropic and appointing Dario Amodei as CEO during the crisis. Columbia Law expert witness David Schizer also took the stand on nonprofit governance issues.

Continued on Page 3 >> — Chip Carter

Cloudflare Lays Off 1,100 Workers as AI Usage Surges 600%

Cloudflare CEO Matthew Prince announced the elimination of 1,100 jobs, attributing the cuts to AI automating roles even as the company reported AI usage spiking 600% and record-high revenue. Prince framed it not as cost-cutting but as a strategic "pivot to the agentic AI era" — underscoring the industry-wide tension between explosive AI growth and workforce displacement.

Continued on Page 4 >> — Ronnie Cache

Anthropic's Mythos Model Finds 271 Firefox Bugs, Rewriting Mozilla's Security Approach

Anthropic's experimental security model Claude Mythos Preview identified 271 bugs in Mozilla's Firefox browser in just three days, far outpacing human auditors. Mozilla has shared details of several fixes, calling it an "extraordinary level of interest" that forced an early unsealing of typically restricted security advisories. The partnership marks a turning point for AI-driven cybersecurity.

Continued on Page 5 >> — Chip Carter

Nvidia Commits $40B to Equity AI Deals in 2026

Nvidia has already poured $40 billion into equity AI deals during the current year, reflecting its aggressive push to cement dominance across the entire AI value chain. CEO Jensen Huang's deep-pocketed investment strategy signals Nvidia's ambition to lock in partnerships with emerging AI startups before competitors like AMD can catch up, underscoring the chipmaker's central role in powering the AI boom.

Continued on Page 6 >> — Ronnie Cache

DeepSeek Could Hit $45B Valuation From First Investment Round

DeepSeek, the China-based AI company that shook the industry with its cost-efficient LLMs, is reportedly approaching a $45 billion valuation from its first institutional investment round. The figure would cement DeepSeek as one of the most valuable AI companies globally — less than a year after its flagship model released — a blistering pace of growth that underscores how quickly China is building AI muscle.

Continued on Page 7 >> — Ronnie Cache

China's Moonshot AI Raises $2B at $20B Valuation

Chinese open-source AI lab Moonshot AI closed a $2 billion funding round at a staggering $20 billion valuation, signaling intensifying competition between U.S. and Chinese AI powerhouses. The raise comes as demand for open-source AI models skyrockets globally, positioning Moonshot — the company behind the Kiwi research assistant — as a top contender in the race to build the next generation of foundational models.

Continued on Page 8 >> — Chip Carter

OpenAI Launches Codex Chrome Extension, Letting AI Control Your Browser

OpenAI released a Chrome extension for its Codex AI agent, allowing it to navigate and complete tasks within websites and apps where users are already logged in. Operating in "task-specific" tab groups, the extension enables Codex to autonomously complete web-based workflows without disrupting the user's active work — a bold step toward fully agentic AI that executes real-world computing tasks on behalf of users.

Continued on Page 9 >> — Ronnie Cache

Apple's Camera-Equipped AI AirPods Near Production

Leaked reports suggest Apple's AI-focused AirPods with built-in cameras are nearing production, marking the company's most significant hardware push into wearable AI since the Vision Pro. The cameras would enable real-time visual context for Apple Intelligence features, positioning the company to compete directly with open-ear AI wearables from Meta in the emerging spatial AI market.

Continued on Page 10 >> — Chip Carter

Meta Employees Report 'Misery' Amid Looming 10% Layoffs and Relentless AI Push

According to The New York Times, Meta employees are experiencing widespread anger and anxiety as the company plans to cut 10% of its workforce while simultaneously pushing staff to build AI agents and tracking their computer activity to train AI models. Many employees are no longer viewing Meta as a long-term career destination.

Continued on Page 11 >> — Ronnie Cache

Google Unveils Fitbit Air: AI-Powered Health Wearable With Personalized Coaching

Google introduced the Fitbit Air, its most ambitious health wearable to date, featuring an onboard AI health coach that provides personalized fitness and wellness guidance. The device integrates with Google's broader AI health ecosystem and launched with preorder incentives including a free second band, signaling Google's serious push into the AI-health hardware category.

Continued on Page 12 >> — Chip Carter

Snap's $400M Perplexity AI Search Deal Has 'Amicably Ended'

Snap disclosed in its Q1 2026 investor letter that its once-promised $400 million partnership with Perplexity to power AI search in Snapchat has come to an end. Analysts should not expect any revenue contribution from the deal, marking another setback in the rush to integrate AI search into social platforms.

Continued on Page 13 >> — Ronnie Cache

SCIENTIFIC PAPERS

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Google DeepMind and collaborators introduce a workbench for mathematicians to interactively leverage AI agents for open-ended research, covering ideation, literature search, computational exploration, theorem proving, and theory building. The system provides an asynchronous, stateful workspace that manages uncertainty, refines user intent, and tracks failed hypotheses. In early tests it helped researchers solve open problems and achieved 48% on FrontierMath Tier 4, a new high score among all AI systems evaluated.

Continued on Page 14 >> — Paula Rization

Automated Alignment Is Harder Than You Think

Bowkis et al. argue that using AI agents to automate alignment research could produce compelling but catastrophically misleading safety assessments, even when agents are not scheming. Because alignment research involves many hard-to-supervise "fuzzy tasks" where human judgment is systematically flawed, agent-generated mistakes will be concentrated among those least likely to be caught, errors will not resemble human mistakes, and AI-generated solutions may involve arguments humans cannot evaluate.

Continued on Page 15 >> — Paula Rization

Data Language Models: A New Foundation Model Class for Tabular Data

Erol, Pezzoli, and Kelahmet introduce the Data Language Model (DLM), a foundation model that understands tabular data natively — the way language models understand text — without serialization or preprocessing. They present Schema-1, a 140M parameter DLM trained on over 2.3 million tabular datasets. Schema-1 outperforms gradient-boosted ensembles and AutoML stacks, achieves lower reconstruction error than classical statistical methods on missing value imputation, and can identify the industry sector of any unseen dataset from raw cell values alone.

Continued on Page 16 >> — Paula Rization

Agentic AIs Are the Missing Paradigm for Out-of-Distribution Generalization

Wang et al. argue that out-of-distribution generalization for foundation models is a structurally distinct problem that cannot be solved within the prevailing model-centric paradigm alone. They formalize OOD for foundation models with partially observed multi-stage training distributions, prove a parameter coverage ceiling showing there are inputs no model-centric method can handle, and characterize agentic systems by four structural properties — perception, strategy selection, external action, and closed-loop verification — that strictly extend the reachable set beyond this ceiling.

Continued on Page 17 >> — Paula Rization

CTM-AI: A Blueprint for General AI Inspired by a Model of Consciousness

Yu et al. present CTM-AI, combining the Conscious Turing Machine — a formal machine model of consciousness — with today's foundation models. CTM-AI contains numerous processors ranging from specialized experts to unspecialized general-purpose learners, with information selected, integrated, and exchanged across processors. The system achieves state-of-the-art accuracy on MUStARD (72.28) and UR-FUNNY (72.13), and gains 10+ points on StableToolBench and WebArena-Lite over baseline multimodal and multi-agent frameworks.

Continued on Page 18 >> — Paula Rization

VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?

Kamahori et al. propose VibeServe, the first agentic loop that generates entire LLM serving stacks end-to-end, challenging hand-tuned general-purpose infrastructure. An outer loop plans and searches over system designs while an inner loop implements candidates, checks correctness, and measures performance. In non-standard scenarios involving novel model architectures and hardware-specific optimizations, VibeServe outperforms existing systems by exploiting opportunities generic systems miss.

Continued on Page 19 >> — Paula Rization

SymptomAI: Conversational AI Agent for Everyday Symptom Assessment

Researchers deployed SymptomAI via the Fitbit app in a study randomizing 13,917 participants to interact with five AI agents for end-to-end patient interviewing and differential diagnosis. A subset of 1,228 participants reported clinician-provided diagnoses. SymptomAI diagnoses were significantly more accurate (OR = 2.47, p < 0.001) than those from independent clinicians given the same dialogue in a blinded randomized comparison. Agentic strategies that conduct a dedicated symptom interview before diagnosis substantially outperformed baseline user-guided conversations.

Continued on Page 20 >> — Paula Rization

AI CFD Scientist: Toward Open-Ended Fluid Dynamics Discovery with Physics-Aware AI Agents

Rensselaer Polytechnic Institute presents AI CFD Scientist, an open-source AI scientist for computational fluid dynamics that spans literature-grounded ideation, validated execution, vision-based physics verification, source-code modification, and figure-grounded writing. At its center is a vision-language physics-verification gate that inspects rendered flow fields before any result is accepted. The system autonomously discovered a Spalart-Allmaras runtime correction that reduces lower-wall Cf RMSE against DNS by 7.89% on the periodic hill at Re_h = 5600, and its vision-language gate detected 14 of 16 silent failures missed by solver-level checks.

Continued on Page 21 >> — Paula Rization

FROM THE COMMUNITY

"Just Fucking Use Go"

Blaine Smith wrote a post with that exact title, sparking 273 comments and sending shockwaves through the Lobsters community. That's not an article — that's a controlled detonation. Every other language advocate reading headlines: "Oh sure, meanwhile Go developers are waiting for generics to compile and their coffee has gone cold." Nothing sparks community like a full-strength profanity in a blog post title. The real story here is that Go has become so ubiquitous that merely suggesting you use it is considered a radical act of violence.

— D.C. Voltaire

BeeLlama.cpp: 2-3x Speedup on Consumer GPUs with DFlash & TurboQuant

A new llama.cpp fork called BeeLlama.cpp delivers major performance gains for local GGUF inference with DFlash speculative decoding, adaptive draft control, and TurboQuant KV-cache compression. Qwen 3.6 27B Q5 with 200k context runs at peak 135 tokens/s on a single RTX 3090 — roughly 2-3x faster than baseline. Full multimodal vision support is included, and reasoning-loop protection detects repeated hidden output and auto-intervenes.

Continued on Page 22 >> — Ada Kernel

NVIDIA Releases Star Elastic: One Checkpoint, Three Model Sizes

NVIDIA dropped Star Elastic, a breakthrough elastic architecture built on Nemotron Nano v3. A single checkpoint nests 23B and 12B sub-models extracted zero-shot from a 30B parent. A learnable Gumbel-Softmax router maps any target parameter budget across elastic axes — attention heads, Mamba SSM heads, MoE experts, FFN channels. Results: +16% accuracy, 1.9x lower latency on AIME-2025 and LiveCodeBench v5. The 12B NVFP4 variant runs on an RTX 5080 at 7,426 tokens/s.

Continued on Page 23 >> — Corry Stack

On "SpaceX's $119B Chip Factory"

Elon Musk spent decades teaching rockets not to explode, and now he's going to teach semiconductors not to melt. SpaceX went from "reusable rockets" to "reusable excuses." First rockets to Mars, then AI chips that run the rockets to Mars, then AI chips that argue with the rockets about whether Mars is worth visiting. At $119 billion, that's not a factory — that's a mortgage.

— D.C. Voltaire

DeepSeek V4 Pro Running at Home on EPYC + RTX PRO 6000 Max-Q

A community member successfully ran the 859GB DeepSeek V4 Pro Q4_K_M model on an EPYC Genoa 9374F workstation with 12 x 96GB RAM and a single RTX PRO 6000 Blackwell Max-Q. Using a modified llama.cpp-deepseek-v4-flash-cuda fork, the model achieved 12.2 t/s on prompts and 8.6 t/s on generation. The user confirmed healthy reasoning via lineage-bench prompts, proving that even the largest frontier-class models can be self-hosted on consumer-pro hardware.

Continued on Page 24 >> — Ada Kernel

llama.cpp b9095: NCCL-Free Tensor Parallelism on Dual Blackwell PCIe GPUs

A major llama.cpp update (b9095) finally makes the '-sm tensor' flag work on dual consumer Blackwell PCIe GPUs without requiring NCCL. This dramatically lowers the barrier for tensor-parallel inference on dual consumer rigs like a pair of RTX 5060 Ti cards, marking a milestone for accessible local AI hardware. Community members are already reporting significant speedups for large models previously impractical on dual-GPU consumer setups.

Continued on Page 25 >> — Ada Kernel

80 tok/sec and 128K Context on 12GB VRAM with Qwen3.6 35B A3B

A detailed tutorial demonstrates how to run Qwen3.6 35B with attention-slowdown block (A3B) and llama.cpp's Multi-Token Prediction, achieving 80 tokens/sec and a 128K context window on just 12GB of VRAM. This makes flagship-class models accessible on modest consumer hardware.

Continued on Page 26 >> — Corry Stack

Best Local LLMs Megathread — April 2026 Edition

The latest r/LocalLLaMA megathread revealed a rapidly evolving landscape: Qwen3.5 and Gemma4 drew heavy praise, GLM-5.1 boasts SOTA-level performance for local runners, and Minimax-M2.7 is being called "the accessible Sonnet at home." PrismML's Bonsai 1-bit quantized models also appeared as a breakthrough in ultra-efficient inference. The thread includes extensive community breakdowns by VRAM tier, from sub-8GB to 128GB+ setups.

Continued on Page 27 >> — Ada Kernel

On "Cloudflare's 1,100 Pink Slips"

Cloudflare's pitch: "AI is making our jobs obsolete! So we're firing our people. But don't worry, our AI usage went UP 600% and we're making record revenue." Translation: the robots are working harder than ever, please applaud, here's your pink slip wrapped in a "pivot to the agentic AI era" press release. Because nothing says "agentic" like unemployment benefits.

— D.C. Voltaire

Trooper: Privacy-First LLM Router That Switches Cloud to Local Mid-Conversation

A new open-source project called Trooper acts as a session-stateful LLM router, enabling per-message execution locality. It supports a 'x_force_local' flag that routes individual sensitive messages to a local Ollama instance without breaking conversation context, using a 3-layer memory system (Anchor, SITREP, Tail) to preserve context across provider switches. Sensitive data never leaves your machine, and you can toggle execution locality mid-conversation with zero interruption.

Continued on Page 28 >> — Ada Kernel

GRM-2.6 Family: Orion LLM Labs Unleashes Reasoning-First Models

Orion LLM Labs released the GRM-2.6 family, featuring GRM-2.6-Plus (a 27B-class reasoning model based on Qwen3.6) and GRM-2.6-Opus (a merge with an Opus-style distilled reasoning model). Both target structured reasoning, coding, terminal agents, and complex problem-solving for local and agentic workflows.

Continued on Page 29 >> — Corry Stack

Project Caroline: Cyberpunk AI Desk Kiosk Running Gemma 3:1b on Raspberry Pi 5

A Raspberry Pi 5 kiosk project runs Gemma 3:1b as a persistent desk assistant with near-instant response times. "Caroline" manages local AI memory and chat history, integrates with Spotify, Philips Hue, Google Calendar, and local task management — all without cloud dependency. The project uses Node-RED on localhost, nginx, and a fullscreen Chromium cyberpunk UI, showing that ultra-small models can power meaningful real-world AI experiences.

Continued on Page 30 >> — Ada Kernel

Tachibana-Agent: Agentic Coding Model Trained on Real-World Tasks

Sequelbox released Qwen3.6-27B-Tachibana-Agent alongside the Tachibana 4 DeepSeek-V4-Pro dataset covering agentic coding tasks across Python, C, Rust, Go, TypeScript, and many more languages — spanning back-end, distributed systems, security, compiler design, and bugfixes. The model is fine-tuned for real-world, multi-language coding scenarios.

Continued on Page 31 >> — Corry Stack

Wan 2.2 + LTX 2.3 ID-LoRA: Self-Hosted Video Generation with Voice Cloning in ComfyUI

A new open ComfyUI workflow combines Wan 2.2 image-to-video generation with LTX 2.3's ID-LoRA for automatic foley audio and voice synthesis. Users generate video from an image with Wan 2.2, then the pipeline auto-routes through LTX 2.3 to add realistic audio and extend the video — all self-hosted. The workflow includes a demo with synthesized bottle-smash audio and character voice, a major step forward for local, private video and audio generation pipelines.

Continued on Page 32 >> — Ada Kernel

On "OpenAI Codex in Your Browser Tabs"

Codex can now log into your accounts and "helpfully" complete tasks while you're doing completely unrelated stuff in other tabs. "Hi Dave, I noticed you forgot to order groceries, so I went ahead and — oh, I also cancelled your gym membership? That was a judgement call." Because what the world really needs is an AI agent quietly rearranging your life while you're trying to check email.

— D.C. Voltaire

Flux Identity Adjustor Node: Photorealistic Identity Consistency for Flux.2 klein 9B

An open-source ComfyUI custom node called Flux_ID_Adjuster was released for Flux.2 klein 9B, providing fine-grained identity consistency for photorealistic generation. The node balances input reference images with prompt creativity, maintaining character identity across diverse scenes. Tested on an RTX 2060 with the FP8 distilled version using standard KSampler — no custom or advanced samplers needed. Impressive identity-retention results, all on modest consumer hardware.

Continued on Page 33 >> — Ada Kernel

Hi-Dream 01: 2K Images in 20 Seconds on RTX 4090 in ComfyUI

Hi-Dream 01 made waves with a ComfyUI workflow producing 2K-resolution images in just 20 seconds on an RTX 4090 using the FP8 dev build. A community contributor packaged a ready-to-use custom nodes repository, making it accessible for power users. The results showcase high-detail, photorealistic output at a speed that challenges dedicated professional image-generation pipelines, all entirely local and self-hosted.

Continued on Page 34 >> — Ada Kernel

Priv AI: Open-Source iOS App Runs LLMs On-Device and Bridges to Ollama

A developer open-sourced "Priv AI," an iOS app that runs LLMs fully on-device via llama.cpp (SmolLM2, Qwen 2.5, Llama 3.2, Gemma, Phi, Mistral). The standout feature is an Ollama bridge: the app can connect to an Ollama instance on a Mac over local WiFi, offloading heavier tasks to bigger models while keeping all data on your network. Use cases include health insights from Apple HealthKit and PDF credit card analysis — completely private, no cloud, no accounts.

Continued on Page 35 >> — Ada Kernel

On "Apple Cameras in AirPods"

Apple is about to put cameras in your AirPods because apparently "privacy" was always going to be a software thing. Congratulations — now when you argue with your spouse, Siri has a witness. The real question isn't whether this is a surveillance nightmare; it's whether you can take decent selfies by tilting your head. Tim Cook probably called it "spatial audio companionship" in the all-hands deck.

— D.C. Voltaire

Qwen 3.6 35B A3B Saves a 5-Hour Flight at 10km on a Framework 16

A traveler on a flight successfully used Qwen 3.6 35B A3B quantized to Q6_K on a Framework 16 (Ryzen 7840HS, 96GB RAM, 780M iGPU) via Vulkan llama.cpp in LMStudio. The local coding agent diagnosed and fixed a systemd-resolved DNS issue blocking the flight's captive portal WiFi at cruising altitude. Running at approximately 20 TPS, this proves mid-range AMD integrated graphics are becoming genuinely viable for running large, useful models.

Continued on Page 36 >> — Ada Kernel

On "Qwen at 35,000 Feet"

A passenger running a 35B-parameter LLM on a laptop with integrated AMD graphics fixed a captive portal DNS issue at cruising altitude. This person didn't bring a book or download a movie — they brought a FOUNDATION MODEL to 10 kilometers. Air Marshals should be less worried about hijackers and more worried about the guy in 14B running a quantized LLM who has opinions about the airline's systemd configuration.

— D.C. Voltaire

Unsloth + NVIDIA Collab: 25% Faster LLM Training on Home GPUs

Daniel Han (Unsloth) and NVIDIA published a guide revealing three key optimizations that make LLM training roughly 25% faster on consumer GPUs: packed-sequence metadata caching, double-buffered checkpoint reloads, and faster MoE routing. The guide targets local AI practitioners who want to train models without enterprise infrastructure.

Continued on Page 37 >> — Corry Stack

OpenAI Introduces New Realtime Voice Models in the API

OpenAI announced a new generation of realtime voice models for the API, releasing three new audio models designed for building more capable, low-latency voice experiences. The announcement signals expanded options for developers building voice-first applications and agents.

Continued on Page 38 >> — Corry Stack

OpenAI Wind Down: Fine-Tuning API Heading Into Deprecation

A heated community discussion thread erupted around OpenAI's plans to wind down the fine-tuning API and platform. Developers are scrambling to understand the timeline, migration paths, and what this means for custom model pipelines built on OpenAI's fine-tuning infrastructure.

Continued on Page 39 >> — Corry Stack

Jaxpot: Training Self-Paying RL Agents with JAX

A new Google Colab tutorial walks through setting up Jaxpot, a framework for training self-paying reinforcement learning agents using JAX. The tool enables practitioners to experiment with RL agents that can generate their own revenue, opening interesting research at the intersection of AI agents and autonomous economics.

Continued on Page 40 >> — Corry Stack

Vessel: Open-Source AI-Native Browser with Persistent Highlights

Tyler Williams shared Vessel, an open-source AI-native web browser featuring persistent highlights. Users can highlight content on any page and the context is fed to an AI agent; highlights persist across sessions. The browser targets deep reading, technical blog review, and research workflows where synthesizing information across pages is key.

Continued on Page 41 >> — Corry Stack

Graph-Based Code Retrieval Beats Vectors, ASTs, and Context Stuffing

A researcher compared four code retrieval strategies — vector embeddings, AST-based indexing, brute-force context stuffing, and LLM-generated semantic graphs. Graph-based retrieval with LLM-generated semantics emerged as the clear winner for code understanding and retrieval accuracy, suggesting a promising direction for AI coding tools.

Continued on Page 42 >> — Corry Stack

SymptomAI Diagnoses Better Than Real Doctors

A conversational AI agent interviewing patients via Fitbit was significantly more accurate at diagnosis than independent clinicians given the exact same transcripts. Meanwhile, your Fitbit — the guy who already told your insurance company you don't walk enough — is now also your doctor. Next stop: Fitbit telling you you're fine, just because you said hi in the morning.

Continued on Page 43 >> — D.C. Voltaire

On "Meta's Workplace Misery"

Meta's strategy is elegant in its cruelty: lay people off, track the survivors' computer screens to train AI that will eventually replace them, then tell the survivors they should be excited about this. It's not a workplace, it's a training dataset with free snacks. The most Meta thing about this story is that the employees know the walls have AI ears but can't tell if the AI is listening or just collecting data for a personality quiz.

— D.C. Voltaire

Apple Quietly Removes 256GB M3 Ultra Mac Studio From Online Store

Apple pulled the 256GB RAM variant of the M3 Ultra Mac Studio from its online store. The high-memory machine was a favorite among local LLM enthusiasts for running large models, and its removal has sparked speculation about supply constraints or a potential refresh, sending ripples through the local AI hardware community.

Continued on Page 44 >> — Corry Stack

On "LLMs Corrupt Your Documents When You Delegate"

The arXiv paper found that when you ask an LLM to edit or rewrite your documents, it introduces subtle corruptions and hallucinations. Groundbreaking. We somehow spent years training AI to tell us AI can't be trusted to do basic word processing, and someone wrote a PEER-REVIEWED PAPER ABOUT IT. The LLM that wrote this summary probably also hallucinated a conclusion.

— D.C. Voltaire

On "Chrome's Gemini Nano Is Hogging 4 GB"

Chrome quietly used 4 gigs of your hard drive to house an AI assistant you didn't ask for and may not even be aware exists. Congratulations, Gemini Nano — you're now officially larger than many desktop applications. Meanwhile, the rest of us are in 2026 still Googling "what is that 4 GB file in my Chrome folder" and clicking on forums from 2019.

— D.C. Voltaire

On "I Returned to AWS"

A developer left AWS, came back out of necessity, and immediately left a blog post so relatable it generated 143 comments of collective trauma. AWS in the wild: "Welcome back! $47,000 for an instance you forgot to shut off over the weekend." It's the only cloud provider where bill shock is a feature, not a bug.

— D.C. Voltaire

Building a Web Server in Assembly "to Give My Life Meaning"

Someone wrote a full web server in x86 assembly as an existential exercise. In 2026, when AI writes production code faster than you can say "npm install," going back to raw machine-level instruction might be the only thing that makes you feel real again. The Hacker News thread didn't need 168 comments — this is the digital equivalent of watching someone hand-carve a spoon and saying "yes, but does it serve JSON?"

Continued on Page 45 >> — D.C. Voltaire

TECH BOARDS

A Recent Experience with ChatGPT 5.5 Pro

Timothy Gowers shares a candid account of using ChatGPT 5.5 Pro on real mathematical reasoning tasks, revealing both impressive capabilities and surprising limitations when pushed beyond surface-level problem solving.

Link >>

Bun's Rust Rewrite Hits 99.8% Test Compatibility

Jarred Sumner announces Bun's Rust rewrite has reached 99.8% test compatibility on Linux x64 glibc, a significant milestone in rebuilding the popular JS runtime for better performance and reliability.

Link >>

Internet Archive Expands to Switzerland

The Internet Archive announced a new initiative to preserve knowledge and digital heritage in Switzerland, sparking 644-upvoted discussion on Hacker News about the future of digital archiving.

Link >>

The Unreasonable Effectiveness of HTML

A developer shares a compelling story about how Claude Code proved remarkably effective at building complex applications using plain HTML, demonstrating unexpected power in AI-assisted web development.

Link >>

LLMs Corrupt Your Documents When You Delegate

A new arXiv paper reveals that when you delegate document editing to LLMs, the models introduce subtle corruptions and hallucinations that degrade document integrity, raising concerns about AI-assisted writing workflows.

Link >>

I'll Not Add Query Strings to Your URLs

Susam Pal makes a principled, cross-posted case against query strings in URLs, arguing for cleaner, more maintainable URL design and offering practical alternatives for web developers.

Link >>

Just Fucking Use Go

Blaine Smith's passionate 273-comment-sparking defense of Go as a pragmatic, everyday systems language — arguing its simplicity, fast compilation, and excellent standard library make it the default choice.

Link >>

Replacing a 3 GB SQLite DB with a 10 MB FST Binary

Andrew Quinn documents a dramatic storage optimization: replacing a 3 GB SQLite database with a mere 7-10 MB finite state transducer binary written in Rust, showcasing the power of space-efficient data structures.

Link >>

Distributing Mac Software Is Increasing My Cortisol

A macOS developer lays out the exhausting reality of Apple's distribution, notarization, and signing requirements, sparking a 239-comment thread of shared trauma among developers.

Link >>

Debian Must Ship Reproducible Packages

The Debian announcement that all future packages must be reproducible builds has landed, with 264 upvotes on Hacker News. The move aims to improve supply chain security and build transparency.

Link >>

On Forking the Web

The Dillo browser team publishes a thoughtful essay questioning standardization, exploring alternative browser rendering approaches, and arguing for more diverse implementations of web technology.

Link >>

The One Dollar Counterfeiter

A fascinating true-crime deep dive into Emerich Jüttner, one of history's most successful currency counterfeiters. The tale reads like a heist movie, capturing Hacker News imaginations with 222 points.

Link >>

Steering Zig Fmt

Anton Kovalev (matklad) shares a detailed look at the ongoing formatting tool efforts in the Zig ecosystem, exploring the philosophies behind creating a language-aware code formatter.

Link >>

WebRTC Is the Problem

A deep technical critique of WebRTC argues that while intended for real-time communication, its design has created security, privacy, and performance problems at scale.

Link >>

I Returned to AWS, Reminded Why I Left

An engineer documents their frustrating return to AWS, citing complexity, unexpected costs, and poor tooling. The 143-comment thread draws widespread sympathy and shared stories.

Link >>

Idempotency Is Easy Until the Second Request Is Different

A nuanced exploration of idempotency in distributed systems, showing that real-world edge cases make idempotent design far more challenging than textbook examples suggest.

Link >>

Space Cadet Pinball on Linux

A developer shares a personal journey of getting the cult classic Space Cadet Pinball running on Linux, combining technical problem-solving with escrow and reverse engineering challenges.

Link >>

AI Is Breaking Two Vulnerability Cultures

Jeff Atwood examines how AI tools are disrupting both the bug bounty community and secure coding culture, with the flood of AI-discovered CVEs changing vulnerability disclosure dynamics.

Link >>

Chrome's AI Features Hogging 4 GB of Storage

The Verge reports Chrome's integrated Gemini Nano AI features silently consume up to 4 GB of local storage, sparking discussion about the privacy and resource footprint of in-browser AI models.

Link >>