01
AI Hacker Daily
Today
05
picks
The dev stack is being re-poured for a user that never sleeps.
02
Block's Buzz puts the agents in the room
03
CubeSandbox — a place for agent code to actually run
04
PMB gives the agent a memory that never leaves your disk
05
Conduit keeps your MCP servers out of the context window
One of these,
every weekday.
Free. Unsubscribe by replying with one word. No tracking pixels in the email.
Archive
2026-06-19
The model isn't the moat anymore; the edge is what you load into it. The coding agents converged this week — kilocode crossed 22k stars, and the Agent Skills format (a plain SKILL.md folder, released by Anthropic as an open standard) now loads in Cursor, Copilot, Codex, Gemini CLI, Goose, and Claude Code alike. When every agent reads the same skills, the differentiation moves up a layer: which skills you install, and what context you feed. Today is five bets on that layer — a catalog OpenAI made official, a 43-skill brain you install by telling your agent to, 500-odd skills that make a coding agent a video studio, a local engine that now serves skills off your own box, and the contrarian one that wins by handing the agent less.
5 picks
2026-06-18
Your agent stopped suggesting code and started running it. Today's launches are all fence, no engine. The shift that makes them matter is "code mode" — agents like VLM Run's Orion 2 now write a whole program and execute it end to end instead of asking permission one tool call at a time. The unit of risk used to be a function call you could approve; now it's a script that already ran, plus whatever it installed and whatever it read on the way. So the interesting work this week isn't a smarter agent — it's the perimeter around a dumb one you can't fully trust: a box to run it in, a leash on what it installs, a blindfold over what it sees. (There's even a benchmark now, islo-labs' RewardHackBench, for measuring whether the box actually holds when the agent tries to cheat its way out.)
4 picks
2026-06-17
Editor's note: You stopped writing the code; now the job is supervising whatever did. HN's top post all day was "Running local models is good now" (1,359 points) — last week's local-coding wave arriving as a fact of life: more code, generated faster and cheaper, by something that never re-reads its own diffs. Two of today's Show HNs are, independently, "code review that runs the code." The slate underneath is the supervision stack that gap demands — understand what's there, review it before it lands, run it to prove it works, watch the agents once they're live, and gate what they're allowed to touch. Dropped: the model launches (Microsoft's Fara-7B, GLM-5.2, Qwen-Robot), the $60B SpaceX-buys-Cursor headline, one more Claude Code fork (openclaude, 29k stars and still a fork), and Agent-Reach — giving an agent free run of the whole internet is the opposite of today's instinct, however many stars it's pulling.
5 picks
2026-06-16
Editor's note: The server in the middle had a bad week. HN's third-ranked post all day was someone asking whether anyone has actually replaced Claude with a local model for daily coding — 1,033 points, 454 mostly-it-depends answers — sitting right above a homelab AI dev-platform writeup. The products trending underneath answered the same instinct at every layer of the stack. Iroh shipped 1.0 of a network where the relay is optional. Kage takes a whole website with you as one offline binary. Macro is the team workspace as an AGPL repo you host yourself. Revi is dictation that never leaves the laptop. The kicker reclaims the disk your AI tools ate. Dropped: machine0 (renting someone else's VMs is the opposite move, however good the NixOS support), Peek (a genuinely novel database canvas, but seven stars and the peer-to-peer claim I wanted to cite wasn't in the repo), and the whole Fable-5-is-down news cycle, now sprouting its own watch-page cottage industry.
5 picks
2026-06-12
The ground floor of the dev stack got re-poured this week
6 picks
2026-06-11
Agent skills become a supply chain: a registry, a breakout package, a security scanner, governance — and the ops desk for the agents consuming them
6 picks
2026-06-09
The slop backlash arrives as tooling: de-generic your UI, your diffs, and yourself
6 picks
2026-05-24
The diff stops being the unit of work: as one developer runs a fleet of agents, the useful tooling moves off the editor onto running them (cmux), routing between them (plano), reviewing intent instead of lines (mainline), and carrying context across them (coremem) — and Anthropic spreads the same agent pattern past code into all knowledge work.
5 picks
2026-05-15
Agents went ambient this week, so the useful work is the discipline layer
5 picks
2026-05-14
Anthropic professionalizes both ends, the community ships escape hatches
5 picks
2026-05-12
The defensive aftermarket for agent-coded software
5 picks