Did I Just Experience a >40X Productivity Gain? If Yes, What Does This Mean?

Six experimental web pages bursting from a sandbox

My daughter was in the back seat eating pizza, our dog settled beside her. I was driving our favorite route around the city lakes, the parkway lit up with city lights and Christmas decorations reflecting on the dark lake ice. Noise-cancelling AirPods in, listening to Lenny's podcast about AI agents for the second time.

About a week earlier during my first listen to this episode I could sense that the content was triggering some thoughts but I wasn't able to translate these thoughts into action. The second time through, I knew the conversation well enough that my mind could wander. I could let the things they were talking about generate connections. It was quiet, peaceful, the kind of drive where I could just think.

I'd been asking my wife to help me figure out how to reposition Refrigerator Games.[1] What's running in production today is exactly what existed six years ago when I turned off the servers. Frozen in time. Unfortunately, my wife has been too busy to help so this project has just been sitting there waiting.

But driving around those lakes, I realized: I didn't need to wait. I'd been reading about Simon Willison's experiments with Claude Code for the Web and I suddenly knew what to try.

I came home, sat down at my laptop, and just did it.

Thirty minutes later, six completely different landing page experiments were live in production. Not sketches. Not mockups. Fully functional pages with generated images, varied design systems, and radically different positioning strategies with each targeting different age groups and parent pain points.

View all six experiments here

My head exploded. I've felt significant productivity gains using other AI coding tools, but this was majorly different. Did I just experience a >40X productivity gain? Thirty minutes vs. at least 40 hours to build these six experiments by hand. If yes, what does this mean?


I've written hundreds, maybe thousands, of prompts for Claude Code CLI over the past months while resurrecting StartUpWebApp and running other experiments with AI tools. I know how to write prompts to get what I want from Claude Code CLI. But I'd never tried Claude Code for the Web which is the browser-based version that runs in a sandboxed container instead of on my local machine.

When I got home, I opened https://claude.ai/code, pointed it at the private Refrigerator Games client repository, and gave it this prompt:

I want you to explore the content in this repository to learn about the refrigerator games client application. Once you've learned about the existing client application and the apis that it calls on the backend, I want you to create between 4 and 8 new landing pages that are designed to appeal to parents of kids between the ages of 2 and 16. These landing pages should explain how using Refrigerator Games products (food cards and supporting products) can help them in their house. You can decide what specific age kids each landing page targets, you can decide what type of content to include, you can generate new images if needed. Follow the general patterns established within the project where it makes sense, but the goal with these eight new landing experiences is that they should test radically different approaches to how families can use these products and how the Refrigerator Games brand and products are presented to potential customers. Put all eight landing pages in a new project root directory called /jan2826exp/. Create a branch and PR that captures your work. Feel free to introduce new front end technologies that allow you to express these new ideas (don't need to stick with the css, jquery, html pattern that exists in the project today). I'm not available to answer questions. You will have to find solutions to design questions on your own.

This was the first prompt I tried. It worked.

Fifteen minutes later, Claude Code had created a branch with six ready-to-test experiments. I pulled the branch to my local machine, tested it briefly, updated the GitHub Action deployment config to include the new directory, merged the PR, and pushed to production. My total time: fifteen minutes for Claude Code for the Web + fifteen minutes for me to test locally and deploy to prod = thirty minutes total.

Before AI-assisted coding, this would have taken me at least a week. With Claude Code CLI, this would have taken me a day or two. With Claude Code for the Web, this took 30 minutes. Every design decision. Every pixel-perfect CSS adjustment. Every image generation. The sheer volume of decisions is exhausting and stops me before I even start.

The Six Experiments

I loved the variety that Claude came up with. Way better than I would have done even with a week's work. Claude read the existing website content that I had created six years ago and creatively remixed it in genuinely different ways:

The experiment landing page showing all six approaches

Toddler Mealtime Peace (Ages 2-4) — Colorful, playful design focused on the parent's pain point: picky eaters and mealtime stress. The emotional tone is about relief.

Toddler Mealtime Peace

Lunch Box Heroes (Ages 5-10) — Comic book aesthetic with gamification and achievement systems. Positions the child as the protagonist of their nutrition journey.

Lunch Box Heros

Fuel Your Game (Ages 11-16) — Dark, modern, sleek design that speaks directly to teens about athletic performance and independence. No parental lecturing.

Fuel Your Game

Screen-Free Parenting (All Ages) — Minimalist design emphasizing analog advantages in a digital-saturated world. Appeals to parents concerned about screen dependency.

Screen-Free Parenting

5-Minute Meal Planning (All Ages) — Clean layout with timeline visualization. Purely efficiency-driven messaging for busy families.

5-Minute Meal Planning

Family Food Game (All Ages) — Celebratory, colorful design positioning the product as entertainment and family bonding, not just a nutrition tool.

Family Food Game

Six different emotional appeals, design systems, and ways to position the same product.


The Sandbox

I would never have given Claude Code CLI the permissions it needed to do this work on my local machine. Claude Code CLI with "YOLO permissions" could severly damage my computer. That risk keeps me cautious.

But Claude Code for the Web runs in a sandbox. If it breaks everything in the sandbox, it has no impact on me. The worst it can do is create a branch on my repo. That constraint is liberating. I can be radically more experimental.

This pushes me into a new category of productivity gains. Previously I was focused on AI-assisted engineering but now I've unlocked a way to vibe code rapid experimentation. on my existing applications.

I used Claude Code CLI for the heavy lifting required to modernize StartUpWebApp and Refrigerator Games—Django upgrades, infrastructure setup, backend API work. I want control for those tasks. But for rapid client-side iteration? Claude Code for the Web is better. It's more like Replit or other "vibe coding" tools that are optimized for speed and experimentation rather than precision and control.

The modular architecture of Refrigerator Games helps a lot. The Refrigerator Games' client application and backend APIs are completely independent. I expect I'll use Claude Code for the Web to iterate quickly on client-side ideas and Claude Code CLI for backend and infrastructure tasks where I want more control.

The Rate Limiter Shift and What This Means for My Workflow

Reading Simon Willison's writings about Claude Code for the Web matched my experience exactly. This is so much better than what was available even four to six months ago.

The velocity change is real: >40X increase in the speed with which I can experiment. Thirty minutes to build six experiments that would have taken me at least a week by hand. That's at least 40 hours of work compressed into half an hour.

Will any of these six experiments be a clear winner? Probably not. But that's not the point. The point is that the bottleneck has fundamentally changed. It's no longer "how fast can I build?" The rate limiter is now "how fast can I get feedback from customers?"

That's a fundamentally different problem to solve. And it's a much better problem to have.

Of course, now I actually have to figure out how to get that feedback. Six different landing pages are sitting in production, and I have no idea which, if any, messaging resonates with parents. What used to be the hard part, building the experiments, is now the easy part. The new hard part is building a scalable and cost-effective way to get feedback. That's what I get to figure out next and maybe my solution to that will become part 2 of this series.


What experiments are you waiting to run because the build time feels too long? What would you test if you could go from idea to production in thirty minutes?


[1]: A product we built eight years ago and shut down six years ago that I recently resurrected as the first fork of my StartUpWebApp codebase.

References

Next
Next

How I Learned to Leverage Claude Code - Part 1: Resurrecting an 8-Year-Old Codebase