Curiosity Builds! / Handoff Stories

Handoff Stories

What happens when parts of a task move from a person to a machine?

The Handoff framework (Mulligan & Nissenbaum, 2020) asks: when parts of a task move from a person to a machine, even if the job gets done, something changes. Who's responsible? Who learns? Who has control?

These are stories from Curiosity Builds!—small projects we've made with AI. For each, we try to notice: what got delegated, what stayed human, and what we learned about the handoff.

How to use these stories

  1. Map the handoff: What got delegated? What stayed human? What tradeoff did that create?
  2. Practice curious critique: Why might it be built this way? What becomes visible/invisible? Who can contest it?
  3. Propose a repair: What would you change next to improve learning, care, or contestability?

(Stubs are intentional—these stories are living notes from building. Repair is part of the practice.)

Building with AI is itself a handoff. We can create tools without deep expertise in the underlying technology. That changes who can build—and what gets built.

We try to keep responsibility visible: who can contest a decision, who can repair a tool, and who learns from mistakes.

#

Wild Readeras of January 22, 2026

Wild Reader screenshot showing phonics game interface

Daniel: I made Wild Reader for my three- and five-year-old boys to help them with early literacy. It's a phonics tool with little games for matching letters, then words, then faces with names and reading sight words.

One fun thing: my five-year-old is pretty critical of the mobile user experience and really pushed me to make that part better. There's a reward for getting five questions in a row correct—you can create an image with Gemini's Nano Banana. The kids love making things like "a monster truck with tires made out of cakes." Previously I'd let them use my phone to make images with AI—I was the gatekeeper. Now the system gates it: five correct answers earns an image. That's a handoff: setting ground lines so I don't have to say "let me have my phone back."

Another handoff: audio dictation for non-readers. My three-year-old can't read yet, but he can speak his image prompt into the tool.

Handoffs

What happens when the system sets the ground lines?

The AI image reward gates access—five correct answers earns a prompt. I used to be the gatekeeper. Now the system is. What changes when I'm not the one saying "that's enough"?

What happens when non-readers can participate?

Audio dictation lets my three-year-old speak his image prompts. He can't read yet, but he can use the tool. What new participation does that enable—and what might we miss?

What behaviors does the reward structure train?

Accuracy, persistence, speed, compliance—and what does it crowd out? The reward structure is itself a pedagogical handoff.

Repair / Next

Working on AI-generated audio for prompts, fixing conversational context issues. Removed bold styling from lowercase letters that was confusing kids.

Visit Wild Reader

#

Pronouncleas of January 22, 2026

Pronouncle screenshot showing pronunciation challenge

Daniel: A sister had tried Spell Better Now and Wild Reader and was wondering about a pronunciation tool. So I started building one. The handoff here is using speech-to-text to check pronunciation—handing the "did they say it right?" judgment to an AI system.

What I discovered after making that handoff: speech-to-text is designed to understand what you meant, not to catch when you say it wrong. A human tutor notices when pronunciation is off. The AI tries to succeed at hearing you correctly—that's what it was built for. The tool revealed its true purpose through failure.

Handoffs

What do you learn after you hand something off?

The handoff is using STT for pronunciation checking. The discovery—that STT is designed to understand, not detect errors—came after making the handoff. Sometimes you learn what a tool is really for by trying to use it for something else.

What workarounds does that force?

Full sentences instead of single words (so the microphone captures enough audio). Sentences that don't give away the answer. Or maybe alternatives like "what does this word rhyme with?"—avoiding speech entirely.

Repair / Next

Next experiment: compare (a) STT confidence scores + phoneme hints vs (b) rhyme-based prompts that avoid speech recognition entirely.

Visit Pronouncle

#

Popmom's Lending Libraryas of January 22, 2026

Popmom's Lending Library screenshot showing inventory

Daniel: My kids like to borrow Hot Wheels cars from my mom. They can only borrow one—it's a lesson in responsibility. They can loan her a toy and then borrow one of hers. She has a lot.

I made a comment: "Maybe if we made a library and took pictures, like a lending library, then we could better track what cars you're using." The fun part was the hands-on experience with my five-year-old—taking pictures of the cars and giving them names together. Building the tool was part of the point.

The kids don't use it independently—they don't have phone or computer access. The visibility is for me and Popmom.

Handoffs

What changes when lending becomes visible—and to whom?

The system makes the relationship trackable. But does that formalize something that was fine informal? Or does it open new possibilities—seeing all the cars, choosing more deliberately? The visibility creates new affordances we need to be aware of.

What happens when the record disagrees with memory—who gets believed?

The system becomes an authority. When the tool says "this car was borrowed last week" and someone remembers differently, who wins?

Repair / Next

Design decision to test: should the kids have direct access, or is the visibility being mediated through adults part of the design?

Visit Popmom's Lending Library

#

Saw The Ballas of January 22, 2026

Saw The Ball screenshot showing baseball positioning game

Daniel: A baseball positioning game built with a 10-year-old nephew. You choose a position, get a scenario (runner on second, two outs, you play first base), and when the ball is hit you have three seconds to click where you're going to go.

The fun part was the brainstorming and design thinking—helping him identify what he wanted to build and how to make it work. The goal is to help you think about what you should be doing—to ingrain the knowledge before you're on the field.

Handoffs

Whose knowledge gets encoded?

This is something your friends, parent, or coach would tell you. Maybe they would disagree with what the system says. When the system says "go to second base," is that the right answer? What if your coach teaches something different?

What changes when practice becomes time-pressured and graded by the system?

The 3-second click is a design decision. A coach might let you think longer, or ask follow-up questions. The system imposes a single mode.

Repair / Next

  • Add explanation mode: after you click, show why + alternate coach philosophies
  • Improve gameplay mechanics
  • Better background images

Visit Saw The Ball

#

Scheduler Markas of January 22, 2026

Scheduler Mark screenshot showing benchmark results

Daniel: This one started from helping a brother-in-law manage his adult soccer league schedules. Complex constraints—multiple teams, field availability, player preferences. AI seemed promising, but the results were inconsistent. That inconsistency became interesting.

So I turned it into an exploratory benchmark. Each model receives the same long, messy natural-language scheduling request. Each produces an HTML solution with a proposed schedule and explanation. Then every model reviews every other model's work and answers: "Is this solution correct? Why or why not?"

There's no ground-truth checker. No enforced rubric. The models generate the solutions and the critiques entirely on their own.

Handoffs

Can AI judge AI?

The benchmark explores what happens when you hand off judgment to the models themselves— with no human rubric in between. What does agreement or disagreement tell you?

The rubric is the prompt itself.

When models interpret the prompt differently, who decides which interpretation is correct—and what does their agreement or disagreement reveal?

Repair / Next

Exploring what the benchmark reveals about model consistency and where human judgment still needs to anchor the evaluation.

Visit Scheduler Mark

#

TryAgBuddyas of January 22, 2026

Daniel: Built with my niece, who wants to have a small ranch or farm. She needs to learn how to work on her house, truck, fence line, animals—small ag work.

Rather than explaining to a chatbot each time or working within the confines of a chat system where you repeat your searches and save documents elsewhere, we built a tool that remembers her context. She inputs a project, gets dynamic questions, sees her previous context, and the AI suggests updates she can accept or reject. Then it prepares a report she can discuss and refine.

Handoffs

Who adapts to whom—you to the tool, or the tool to you?

In a chatbot, you adapt to its interface. In a bespoke tool, the tool adapts to your workflow. You don't have to re-explain yourself. Your work stays organized in a way that makes sense to you.

You control the context.

You accept or reject what the AI suggests. The tool adapts to your workflow, not the other way around.

Repair / Next

Continuing to build out the tool as she discovers what she needs. The point is that it grows with her workflow.

Visit TryAgBuddy

#

Stop Motion Planneras of January 22, 2026

Daniel: Built with my 13-year-old nephew who has a YouTube channel for stop motion animation. The tool uses LLMs to help plan the shots—generating scripts from ideas and shot lists with ASCII sketches.

We built a tool with AI-generated scripts and shot sketches. But we don't yet know if those outputs would actually help his workflow. He's made many videos—his process is practiced, embodied. The handoff attempt surfaced what we need to learn: his tacit process, what he actually needs.

Handoffs

What does the handoff attempt reveal?

Trying to build AI assistance surfaced legibility gaps. We don't yet understand his workflow well enough to know what would help. The handoff attempt itself is the learning.

Repair / Next

  • Learn more about his existing workflow and videos
  • Explore: identifying style from his existing videos
  • Explore: using his videos to improve the AI outputs
  • Figure out what kind of structuring (if any) would actually help his process

Visit Stop Motion Planner