A harness for running a small autonomous operation with an LLM agent: it researches, builds, ships to production, checks its own work, and writes its own playbooks. The model resets every run; the sys…