SFOS Case Note - Abhishek Singh

SFOS is how I turned solo product work into an automated build-and-quality loop.

SFOS is the operating system around WittyKeys: AI-orchestrated development, automated UI/functionality work, QA regression, evals, observability, and release confidence.

AI-orchestrated dev | Golden screenshots | Espresso / UI Automator | LLM-as-judge | JourneyTracer

Building with AI is powerful, but only if the system keeps quality visible.

WittyKeys was not just an app build. It became a full operating loop: plan with Cowork, implement with Claude Code, deploy to a real Android device, capture screenshots, compare against baselines, run functional tests, score AI quality, inspect logs, and decide what ships.

That is what SFOS means in my work: the structure that lets a solo founder move fast without losing control of quality.

The loop

Automate development, testing, and learning without removing judgment.

Cowork creates phased plans and Claude Code instructions; Claude Code implements, builds, deploys, captures, and reports.
Parallel frontend and backend worktrees let independent AI sessions work without stepping on each other.
Golden screenshot regression catches UI drift through device captures and PixelDiffComparator baselines.
Espresso/UI Automator flows cover functionality across onboarding, keyboard, overlay, chat, tone, grammar, translation, and AI actions.
LLM-as-judge scoring reviews reply quality, tone fit, Hinglish naturalness, and context relevance.
JourneyTracer and backend logs make product behavior easier to investigate after release.
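
To make the golden screenshot step concrete, here is a minimal sketch of a pixel-diff check. It is not the actual PixelDiffComparator, whose internals are not described in this note. The function names, the per-channel tolerance, and the 1% drift threshold are all illustrative assumptions, and images are modeled as plain nested lists of RGB tuples so the example needs no imaging library.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]  # rows of RGB pixels


def pixel_diff_ratio(golden: Image, capture: Image, channel_tolerance: int = 8) -> float:
    """Return the fraction of pixels that differ from the golden baseline.

    A pixel counts as changed when any RGB channel differs by more than
    channel_tolerance (an assumed value that absorbs minor rendering noise).
    """
    same_shape = len(golden) == len(capture) and all(
        len(g_row) == len(c_row) for g_row, c_row in zip(golden, capture)
    )
    if not same_shape:
        return 1.0  # treat a size mismatch as full drift

    total = sum(len(row) for row in golden)
    if total == 0:
        return 0.0

    changed = 0
    for g_row, c_row in zip(golden, capture):
        for g_px, c_px in zip(g_row, c_row):
            if any(abs(a - b) > channel_tolerance for a, b in zip(g_px, c_px)):
                changed += 1
    return changed / total


def matches_golden(golden: Image, capture: Image, max_diff_ratio: float = 0.01) -> bool:
    """Pass when drift stays at or under the assumed 1% threshold."""
    return pixel_diff_ratio(golden, capture) <= max_diff_ratio
```

A failed comparison in a setup like this would flag the screen for human review rather than decide on its own whether the change was intended.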

Development system: planning, implementation, reporting, and review happen through repeatable AI-agent workflows.

Testing system: UI and functionality are checked with goldens, E2E tests, real-device runs, and quality scoring.
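
The quality-scoring part can be pictured as a small aggregation step over judge output. The sketch below assumes the judge model has already returned a 1 to 5 score per dimension for each eval case. The four dimension keys mirror the ones named in this note, but the data shape, the scale, and the 3.5 floor are hypothetical, and no real LLM call is made.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Dict, List

# Dimensions named in this note; the snake_case keys are an assumption.
DIMENSIONS = ("reply_quality", "tone_fit", "hinglish_naturalness", "context_relevance")


@dataclass(frozen=True)
class JudgeScore:
    case_id: str
    scores: Dict[str, int]  # assumed 1-5 score per dimension


def summarize(results: List[JudgeScore]) -> Dict[str, float]:
    """Average each dimension across all eval cases."""
    if not results:
        raise ValueError("no judge results to summarize")
    return {dim: mean(r.scores[dim] for r in results) for dim in DIMENSIONS}


def weak_dimensions(summary: Dict[str, float], floor: float = 3.5) -> List[str]:
    """List dimensions whose average falls below the assumed floor."""
    return [dim for dim in DIMENSIONS if summary[dim] < floor]
```

Reporting weak dimensions by name, instead of a single blended score, keeps a drop in something like Hinglish naturalness from being hidden by strong scores elsewhere.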

Release system: logs, traces, crash/performance data, and review gates keep shipping decisions grounded.
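
One way to picture a review gate is as a function that turns test and telemetry signals into a list of named blockers for a person to read. This is a sketch under stated assumptions: the `ReleaseSignals` fields, the judge-score floor, and the crash-free floor are all hypothetical and not taken from the WittyKeys setup.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ReleaseSignals:
    golden_failures: int     # screenshots that drifted from their baselines
    e2e_failures: int        # failed functional test flows
    min_judge_score: float   # lowest average judge dimension (assumed 1-5 scale)
    crash_free_rate: float   # fraction of crash-free sessions, 0.0 to 1.0


def release_blockers(
    signals: ReleaseSignals,
    judge_floor: float = 3.5,        # assumed threshold
    crash_free_floor: float = 0.99,  # assumed threshold
) -> List[str]:
    """Return human-readable reasons to hold a release; empty means no blockers."""
    blockers = []
    if signals.golden_failures > 0:
        blockers.append(f"{signals.golden_failures} golden screenshot(s) drifted")
    if signals.e2e_failures > 0:
        blockers.append(f"{signals.e2e_failures} E2E flow(s) failed")
    if signals.min_judge_score < judge_floor:
        blockers.append(f"judge score {signals.min_judge_score} is below {judge_floor}")
    if signals.crash_free_rate < crash_free_floor:
        blockers.append(f"crash-free rate {signals.crash_free_rate} is below {crash_free_floor}")
    return blockers
```

The function returns reasons instead of a bare pass/fail so the final ship decision stays with a person.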

This is the operating layer I want to keep evolving.