Configure
Start from your app, a preview URL, or a staging flow. Choose the personas and give Flock the task.
- https://preview.example.com
- Busy Executive
- Complete profile setup
AI-powered personas navigate your app, find confusing or broken moments, and hand you addressable fixes. AI-powered personas navigate your app like real users. They find what's confusing, broken, or frustrating, cite what they read or saw, and hand you addressable fixes for your team or coding agent.
Find user friction before it reaches production.
Each finding includes what the persona saw or ran into: screenshots, page text, trace events, console output, or network evidence.
Point Flock at your app, staging environment, or preview URL without adding an SDK or analytics pipeline.
Findings include severity, context, and suggested fixes your team or coding agent can act on.
The problem
Real user testing is slow, expensive, and biased. You can't run 100 users through your checkout flow before every deploy. So you guess, ship, and hope.
The solution
Point Flock at your app or a preview URL, choose the goal, and let personas attempt the flow. They capture what they saw, where they got stuck, and what your team can fix.
A synthetic tester that explores your product, captures what happened, and gives your team something concrete to fix.
From run to fix
The workflow reads like the product itself: a run request becomes captured evidence, then a finding, then a verified rerun.
Start from your app, a preview URL, or a staging flow. Choose the personas and give Flock the task.
Personas attempt the flow while Flock captures screens, DOM context, traces, and narrated decisions.
Each finding lands with the visible evidence, severity, confidence, and the user-facing reason it matters.
Patch the issue and rerun the same persona set against your app or preview URL.
Flock tests Flock
Dashboard state, replay evidence, consensus, scores, and timeline artifacts all come from the same sanitized set of runs against this site.
Technical proof
Flock is shaping the dashboard, CLI, and GitHub App around the same idea: findings should be easy to inspect, share, and act on.
npm run tester -- \
--url https://dashboard.example.com/signup \
--persona novice-user \
--goal "Create a trial account" \
--goal-outcome-type onboarding \
--success-signals '[{"type":"url_match","value":"**/welcome"}]' What this writes
manifest.json output.json timeline.jsonl errors.jsonl engineering_events.json report.md Agent-first workflow supported
Tell Cursor, Copilot, Claude, or another agent which finding to work on. The prompt carries the observed behavior, artifacts, and expected outcome.
Capabilities
Flock keeps the signal practical: what happened, why it matters, and what your team can do about it.
Screenshots, page text, traces, console output, or network evidence attached to every claim.
Severity, context, supporting evidence, and a suggested patch path in one finding.
Post findings and PR summaries with linked evidence so reviewers can inspect the issue.
Run outputs, screenshots, DOM excerpts, timeline events, errors, and reports stay inspectable.
Self-contained prompts for Cursor, Copilot, Claude, or other coding agents.
Rank findings so teams can decide what to fix now, review, or track later.
Define the user goal and success signals, then report where completion gets blocked.
Rerun after a patch and compare against previous artifacts to confirm the experience changed.
Use narration to understand why the friction matters, with artifacts showing what happened.
Run personas from terminal and CI workflows with repeatable commands and artifacts.
Who it's for
Flock starts with the teams responsible for fixing what users run into. Product, design, QA, and regulated teams can all work from the same evidence.
Catch user friction with artifacts, fix prompts, and reruns that fit the way modern teams already ship code.
See the exact UI state, component area, and suggested patch path behind a finding before it turns into production feedback.
Add UX regression signal to CI without replacing functional tests. Use severity thresholds and reruns to confirm behavior changed.
Inspect evidence for what to fix next, with persona context that explains user impact without replacing the artifact trail.
Evaluate preview and staging flows with no SDK requirement and no production analytics dependency when a URL-based run is enough.
Meet the personas
Each persona has distinct patience, risk tolerance, and goals. Their context helps teams understand why the same flow can fail in different ways.
Sam Rivera
Gets overwhelmed by dense layouts and jargon. Hesitates on ambiguous buttons or unclear labels.
Alex Chen
Skims quickly and reacts to visual contradictions. Near-zero tolerance for queues or disabled actions.
Riley Morgan
Notices inconsistencies and treats them as trust breakers. Pauses on mismatched labels that imply risk.
Jordan Wu
Knows what they want. Impatient with hand-holding. Explores aggressively, tries shortcuts.
Morgan Taylor
Looking for reasons to say no. Scrutinizes pricing. Compares to alternatives mentally.
Jordan Park
Navigates keyboard-only. Flags contrast issues, focus traps, missing ARIA labels, and broken heading hierarchy.
Use it to finish setup and start your first run.
You'll finish setup on the Flock dashboard.