Overview
Biz-Tech Analytics built and ran a sustained web-agent trajectory collection pipeline for a frontier AI research lab, generating structured behavioral data for computer-use model training. Each data point pairs a precisely sequenced interaction trace with structured edge-case documentation for deterministic replay and benchmarking.
Annotation Framework
Every click, keystroke, scroll, and URL navigation was treated as an independent, screenshotted, and recorded event. Each step gets a clear label saying what was done and why.
QA Pipeline
Multi-axis quality enforcement at 500 tasks / day
Every task went through a multi-point review before delivery - checking task completion, whether actions were purposeful rather than exploratory, whether the recording was clean, and whether screenshots were usable for replay. This kept client-side rework low (under 10%) while the pipeline held a steady daily output.
Annotators were onboarded to gold-standard delivery within 48 hours, with the active cohort sustaining 500+ long tasks/day across browsers, file managers, spreadsheets and desktop apps.