data_entry_ops · workflow
Dropbox improves document scanner responsiveness with hybrid quad tracking
Dropbox's document detection algorithm required 100 ms per frame while the camera captures at 30 fps, making real-time quad overlay impossible — especially on older devices like iPhone 5 that lacked the processing power of newer hardware.
How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Camera frame capture
The camera continuously pumps out images at 30 fps, triggering the detection pipeline.
Tools used
gyroscope
Outcome
The hybrid approach achieves 30 Hz image and quad throughput with approximately 30 ms latency and zero image-to-quad offset, combining the frame-rate of asynchronous mode with the quad accuracy of synchronous mode.
What failed first
Asynchronous processing showed quads offset 100 ms from the displayed image, causing laggy and choppy overlays. Synchronous processing reduced frame rate to 10 fps with jarring 100 ms latency. Several tracking approaches — brute-forcing, RANSAC, and digest-based alignment — were not fast enough.
Results
Time saved100ms
Volume30 fps
Grounding & classification
Source type: technical build writeup
23 fields verified against source quotes.
computer visionfailure mode describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedsoftwarecycle time reductionthroughput increasetechnical build writeupdata entry opsautonomous resolution