quality_assurance · saas · workflow

Cursor improves Tab code completion with online reinforcement learning

The Tab model showed too many incorrect suggestions, disrupting coding flow; the core challenge was not just improving model accuracy but knowing when to suggest and when to stay silent.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · User action triggers Tab

Whenever a user types a character or moves their cursor, the Tab model tries to predict the next action.

Tools used

PyTorch

Outcome

The new Tab model makes 21% fewer suggestions while achieving a 28% higher accept rate, and has become the new default in Cursor.

Results

Time saved1.5 to 2 hours

Volumeover 400 million

Source

https://cursor.com/blog/tab-rl

How we source this →

Grounding & classification

Source type: technical build writeup

18 fields verified against source quotes.

code generationpredictive analyticscode diff prmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedsoftwareaccuracy improvementemployee productivitytechnical build writeupquality assurance