Workflow · saas · workflow

Inside GitHub: Working with the LLMs behind GitHub Copilot

Code generation was considered too difficult for existing models before GPT-3, and early Copilot versions suffered from wrong-language suggestions and had no awareness of files beyond the one currently being edited.

How it works

Common implementation structure

How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.

Stage 1 · Developer edits code in IDE

The workflow starts when a GitHub Copilot user is actively editing a file in their IDE.

Tools used

GitHub CopilotGPT-3Codex modelGitHub Copilot X

Outcome

Model accuracy on evaluation problems rose from about half to upwards of 90 percent; prompt techniques including neighboring-tab context retrieval and file-path injection produced large lifts in code acceptance; GitHub Copilot gained new capabilities and GitHub Copilot X was announced as the next evolution.

What failed first

An initial chatbot prototype was abandoned in favour of the IDE modality, and early models defaulted to the most popular programming languages instead of detecting the correct one from context.

Results

Volumeabout half of the problems

Source

https://github.blog/2023-05-17-inside-github-working-with-the-llms-behind-github-copilot/

How we source this →

Grounding & classification

Source type: technical build writeup

24 fields verified against source quotes.

code generationbuilder submittedfailure mode describedhuman review describedmetric backednamed customerproduction runtime claimedsource backedtools describedworkflow describedsoftwareaccuracy improvementemployee productivitytechnical build writeupai draft human approval