it_support · workflow

incident.io describes prompt engineering techniques for LLM-driven incident management

Engineers at incident.io hit persistent walls building LLM-powered incident features: combining classification and generation tasks caused interference, conflicting prompt instructions produced inconsistent results, poorly structured prompts degraded reasoning, and unpreprocessed timestamps caused the model to miscalculate durations.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Incident alert triggers investigation
An alert (e.g. database CPU high) triggers an investigation that looks for relevant code changes and Slack messages.
Tools used
openaiSonnet4o
Outcome

Applying task splitting, instruction deconfliction, structured prompts, preprocessing, and model switching improved results, including one eval suite moving from 50% failures to 100% passing after switching to Sonnet.

What failed first

Combining classification and natural language generation in one prompt caused the tasks to interfere; a user-role message phrased as 'this is the message that is related' biased the LLM into over-identifying relevance because LLMs give more attention to recent tokens; and raw timestamps caused incorrect duration calculations.

Results
Volume50% failures to 100% passing
Source

https://incident.io/building-with-ai/tricks-to-fix-stubborn-prompts

How we source this →

Grounding & classification
Source type: technical build writeup
18 fields verified against source quotes.
document classificationsummarizationchat transcriptcode diff prfailure mode describedmetric backednamed customerproduction runtime claimedtools describedworkflow describedsoftwareaccuracy improvementtechnical build writeupit supportextract classify route