back_office_ops · public · workflow

T-RAG: on-premise RAG with finetuned LLM and tree-based entity context for governance document QA

A large non-profit organization needed secure, on-premise question answering over its private governance documents but could not use API-based LLMs due to data leakage risks, faced limited computational resources, and required reliably correct responses to organizational queries.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · PDF document ingested
The original PDF governance document is parsed into text format using the LangChain library.
Tools used
LangChainLlama-2QLoRAvector database
Outcome

T-RAG, combining RAG with a finetuned open-source LLM and a tree structure for organizational entity hierarchies, performs better than a simple RAG or finetuning implementation alone.

Results
Volume1,614 question and answer pairs
Source

https://arxiv.org/html/2402.07483v2

How we source this →

Grounding & classification
Source type: technical build writeup
20 fields verified against source quotes, 3 dropped as unverifiable.
document aiknowledge searchragknowledge basepolicy documentfailure mode describedhuman review describedproduction runtime claimedworkflow describednonprofitaccuracy improvementtechnical build writeupback office opsrag answering