quality_assurance · workflow

DREAM: A blueprint for distributed RAG experimentation using Ray, LlamaIndex, Ragas, MLFlow and MinIO on Kubernetes

Evaluating which combination of LLMs, embedding models, and retrieval methods works best for a RAG use case is time-consuming when combinations must be explored sequentially.

How it works
Common implementation structure
How this type of workflow is generally built, generalized across documented cases — not tied to any one vendor's stack. Click any stage to read what happens there. Specific products that implement these stages appear in “Tools commonly seen” below.
Stage 1 · Load and store unstructured data
PDFs are downloaded and pushed to MinIO S3 object storage using boto3.
Tools used
Ray TunePostgreSQLboto3gpt-3.5-turbogpt-4text-embedding-3-smalltext-embedding-3-large
Outcome

DREAM demonstrates how distributed RAG experimentation, evaluation, and tracking can be conducted using open-source technologies, enabling comparison and selection of the best-performing RAG parameter combination.

Source

https://mlops.community/blog/dream-distributed-rag-experimentation-framework

How we source this →

Grounding & classification
Source type: technical build writeup
14 fields verified against source quotes, 9 dropped as unverifiable.
data extractionragknowledge basesource backedworkflow describedaccuracy improvementtechnical build writeupquality assurancerag answering