
Tutorial: Retrieval-augmented service

Cephalon.Retrieval provides runtime services for retrieval and retrieval-augmented generation (RAG) workloads. Pair it with Cephalon.Data.Qdrant or Postgres + pgvector to index, query, and trace a full retrieval pipeline.

The tutorial covers:

  • an indexing pipeline that ingests documents into Qdrant.
  • a retrieval behavior that returns the top-k passages for a query.
  • traces that capture the embedding call, vector query, and downstream LLM call.
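
Until the walkthrough is available, the three pieces listed above can be pictured with a small, library-agnostic Python sketch. It does not use Cephalon.Retrieval, Qdrant, or pgvector: `embed`, `InMemoryIndex`, `span`, and `answer` are hypothetical names invented for illustration, the embedding is a toy hashed bag-of-words rather than a real model, and the LLM call is a placeholder that only assembles a prompt.

```python
import hashlib
import math
import time
from contextlib import contextmanager

DIM = 64    # toy embedding size (assumption, not a Cephalon setting)
TRACE = []  # recorded spans as (stage_name, duration_seconds)


@contextmanager
def span(name):
    """Record how long one pipeline stage takes (stand-in for real tracing)."""
    start = time.perf_counter()
    try:
        yield
    finally:
        TRACE.append((name, time.perf_counter() - start))


def embed(text):
    """Toy embedding: hashed bag-of-words, L2-normalised. Not a real model."""
    vec = [0.0] * DIM
    for token in text.lower().split():
        bucket = int(hashlib.md5(token.encode()).hexdigest(), 16) % DIM
        vec[bucket] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class InMemoryIndex:
    """Hypothetical stand-in for a vector store such as Qdrant or pgvector."""

    def __init__(self):
        self.rows = []  # list of (passage, vector)

    def ingest(self, passages):
        """Indexing pipeline: embed each passage and store it."""
        for passage in passages:
            with span("embedding"):
                vec = embed(passage)
            self.rows.append((passage, vec))

    def top_k(self, query, k=2):
        """Retrieval: return the k passages most similar to the query."""
        with span("embedding"):
            q = embed(query)
        with span("vector_query"):
            # Vectors are unit length, so the dot product is cosine similarity.
            scored = [(sum(a * b for a, b in zip(q, v)), p) for p, v in self.rows]
            scored.sort(key=lambda pair: pair[0], reverse=True)
        return [p for _, p in scored[:k]]


def answer(query, index, k=2):
    """Retrieve context, then hand it to a (placeholder) LLM call."""
    passages = index.top_k(query, k)
    with span("llm_call"):
        # A real service would call a model here; this only builds the prompt.
        prompt = "\n".join(passages) + "\n\nQuestion: " + query
    return passages, prompt


if __name__ == "__main__":
    index = InMemoryIndex()
    index.ingest([
        "qdrant stores vectors for similarity search",
        "postgres with pgvector adds a vector column type",
        "traces record each stage of a request",
    ])
    passages, _ = answer("how does qdrant search vectors", index)
    print(passages)
    print([name for name, _ in TRACE])
```

Each stage writes one entry to `TRACE`, so a single query produces an embedding span, a vector-query span, and an LLM-call span, mirroring the three trace points in the list. In a real service the in-memory list would be replaced by the vector store and the spans by the tracing backend in use.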

The full walkthrough will be published in the next documentation update. In the meantime, see Technology → Retrieval.