STEP 01
Ingest Sources
PDF and URL content enters through API routes with parsing, cleanup, and source metadata.
System Design
A compact RAG architecture focused on the pieces that matter after launch: source ingestion, retrieval boundaries, cited answers, and observable failure states.
STEP 01
PDF and URL content enters through API routes with parsing, cleanup, and source metadata.
STEP 02
Text is chunked for retrieval quality, then embedded with OpenAI embeddings.
STEP 03
Pinecone session namespaces return top context chunks for each user query.
STEP 04
Chat responses stream back with citations, source tags, and fallback behavior when evidence is thin.
Per-endpoint throttling controls abuse and API spend.
Every response includes X-Request-Id for incident tracing.
Runtime counters track success/error mix and average latency.