Agentic invoice auditor. Auto-approves the clean ones, escalates only edge cases.
Six specialised agents orchestrated in LangGraph + LangChain extract, translate, validate, audit, and save each invoice through a Groq Llama 3.3 LLM — with human-in-the-loop on flagged ones, a RAG chat over the corpus, and end-to-end LangSmith tracing. Numbers below are pulled live from the running system, not hardcoded.
Reads PDFs, DOCX, and OCR'd scans into typed JSON.
At a glance · 8 layers · 21 tools
The full stack — hover for details
Wires the 6 agents into a state machine with pause/resume.
Provides bind_tools and with_structured_output helpers.
Schema validates every LLM output; auto-retries on bad JSON.
Exposes 12 invoice tools to Claude Desktop, Cursor, Zed.
70B reasoning model behind extraction, audit, chat synthesis.
all-MiniLM-L6-v2 sentence-transformer for chunk embeddings.
Hosted vector DB; stores chunk embeddings for RAG.
LangGraph checkpoints — pipeline survives restarts.
Auto-traces every agent node and LLM call.
Streams per-agent events and chat tokens live to the UI.
Two-tier PDF text extraction with layout-aware fallback.
Reads scanned-image invoices via pytesseract.
Parses .docx invoice tables and paragraphs.
ASGI backend exposing /process, /chat, /stats, /upload.
MCP server talks back to FastAPI over HTTP.
App Router; dashboard, chat, tech, invoice detail pages.
Dark theme with violet→cyan AI-reserved gradient accent.
Page transitions, streaming pulses, stagger animations.
Treemap, donut, area chart on the analytics dashboard.
Slash command palette in chat (/flagged, /vendor, …).
PII redaction, prompt-injection block, off-topic gate on chat I/O.
Where an invoice goes
Every box is a real file or service. Arrows show how data physically moves.
- PDF / DOCX / PNGdrag-drop or /upload
- pdfplumber + pypdfPDF text
- Tesseract OCRscans
- python-docxDOCX tables
- LangGraphorchestrates 6 nodes
- Groq Llama 3.3reasoning model
- Pydanticstructured outputs
- SQLiteHITL checkpoints
- LangSmithtraces everything
- ChromaDB Cloudvector embeddings
- HuggingFaceMiniLM embeddings
- FastAPI /streamSSE telemetry
- Next.js + EChartsdashboard + chat
- MCP ServerClaude Desktop link