Prolog-RAG: Formal Logic for Financial Reasoning

A hybrid RAG system bridging the gap between semantic retrieval and symbolic precision.

📖 Table of Contents

🛑 The Problem with Traditional RAG
🚀 The Solution: Prolog-RAG
🏗️ System Architecture
🔍 Example: The "Audit Proof" Difference
📊 Benchmark Results
🛠️ How It Works
🏗️ Project Structure
💻 Tech Stack
🛠️ Installation & Setup
📜 Explainability Trace Example
🤝 Contributing
📄 License

🛑 The Problem with Traditional RAG

Standard Vector-based RAG architectures consistently fail in the financial domain because:

Numerical Hallucinations: LLMs struggle with multi-step arithmetic, leading to incorrect calculations for growth rates, total costs, and margins.
Logical Multi-Hop Gaps: Information scattered across fragmented documents (e.g., across 2012–2014 SEC filings) results in "lost-in-the-middle" reasoning errors.
Mathematical Imprecision: Simple semantic search cannot handle complex constraints or temporal comparative logic (e.g., "Find the year with the lowest margin").
Opaque Reasoning: Answers are generated as "black boxes," providing no verifiable audit trail—a critical failure for regulatory compliance.

🚀 The Solution: Prolog-RAG

Prolog-RAG solves these issues by offloading Reasoning from the LLM to a Symbolic Logic Engine.

Symbolic Fact Extraction: Uses LLMs to turn natural language context into structured Prolog predicates (revenue(aal, 2023, 500)).
Deterministic Reasoning: Executes complex financial rules (growth, margin, threshold analysis) through a symbolic Prolog backend.
100% Numerical Accuracy: Performs exact mathematical calculations, eliminating the "rounding" errors inherent in LLM-based synthesis.
Verifiable Proof Traces: Every answer comes with a step-by-step logic trace, showing exactly which financial facts and reasoning rules were used to derive the result.

🏗️ System Architecture

Prolog-RAG uses a hybrid routing mechanism to ensure high precision for structured queries while maintaining the flexibility of semantic search.

       ┌───────────────┐
       │   User Query  │
       └───────┬───────┘
               ▼
       ┌───────────────┐
       │ Query Router  │ (LLM-Based Decision Entry)
       └───────┬───────┘
               │
      ┌────────┴────────┐
      ▼ (Arithmetic)    ▼ (Semantic)
┌───────────────┐   ┌───────────────┐
│  Prolog Path  │   │  Vector Path  │
├───────────────┤   ├───────────────┤
│ Fact Extract  │   │ Chroma Search │
│ Logic Engine  │   │ LLM Synthesis │
└────────┬───────┘   └───────┬───────┘
         │                   │
         └────────┬──────────┘
                  ▼
          ┌───────────────┐
          │  Final Answer │ (With Proof Trace if Prolog)
          └───────────────┘

🔍 Example: The "Audit Proof" Difference

Query: "What was the gross profit margin for the company in 2017?"

🟢 Prolog-RAG (Reasoning Path)

Answer: "The gross profit margin for 2017 was 18.91%."
Verification Trace:

1. [Extract] revenue(company, 2017, 3314.0).
2. [Extract] cost_of_sales(company, 2017, 2687.0).
3. [Rule]    gross_profit(C, Y, GP) :- revenue(C, Y, R), cost(C, Y, S), GP is R - S.
4. [Rule]    margin(C, Y, M) :- gross_profit(C, Y, G), revenue(C, Y, R), M is (G/R)*100.
5. [Execute] M is ((3314 - 2687) / 3314) * 100 = 18.9197...

🔴 Traditional RAG (Semantic Path)

Answer: "The company reported a strong gross margin in 2017, approximately 19% based on the consolidated statements."
Verification Trace:

❌ None. Source of the "19%" figure is opaque and subject to LLM rounding/estimation.

📊 Benchmark Results

Evaluated on our Grounded Financial QA Suite (10 high-stakes financial reasoning questions):

System	Avg Accuracy Score	Proof Trace Availability	Best For...
🟢 Prolog-RAG	4.6 / 5	100% (10/10)	Logic, Arithmetic, Auditing
🔵 Contextual RAG	4.8 / 5	0%	General Semantic Lookups
🔴 Naive RAG	3.2 / 5	0%	Basic FAQ retrieval
🟣 Graph RAG	0.7 / 5	0%	Complex entity mapping

🛠️ How It Works

Query Input: The user provides a natural language financial query.
Hybrid Routing: An LLM analyzes the query to determine if it is Semantic (FAQ/Summary) or Arithmetic/Logical (Calculations/Multi-hop).
Fact Extraction: For logical queries, the system retrieves relevant document chunks and extracts structured financial facts (e.g., revenue(co, 2023, 500)).
Symbolic Unification: The facts are asserted into a SWI-Prolog knowledge base alongside domain-specific financial reasoning rules.
Logic Execution: The Prolog engine executes a symbolic query to derive the exact numerical answer or logical conclusion.
Answer Synthesis: The system generates a natural language answer, appending a Proof Trace for full transparency and explainability.

🏗️ Project Structure

prolog-rag/
├── prolog_rag_project/
│   ├── core/           # Hybrid pipeline, Query Router, NL-to-Prolog translator
│   ├── baselines/      # Naive, Graph, CRAG, and Contextual RAG implementations
│   └── utils/          # Auto-Evaluator, Reporting, & Visualization tools
├── benchmarks/         # Data generators for NIAH, HotpotQA, and FRAMES
├── docs/               # Comparative analysis and PRD documentation
├── assets/             # Performance charts and visualizations
├── demo_app.py         # Streamlit Interactive Demo
└── arena.py            # Unified benchmarking arena

💻 Tech Stack

Component	Technology	Role
Language	Python 3.11+	Orchestration & Pipeline
Logic Engine	SWI-Prolog 9.x	Symbolic Reasoning & Arithmetic
LLM (Backbone)	Llama 3.1 (via Groq)	Fact Extraction & Query Routing
Vector Database	ChromaDB	Semantic Retrieval & Context Management
Embeddings	Sentence-Transformers	Vectorizing Financial Documents
Visualization	Matplotlib	Performance & Benchmark Charting

🛠️ Installation & Setup

Prerequisites

Python 3.11 or higher.
SWI-Prolog installed and added to your system PATH.
A Groq API Key (Set in .env).

Step-by-Step

Clone the Repo:

git clone https://github.com/RedLordezh7Venom/prolog-RAG.git && cd prolog-RAG

Install Dependencies:
```
uv pip install -e .
```
Environment Setup: Create a .env file in the root and add your Groq API key:
```
GROQ_API_KEY=your_key_here
```
Run the Benchmark:
```
uv run python arena.py
```

📜 Explainability Trace Example

Query: "What was the growth in Technical Solutions operating income from 2017 to 2018?"
Trace:
  -> fact: operating_income('Technical Solutions', 2017, 21.0)
  -> fact: operating_income('Technical Solutions', 2018, 32.0)
  -> rule: op_income_growth(Company, 2017, 2018, Growth)
  -> calc: (32.0 - 21.0) / 21.0 * 100 = 52.38%
Answer: "The operating income grew by 52.4%."

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for details.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
assets		assets
benchmarks		benchmarks
data		data
docs		docs
prolog_rag_project		prolog_rag_project
scripts		scripts
tests		tests
.gitignore		.gitignore
.python-version		.python-version
CONTRIBUTING.md		CONTRIBUTING.md
FINAL_PRD_PROLOG_RAG.md		FINAL_PRD_PROLOG_RAG.md
LICENSE		LICENSE
PHASE1_README.md		PHASE1_README.md
README.md		README.md
app.py		app.py
arena.py		arena.py
arena_results.json		arena_results.json
build_vector_store.py		build_vector_store.py
debug_doc0.txt		debug_doc0.txt
demo.ipynb		demo.ipynb
download_data.py		download_data.py
eval_summary.json		eval_summary.json
evaluator.py		evaluator.py
final_verify.txt		final_verify.txt
index_contextual.py		index_contextual.py
main.py		main.py
manual_evaluation.md		manual_evaluation.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
sample_docs.json		sample_docs.json
test_prolog.py		test_prolog.py
test_questions.json		test_questions.json
uv.lock		uv.lock
verify_llm.py		verify_llm.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Prolog-RAG: Formal Logic for Financial Reasoning

📖 Table of Contents

🛑 The Problem with Traditional RAG

🚀 The Solution: Prolog-RAG

🏗️ System Architecture

🔍 Example: The "Audit Proof" Difference

🟢 Prolog-RAG (Reasoning Path)

🔴 Traditional RAG (Semantic Path)

📊 Benchmark Results

🛠️ How It Works

🏗️ Project Structure

💻 Tech Stack

🛠️ Installation & Setup

Prerequisites

Step-by-Step

📜 Explainability Trace Example

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Prolog-RAG: Formal Logic for Financial Reasoning

📖 Table of Contents

🛑 The Problem with Traditional RAG

🚀 The Solution: Prolog-RAG

🏗️ System Architecture

🔍 Example: The "Audit Proof" Difference

🟢 Prolog-RAG (Reasoning Path)

🔴 Traditional RAG (Semantic Path)

📊 Benchmark Results

🛠️ How It Works

🏗️ Project Structure

💻 Tech Stack

🛠️ Installation & Setup

Prerequisites

Step-by-Step

📜 Explainability Trace Example

🤝 Contributing

📄 License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages