🧠 FormIntel - AI-Powered Document Classification and Summarization

FormIntel is a full-stack document intelligence tool that uses OCR, NLP, and Machine Learning to:

Extract text from uploaded document images,
Classify the type of document (e.g., Invoice, Receipt, or Timesheet),
Summarize long documents using T5 Transformers,
Extract structured fields using simple pattern-based logic.

Built with Flask, TensorFlow, TfidfVectorizer, Tesseract OCR, and HuggingFace Transformers.

🚀 Features

✅ OCR image-to-text with Tesseract
✅ Document classification (ML model)
✅ Transformer-based text summarization
✅ REST API interface
✅ Fast JSON output (suitable for backend pipelines)
✅ Built-in data generator + stress test via /predict

🛠️ Installation

Clone the repo

git clone https://github.com/cmdsnr/FormIntel.git
cd FormIntel

Install Dependencies:

pip install -r requirements.txt

Then:

python train_model.py

Which will generate the keras file in the /model directory. Finally run: python app.py and use the generate_and_post.py file to try with a data sample I made. (The file will also send a post request with the json file) You can also post an image file: curl -X POST -F "file=@somefile.png" http://127.0.0.1:5000/process

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
models		models
README.md		README.md
app.py		app.py
classifier.py		classifier.py
extractor.py		extractor.py
generate_and_post.py		generate_and_post.py
ocr_engine.py		ocr_engine.py
requirements.txt		requirements.txt
summarizer.py		summarizer.py
train_model.py		train_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 FormIntel - AI-Powered Document Classification and Summarization

🚀 Features

🛠️ Installation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

cmdsnr/FormIntel

Folders and files

Latest commit

History

Repository files navigation

🧠 FormIntel - AI-Powered Document Classification and Summarization

🚀 Features

🛠️ Installation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages