
ashwin975/Databricks_RAG



🎉 Inspiration

I was struck by the lightning bolt ⚡ of inspiration when I discovered the incredible potential of Retrieval-Augmented Generation (RAG) and the mind-blowing scalability of Databricks. I thought, why not combine these two superpowers to create a RAG-based LLM model that can swoop in like a superhero 🦸‍♂️ and save the day for Husky's field service engineers?

Company Website - Husky Webpage

About the Company - Husky Technologies specializes in Injection Molding systems and excels in providing top products and services for industries like consumer goods, medical, beverages, and automotive. They focus on delivering high-performance, efficient solutions globally, with extensive support including installation, training, and maintenance.

Problem Statement - To enhance productivity for Husky Technologies' field service agents, implementing a Large Language Model (LLM) or a context-aware Q&A bot could be transformative. This technology would allow agents to quickly navigate complex documentation necessary for diagnosing equipment issues, significantly speeding up the process. The main challenge is integrating Husky's large knowledge database and internal knowledge into the LLM to ensure it can effectively retrieve and interpret specific information.

🤖 What it does

Our RAG-based LLM is like a genius librarian 📚 that lives inside Databricks. It has indexed Husky's entire collection of technical documents, user manuals, and service manuals, and can provide spot-on answers to any question thrown its way. It's like having a pocket-sized expert 🧠 that engineers can consult anytime, anywhere!
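At query time, the bot first retrieves the manual chunks most relevant to the question, then generates an answer from them. A minimal sketch of that retrieval step, using tiny made-up vectors in place of real bge-large-en embeddings (the chunk texts and numbers below are invented for illustration, not from the actual index):

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def retrieve(query_vec, index, k=2):
    """Return the k chunk texts whose embeddings are closest to the query."""
    ranked = sorted(index, key=lambda item: cosine(query_vec, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]

# Toy index of (chunk text, embedding) pairs -- stand-ins for real manual chunks.
index = [
    ("Check nozzle heater resistance before startup.", [0.9, 0.1, 0.0]),
    ("Ultrasync servo calibration procedure.",         [0.1, 0.9, 0.2]),
    ("Lubrication schedule for tie bars.",             [0.0, 0.2, 0.9]),
]
top = retrieve([0.8, 0.2, 0.1], index, k=1)
```

In production this lookup is handled by the Databricks Vector Search index described below; the sketch only shows the similarity ranking idea.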

🧅 How I built it

  • Resources were created on Azure, including resource groups, a Databricks workspace, a Unity Catalog access connector, Data Lake storage (metastore storage), and Azure Key Vault (secret storage); cluster configurations were also set up initially
  • Connections and access management between the Azure storage container and the Databricks workspace were established
  1. Ingested the PDFs into a Unity Catalog volume as raw data
  2. Split the PDFs into small chunks of text
  3. Computed the embeddings using a Databricks Foundation Model (bge-large-en) as part of our Delta Live Tables pipeline
  4. Created a Vector Search index based on the Delta Live Table
  5. Trained the model with input and output examples
  6. Registered the fine-tuned LLM
  7. Created and served an endpoint using MLflow
  8. Set up a Hugging Face Space for the chatbot user interface
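The chunking step above can be sketched in plain Python. This is a simple fixed-size splitter with overlap; the actual pipeline runs inside Delta Live Tables, and the chunk size and overlap values here are illustrative, not the deployed settings:

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split extracted PDF text into overlapping character chunks for embedding.

    Overlap keeps sentences that straddle a chunk boundary retrievable
    from either side. Values are illustrative.
    """
    if chunk_size <= overlap:
        raise ValueError("chunk_size must exceed overlap")
    chunks = []
    start = 0
    step = chunk_size - overlap
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += step
    return chunks

# Hypothetical sample text standing in for extracted manual pages.
sample = "Husky hot runner systems deliver melted plastic to mold cavities. " * 20
pieces = chunk_text(sample, chunk_size=200, overlap=20)
```

Each chunk then gets an embedding (step 3) and lands in the Vector Search index (step 4).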

🏛️ Architecture:

[Architecture diagram]
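In the serving flow, retrieved chunks are stitched into a prompt for the LLM endpoint. A hedged sketch of that assembly step (the template wording is invented for illustration; the deployed prompt is not documented here):

```python
def build_prompt(question: str, chunks: list[str]) -> str:
    """Assemble a RAG prompt from retrieved manual excerpts and the question.

    The instruction and excerpt labels are illustrative, not the real template.
    """
    context = "\n\n".join(f"[Excerpt {i + 1}]\n{c}" for i, c in enumerate(chunks))
    return (
        "Answer the field service question using only the excerpts below.\n\n"
        f"{context}\n\nQuestion: {question}\nAnswer:"
    )

prompt = build_prompt(
    "How do I check the nozzle heater?",
    ["Check nozzle heater resistance before startup."],
)
```

The prompt string is what the MLflow-served endpoint would receive as model input.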

🍽️ Dataset Used:

Documents used for RAG memory:

Hot Runner Product Handbook (238 pages)

Ultrasync Service Manual (78 pages)

Ultrashot Service Manual (186 pages)

Training Course Doc (20 pages)

😅 Challenges I ran into

Ensuring that the model's responses made sense and stayed on topic was like herding cats 🐱. Husky's documents are complex, and getting the retrieval process and model architecture just right took more trial and error than a mad scientist's lab 🧪. But I persevered, fueled by coffee ☕ and determination, and, of course, constrained by a limited budget 💸!

🚀 What's next for our RAG-based LLM on Databricks

The sky's the limit! 🌟

  • ✅ I plan to keep expanding the knowledge base, like a sponge soaking up water 🧽.
  • ✅ Incorporating user feedback from industry experts and field service engineers/SMEs.
  • ✅ Integration with existing enterprise systems and support for multiple languages 🌍
  • ✅ As future scope, I plan to scrape Wikipedia pages, research papers, and ebooks

The future is bright, and I can't wait to see where this journey takes us! 🎈

🧠 What I learned

I learned that combining retrieval and generation techniques is like mixing peanut butter and jelly 🥪 - they're just meant to be together! I also discovered the importance of fine-tuning the retrieval process and optimizing the model architecture, like a chef perfecting a secret recipe 👨‍🍳. Databricks was our trusty sous-chef 👨‍🍳, handling the heavy lifting of data processing and model training.

🎉 Accomplishments that I'm proud of

I did it! I successfully combined the retrieval and generative components into a unified RAG model that provides accurate, context-specific answers. It's like I created a mind-reading machine 🔮 that can tap into Husky's collective knowledge and deliver coherent responses. I couldn't be prouder of our brainchild! 👶

🛠️ Tools Used:

  • Programming languages - Python, SQL, PySpark
  • Azure Cloud, Databricks
  • Azure Data Lake Storage Gen2 for the metastore
  • Delta Live Tables for document storage

Gradio App - https://ashwin975-mfg-injection.space
