🤖 Document Intelligence RAG Agent

AI-powered document understanding and conversational retrieval system built using LangChain, ChromaDB, FastAPI, Streamlit, and Groq LLMs.

📌 Overview

Document Intelligence RAG Agent allows users to:

Upload PDF documents
Perform semantic search on document content
Ask natural language questions
Retrieve context-aware AI-generated answers

The system uses a Retrieval-Augmented Generation (RAG) pipeline to combine vector search with Large Language Models for accurate responses.

🚀 Features

✅ PDF Upload & Processing ✅ Semantic Document Search ✅ Conversational AI Q&A ✅ Vector Embeddings using Sentence Transformers ✅ ChromaDB Vector Store ✅ FastAPI Backend ✅ Streamlit Interactive UI ✅ Groq LLM Integration ✅ Modern Glassmorphism UI ✅ Dark-Themed AI Interface

🛠️ Tech Stack

Technology	Purpose
Python	Core programming language
LangChain	RAG pipeline orchestration
ChromaDB	Vector database
Sentence Transformers	Embedding generation
FastAPI	Backend API
Streamlit	Frontend UI
Groq API	LLM inference
PyPDF	PDF text extraction

🧠 RAG Architecture

PDF Upload
    ↓
Text Extraction
    ↓
Chunking
    ↓
Embeddings Generation
    ↓
ChromaDB Vector Storage
    ↓
Similarity Search
    ↓
Relevant Context Retrieval
    ↓
LLM Response Generation

📂 Project Structure

rag-agent/
│
├── app/
│   ├── services/
│   ├── utils/
│   ├── uploads/
│
├── frontend/
│   └── frontend.py
│
├── .streamlit/
│   └── config.toml
│
├── requirements.txt
│
└── README.md

⚙️ Installation

1️⃣ Clone Repository

git clone <YOUR_GITHUB_REPO_URL>
cd rag-agent

2️⃣ Create Virtual Environment

python -m venv venv

Activate environment:

Windows

venv\Scripts\activate

Linux / Mac

source venv/bin/activate

3️⃣ Install Dependencies

pip install -r requirements.txt

🔑 Environment Variables

Create a .env file:

GROQ_API_KEY=your_api_key_here

▶️ Run Backend

uvicorn app.main:app --reload

Backend runs on:

http://127.0.0.1:8000

▶️ Run Frontend

streamlit run frontend/frontend.py

Frontend runs on:

http://localhost:8501

📸 Application Preview

Main Interface

Add your screenshot here

README_images/app_preview.png

Example Markdown after adding screenshot:

![App Preview](README_images/app_preview.png)

💡 Example Workflow

Upload PDF document
System extracts and indexes content
Ask questions in chat
AI retrieves relevant chunks
LLM generates contextual answer

📈 Future Improvements

Multi-document support
Conversation memory
Authentication system
Cloud deployment
Source citation highlighting
OCR support for scanned PDFs
Hybrid search (BM25 + Vector)

🧪 Sample Queries

Summarize this document
What are the key findings?
Explain the methodology section
What technologies are discussed?

👨‍💻 Author

Built by Anurag Wanwe

📄 License

This project is for educational and portfolio purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.devcontainer		.devcontainer
.streamlit		.streamlit
app		app
frontend		frontend
.gitignore		.gitignore
Readme.md		Readme.md
config.py		config.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Document Intelligence RAG Agent

📌 Overview

🚀 Features

🛠️ Tech Stack

🧠 RAG Architecture

📂 Project Structure

⚙️ Installation

1️⃣ Clone Repository

2️⃣ Create Virtual Environment

Windows

Linux / Mac

3️⃣ Install Dependencies

🔑 Environment Variables

▶️ Run Backend

▶️ Run Frontend

📸 Application Preview

Main Interface

💡 Example Workflow

📈 Future Improvements

🧪 Sample Queries

👨‍💻 Author

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🤖 Document Intelligence RAG Agent

📌 Overview

🚀 Features

🛠️ Tech Stack

🧠 RAG Architecture

📂 Project Structure

⚙️ Installation

1️⃣ Clone Repository

2️⃣ Create Virtual Environment

Windows

Linux / Mac

3️⃣ Install Dependencies

🔑 Environment Variables

▶️ Run Backend

▶️ Run Frontend

📸 Application Preview

Main Interface

💡 Example Workflow

📈 Future Improvements

🧪 Sample Queries

👨‍💻 Author

📄 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages