RAG Chatbot Workshop

This workshop guides you through building a RAG (Retrieval-Augmented Generation) chatbot from scratch that runs entirely on local resources with Ollama. You'll build a complete system that answers questions based on your own documents.

Author

Carlos Villuendas - @carlosvillu

Prerequisites

Node.js v22 or higher
Ollama installed locally
Docker and Docker Compose
Basic understanding of TypeScript and JavaScript
Basic knowledge of RAG systems and embeddings

Installation

Clone the repository:

git clone https://github.com/carlosvillu/chatbot-workshop

Install dependencies:

cd chatbot-workshop
npm run phoenix

Installing and Setting up Ollama

Linux Installation

Install Ollama using the official install script:

curl -fsSL https://ollama.com/install.sh | sh

Verify the installation:

ollama --version

Starting Ollama Server

Start the Ollama server:

ollama serve

Download required models:

# Download base model for chat
ollama pull llama3.1:8b

# Download smaller model for testing
ollama pull llama3.2:3b

# Download model for embeddings
ollama pull nemotron:mini

You can verify installed models with:

ollama list

Docker Environment

Starting Docker Services

In the 03-ingest-vectorstore branch, you'll find a docker-compose.yml file. Start the required services with:

docker-compose up -d

This will set up:

ChromaDB for vector storage
Additional required services

Cleaning Up Docker Resources

After finishing the workshop, clean up all resources with:

# Stop and remove the containers and networks
docker-compose down

# Also remove the volumes
docker-compose down -v

# Remove all related images
docker rmi $(docker images -q 'chromadb/*')

# Verify cleanup
docker ps -a
docker volume ls

Workshop Structure

The workshop is organized in branches, each focusing on a specific aspect of RAG systems. Follow them in order:

1. Document Ingestion (01-ingest-documents)

Learn how to load and process documents from different sources.
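
As a rough illustration of what "processing" a document can involve, the sketch below splits a loaded text into overlapping character windows. The helper name `chunkText` and its `chunkSize` and `overlap` parameters are hypothetical and not taken from the workshop code, which may split documents differently.

```typescript
// Hypothetical helper: split a document into overlapping character windows.
// chunkSize and overlap are illustrative parameters, not workshop values.
function chunkText(text: string, chunkSize: number, overlap: number): string[] {
  if (chunkSize <= 0) {
    throw new Error("chunkSize must be positive");
  }
  if (overlap < 0 || overlap >= chunkSize) {
    throw new Error("overlap must be in the range [0, chunkSize)");
  }

  const chunks: string[] = [];
  const step = chunkSize - overlap; // how far the window advances each time

  for (let start = 0; start < text.length; start += step) {
    chunks.push(text.slice(start, start + chunkSize));
    // Stop once the current window has reached the end of the text.
    if (start + chunkSize >= text.length) break;
  }
  return chunks;
}
```

The overlap keeps a sentence that straddles a boundary visible in both neighbouring chunks, at the cost of storing some text twice.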

2. Document Embedding (02-ingest-embedding)

Transform documents into vector representations using Ollama's embedding model.
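
One way to picture this step is as a single HTTP call to the local Ollama server. The sketch below assumes Ollama's `POST /api/embed` endpoint, which takes `{ model, input }` and returns `{ embeddings }`. The helper names `buildEmbedRequest` and `parseEmbedResponse` are hypothetical, and the workshop branch may use a client library rather than raw HTTP.

```typescript
// Assumption: Ollama's POST /api/embed accepts { model, input } and
// responds with { embeddings: number[][] }, one vector per input text.
type EmbedRequest = {
  url: string;
  init: { method: "POST"; headers: { "Content-Type": string }; body: string };
};

// Hypothetical helper: build the request without sending it.
function buildEmbedRequest(
  texts: string[],
  model: string,
  baseUrl = "http://localhost:11434",
): EmbedRequest {
  if (texts.length === 0) {
    throw new Error("at least one text is required");
  }
  return {
    url: `${baseUrl}/api/embed`,
    init: {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ model, input: texts }),
    },
  };
}

// Hypothetical helper: validate the decoded JSON response.
function parseEmbedResponse(payload: unknown, expectedCount: number): number[][] {
  if (typeof payload !== "object" || payload === null) {
    throw new Error("response is not a JSON object");
  }
  const embeddings = (payload as { embeddings?: unknown }).embeddings;
  if (!Array.isArray(embeddings) || embeddings.length !== expectedCount) {
    throw new Error("unexpected embeddings payload");
  }
  return embeddings as number[][];
}

// Usage with the global fetch in Node.js (not executed here):
//   const { url, init } = buildEmbedRequest(chunks, embeddingModel);
//   const response = await fetch(url, init);
//   const vectors = parseEmbedResponse(await response.json(), chunks.length);
```

Keeping request building and response parsing separate from the network call makes both easy to check without a running server.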

3. Vector Storage (03-ingest-vectorstore)

Store and manage document embeddings efficiently using ChromaDB.
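
To make the idea of a vector store concrete, here is a minimal in-memory stand-in. It is not the ChromaDB API: the class `InMemoryVectorStore` and the `VectorRecord` shape are hypothetical, and they only illustrate the two guarantees a store typically provides, namely one record per id and a consistent embedding dimension.

```typescript
// Hypothetical record shape: an id, the embedding, and the source text.
type VectorRecord = { id: string; embedding: number[]; text: string };

// Illustrative stand-in for a vector database; not the ChromaDB client.
class InMemoryVectorStore {
  private readonly records = new Map<string, VectorRecord>();
  private dimension: number | null = null;

  // Insert new records or overwrite existing ones with the same id.
  upsert(batch: VectorRecord[]): void {
    for (const record of batch) {
      if (this.dimension === null) {
        this.dimension = record.embedding.length; // fixed by the first record
      }
      if (record.embedding.length !== this.dimension) {
        throw new Error(
          `embedding for "${record.id}" has length ${record.embedding.length}, expected ${this.dimension}`,
        );
      }
      this.records.set(record.id, record);
    }
  }

  get(id: string): VectorRecord | undefined {
    return this.records.get(id);
  }

  get size(): number {
    return this.records.size;
  }
}
```

Rejecting mismatched dimensions early matters because vectors produced by different embedding models cannot be compared meaningfully.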

4. Question Embedding (04-consumer-embedding-question)

Process user questions and convert them into vector representations.

5. Semantic Search (05-consumer-vectorstore-search)

Implement semantic search to find relevant documents for user questions.
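
Semantic search comes down to ranking stored embeddings by their similarity to the question embedding. The sketch below shows that ranking with cosine similarity over a plain array. In the workshop this work is presumably delegated to ChromaDB, so the names `cosineSimilarity`, `searchTopK`, and `IndexedChunk` are hypothetical and serve only to illustrate the idea.

```typescript
// Cosine similarity: dot product divided by the product of the vector norms.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vectors must have the same length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // avoid dividing by zero
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Hypothetical shape of an embedded document chunk.
type IndexedChunk = { id: string; text: string; embedding: number[] };
type SearchHit = { id: string; text: string; score: number };

// Brute-force search: score every chunk, then keep the k best.
function searchTopK(query: number[], chunks: IndexedChunk[], k: number): SearchHit[] {
  return chunks
    .map((chunk) => ({
      id: chunk.id,
      text: chunk.text,
      score: cosineSimilarity(query, chunk.embedding),
    }))
    .sort((x, y) => y.score - x.score)
    .slice(0, Math.max(0, k));
}
```

A linear scan like this is fine for a few thousand chunks. Vector databases use approximate indexes to keep queries fast on larger collections.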

6. Chat Integration (06-consumer-chat)

Create a chat interface that uses the RAG system to answer questions.
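
The core of this step is combining the retrieved chunks with the user's question before calling the chat model. A minimal sketch follows. The function `buildRagMessages` and the prompt wording are hypothetical and not taken from the workshop. The message shape (`role` and `content`) follows the format that Ollama's `/api/chat` endpoint accepts.

```typescript
// Message shape accepted by chat-style APIs such as Ollama's /api/chat.
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

// Hypothetical helper: place the retrieved chunks in the system message
// and pass the question through as the user message.
function buildRagMessages(question: string, contextChunks: string[]): ChatMessage[] {
  // Number each chunk so the model (and the reader) can tell them apart.
  const context = contextChunks
    .map((chunk, index) => `[${index + 1}] ${chunk.trim()}`)
    .join("\n\n");

  const system =
    "Answer using only the context below. " +
    "If the context does not contain the answer, say you do not know.\n\n" +
    `Context:\n${context}`;

  return [
    { role: "system", content: system },
    { role: "user", content: question.trim() },
  ];
}
```

The resulting array could then be sent as the `messages` field of a `POST /api/chat` request body, alongside `model` (for example `llama3.1:8b`) and `stream: false`.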

Learning Path

Start with branch 01-ingest-documents and follow the commits
Once you understand each part, move to the next branch
Each branch builds upon the previous one
The final branch 06-consumer-chat contains the complete working chatbot

Documentation

The slides for this workshop are available at: https://docs.google.com/presentation/d/1QwgaD35z1KK7CqexYm5HWy2oezk_nK9Y_RcoM6Ej8oY/edit?usp=sharing

Troubleshooting

Ollama

If you encounter issues with Ollama, ensure the server is running with ollama serve
Check server status: curl http://localhost:11434/api/tags
Verify model installation: ollama list
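
The same checks can be done programmatically. The sketch below assumes that `GET /api/tags` responds with a `models` array whose entries carry a `name` such as `llama3.1:8b`. The helper `missingModels` is hypothetical.

```typescript
// Assumed response shape of GET http://localhost:11434/api/tags.
type TagsResponse = { models: Array<{ name: string }> };

// Hypothetical helper: list the required models that are not installed.
function missingModels(tags: TagsResponse, required: string[]): string[] {
  const installed = new Set(tags.models.map((model) => model.name));
  return required.filter((name) => !installed.has(name));
}

// Usage with the global fetch in Node.js (not executed here):
//   const response = await fetch("http://localhost:11434/api/tags");
//   const tags = (await response.json()) as TagsResponse;
//   console.log(missingModels(tags, ["llama3.1:8b"]));
```

If the `fetch` call itself fails, the server is most likely not running. A non-empty result means the listed models still need an `ollama pull`.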

Docker

Check running containers: docker ps
View logs: docker-compose logs -f
If ChromaDB fails to start, ensure ports are not in use: lsof -i :8000

License

MIT
