A Retrieval-Augmented Generation (RAG) project using LangChain, PostgreSQL with pgVector, and OpenAI to create an intelligent question-answering system based on web documents.
Imagine you want to ask an AI assistant questions about a specific document, website, or your company's knowledge base. Language models like GPT are very intelligent, but they have a major limitation: they only know information they were trained on and don't have access to your private data or recent information.
RAG solves this problem by combining two capabilities:
- 🔍 Retrieval: Search for relevant information in your documents
- ✨ Generation: Use an LLM to formulate an answer based on that information
Concrete example:
- You ask: "What are the new product features?"
- The RAG system:
  - searches your technical documentation for relevant sections
  - provides this information to the AI as context
- The AI generates a precise answer based on YOUR data
Key benefits:
- 💰 Cost-effective: No need to retrain an expensive model
- ⚡ Fast: Instant updates with new documents
- 🎯 Accurate: Responds with your exact data, not approximations
- 🔒 Secure: Your data remains private
This project demonstrates how to build a complete RAG system that:
- Extracts content from web articles
- Splits content into manageable chunks
- Stores embeddings in a PostgreSQL vector database
- Allows asking questions about the content and getting contextual answers
Tech stack:
- Frontend/Interface: Jupyter Notebook for interactive experimentation
- LLM: OpenAI GPT-4o-mini for response generation
- Embeddings: OpenAI text-embedding-3-large for vectorization
- Vector Database: PostgreSQL with pgVector extension
- Framework: LangChain for RAG orchestration
- Containerization: Docker Compose for easy deployment
Prerequisites:
- Docker and Docker Compose
- OpenAI API Key
- Python 3.8+ (if running locally)
```bash
git clone <your-repo-url>
cd rag-langchain

# Copy the example file
cp .env.example .env

# Edit the .env file and add your OpenAI API key
# OPENAI_API_KEY=your-openai-api-key-here

# Start all services
docker-compose up -d

# Check that services are running
docker-compose ps
```

The following services are then available:
- Jupyter Lab: http://localhost:8888
- pgAdmin: http://localhost:8080 ([email protected] / admin)
- PostgreSQL: localhost:5432
- Open your browser and go to http://localhost:8888
- Open the notebook rag-lanchain.ipynb
- Execute the cells sequentially to:
  - Install dependencies
  - Configure the LLM model and embeddings
  - Create the PostgreSQL vector store
  - Load and process web content
  - Split content into chunks
  - Store embeddings in the database
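The chunking step is handled in the notebook by LangChain's `RecursiveCharacterTextSplitter`; the idea behind it can be illustrated in plain Python. This toy version splits on a fixed window with overlap, whereas the real splitter also prefers paragraph and sentence boundaries:

```python
def split_with_overlap(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Naive fixed-window splitter: each chunk repeats the last `overlap`
    characters of the previous one, so a sentence cut at a chunk boundary
    still appears intact in at least one chunk."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks, start, step = [], 0, chunk_size - overlap
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        start += step
    return chunks

# Roughly the size of the example article (~43k characters)
chunks = split_with_overlap("x" * 43_000)
print(len(chunks))  # → 54 windows of up to 1000 chars with 200-char overlap
```

The real splitter produces a slightly different count (~66 fragments for this article) because it breaks on separators rather than at exact character offsets.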
If you prefer to run the project locally:

```bash
# Install dependencies
pip install -r requirements.txt

# Start only PostgreSQL and pgAdmin
docker-compose up postgres pgadmin -d

# Launch Jupyter locally
jupyter lab
```

Project structure:

```
rag-langchain/
├── rag-lanchain.ipynb   # Main notebook with RAG code
├── requirements.txt     # Python dependencies
├── Dockerfile           # Jupyter container configuration
├── compose.yml          # Docker Compose configuration
├── .env.example         # Environment variables template
├── README.md            # Project documentation (English)
└── README_FR.md         # Project documentation (French)
```
Main dependencies:
- LangChain: Framework for LLM applications
- LangGraph: Graphs for complex workflows
- langchain-openai: OpenAI integration
- langchain-postgres: PostgreSQL integration
- langchain-text-splitters: Text splitting
- langchain-community: Community loaders and utilities
- PostgreSQL 15 with pgVector extension
- psycopg[binary]: Python-PostgreSQL connector
- Jupyter Lab: Interactive development environment
- Docker: Containerization
- pgAdmin: PostgreSQL administration interface
As a working example, the project indexes Lilian Weng's blog article on AI agents: https://lilianweng.github.io/posts/2023-06-23-agent/
The content is extracted, processed, and indexed to enable intelligent queries.
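During extraction, only the article body should be kept. The common LangChain pattern for this article passes a `bs4.SoupStrainer` to `WebBaseLoader` so that navigation and footers are discarded; the sketch below applies the same strainer to an inline HTML sample to avoid a network call (the class names match Lilian Weng's blog theme, and the sample HTML is made up):

```python
import bs4

# Keep only elements with these classes, as the loader configuration does
only_post = bs4.SoupStrainer(class_=("post-title", "post-header", "post-content"))

html = """
<html><body>
  <nav>Home | About</nav>
  <h1 class="post-title">LLM Powered Autonomous Agents</h1>
  <div class="post-content">Agent system overview...</div>
  <footer>© 2023</footer>
</body></html>
"""

# Elements outside the strainer (nav, footer) are never parsed
soup = bs4.BeautifulSoup(html, "html.parser", parse_only=only_post)
print(soup.get_text(" ", strip=True))
# → LLM Powered Autonomous Agents Agent system overview...
```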
- Web Extraction: Automatic web content loading with Beautiful Soup
- Text Processing: Intelligent splitting into chunks with overlap
- Vectorization: Text conversion to embeddings via OpenAI
- Vector Storage: Persistence in PostgreSQL with pgVector
- Semantic Search: Vector similarity search
- Contextual Generation: Responses based on retrieved content
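Under the hood, the semantic search step is nearest-neighbor ranking by similarity between the question's embedding and each stored chunk's embedding (pgVector performs this in SQL via its distance operators). A toy illustration with made-up 3-dimensional vectors — real embeddings from text-embedding-3-large have 3072 dimensions, and the chunk texts below are invented:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Pretend chunk embeddings (values chosen by hand for illustration)
chunks = {
    "Agents use planning and memory.":     [0.9, 0.1, 0.0],
    "PostgreSQL stores vectors natively.": [0.1, 0.9, 0.2],
    "Docker Compose starts the services.": [0.0, 0.2, 0.9],
}
question_vec = [0.8, 0.2, 0.1]  # pretend embedding of "How do agents plan?"

# Retrieval = pick the chunk whose embedding is closest to the question's
best = max(chunks, key=lambda c: cosine_similarity(question_vec, chunks[c]))
print(best)  # → Agents use planning and memory.
```

The retrieved chunk(s) are then inserted into the prompt as context, and GPT-4o-mini generates the final answer from them.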
The following diagram illustrates the complete RAG pipeline implemented in this project:
```mermaid
graph TD
    %% Indexing Phase
    A[🌐 Web Content<br/>Lilian Weng Blog] --> B[🔍 WebBaseLoader<br/>Beautiful Soup Parsing]
    B --> C[📄 Raw Document<br/>~43k characters]
    C --> D[✂️ Text Splitter<br/>RecursiveCharacterTextSplitter<br/>chunk_size=1000, overlap=200]
    D --> E[📝 Document Chunks<br/>~66 fragments]
    E --> F[🔢 OpenAI Embeddings<br/>text-embedding-3-large]
    F --> G[🗄️ PostgreSQL + pgVector<br/>Vector Store]

    %% Query Phase
    H[❓ User Question] --> I[🔢 Question Embedding<br/>OpenAI Embeddings]
    I --> J[🔍 Similarity Search<br/>pgVector Database]
    G --> J
    J --> K[📋 Retrieved Chunks<br/>Relevant Context]
    K --> L[🤖 OpenAI GPT-4o-mini<br/>LLM Generation]
    H --> L
    L --> M[✅ Generated Answer<br/>Contextual Response]

    %% Styling
    classDef webSource fill:#e1f5fe
    classDef processing fill:#f3e5f5
    classDef storage fill:#e8f5e8
    classDef query fill:#fff3e0
    classDef output fill:#ffebee

    class A webSource
    class B,C,D,E,F processing
    class G storage
    class H,I,J,K query
    class L,M output
```
Project developed as part of learning RAG technologies with LangChain and PostgreSQL.