Document Processor for RAG Chatbot

This project handles document ingestion and processing for the RAG (Retrieval-Augmented Generation) chatbot. It's separate from the main chatbot deployment to keep the cloud instance clean and focused.

Directory Structure

src/: Python source code for document processing
data/input/: Place input documents here
data/output/: Processed data will be stored here

Setup

Create a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Linux/Mac
# or
.venv\Scripts\activate  # On Windows

Install dependencies:
```
pip install -r requirements.txt
```

Usage

Place documents to be processed in the data/input/ directory
Run the processing scripts from the src/ directory
The processed data will be stored in the data/output/ directory, ready for use by the RAG engine

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
check_vector_db/vector_db		check_vector_db/vector_db
documents		documents
src		src
.gitignore		.gitignore
README.md		README.md
commands.txt		commands.txt
optimized-tour-chunker.py		optimized-tour-chunker.py
optimized_tour_chunker.py		optimized_tour_chunker.py
process.sh		process.sh
process_documents.py		process_documents.py
pyproject.toml		pyproject.toml
rebuild_vector_db.py		rebuild_vector_db.py
requirements.txt		requirements.txt
run_processing.sh		run_processing.sh
run_processor.py		run_processor.py
setup.sh		setup.sh
test_vector_db.py		test_vector_db.py
test_vector_store.py		test_vector_store.py
upload_to_gcs.py		upload_to_gcs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Document Processor for RAG Chatbot

Directory Structure

Setup

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Languages

SystemicVoid/create-vectordb

Folders and files

Latest commit

History

Repository files navigation

Document Processor for RAG Chatbot

Directory Structure

Setup

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages