A semantic search system that uses OpenAI embeddings to find relevant FAQ answers and generates responses using GPT models. It includes PostgreSQL + pgvector integration for embedding storage, both a CLI interface and a REST API with JWT authentication, and a Docker setup that avoids the manual PostgreSQL and pgvector installation process.
```mermaid
graph TB
    A[User Input] --> B{Interface}
    B -->|CLI| C[main.py]
    B -->|API| D[api.py + auth via HTTP call]
    C --> E[chat_logic.py]
    D --> E
    E --> F[embedding_utilities.py]
    E --> G[database_utilities.py]
    F --> H[OpenAI Embeddings API]
    G --> I[(PostgreSQL + pgvector)]
    E --> J[OpenAI GPT API]
    I --> K[Vector Similarity Search]
    K --> E
    E --> L[Generated Response]
    L --> M[User]
```
- Python 3.11
- PostgreSQL 17
- OpenAI API key
- Create a virtual environment and install dependencies:

```shell
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
```

- Install PostgreSQL (version 17, with the development headers):

```shell
winget install PostgreSQL.PostgreSQL
```

- Install the pgvector extension (via cmd):

```shell
set PGROOT=C:\Program Files\PostgreSQL\17
set PATH=C:\Program Files\PostgreSQL\17\bin;%PATH%
set PG_CONFIG=C:\Program Files\PostgreSQL\17\bin\pg_config

:: initialize the VS Build Tools environment
"C:\Program Files (x86)\Microsoft Visual Studio\2022\BuildTools\VC\Auxiliary\Build\vcvars64.bat"

:: build and install pgvector
cd C:\temp\pgvector\pgvector-0.5.1
nmake /f Makefile.win
nmake /f Makefile.win install
```

- Create the database and tables: run the `PostgreSQL.sql` script to create the database schema and insert the FAQ data; adjust the insert statements or table names to suit your data.
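For orientation, the schema such a script typically creates might look like the sketch below. The table and column names are illustrative assumptions, so check `PostgreSQL.sql` for the actual definitions; the 1536 dimension matches OpenAI's `text-embedding-3-small` model:

```sql
-- Illustrative schema sketch; the real one lives in PostgreSQL.sql
CREATE EXTENSION IF NOT EXISTS vector;

CREATE TABLE faq (
    id SERIAL PRIMARY KEY,
    question TEXT NOT NULL,
    answer TEXT NOT NULL,
    embedding vector(1536)  -- filled in later by update_embeddings.py
);

INSERT INTO faq (question, answer) VALUES
    ('How do I reset my password?', 'Use the reset link on the login page.');
```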
- Create a `.env` file:

```
OPENAI_API_KEY=your_openai_api_key_here
DATABASE_URL=postgresql://postgres:your_password@localhost:5432/faq_database
JWT_SECRET_KEY=your_jwt_secret_here
API_PASSWORD=your_api_password_here
```

`API_PASSWORD` is the password used when requesting an API token.
- Generate embeddings:

```shell
python resources/update_embeddings.py
```

This connects to the database, finds FAQ entries without embeddings, generates embeddings using OpenAI, and updates the database.
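The core of that back-fill step can be sketched as below. The `faq` table and column names, and the choice of `psycopg2` as the driver, are assumptions for illustration rather than the script's actual code:

```python
# Sketch of the embedding back-fill loop (assumed schema: faq(id, question, embedding)).

def to_vector_literal(vector: list[float]) -> str:
    """Format a Python list as a pgvector text literal, e.g. [0.1,0.2]."""
    return "[" + ",".join(repr(x) for x in vector) + "]"

def backfill_embeddings(database_url: str, model: str = "text-embedding-3-small") -> int:
    """Embed every FAQ row that has no embedding yet; returns the number of rows updated."""
    import os
    import psycopg2              # hypothetical driver choice
    from openai import OpenAI

    client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])
    updated = 0
    with psycopg2.connect(database_url) as conn, conn.cursor() as cur:
        cur.execute("SELECT id, question FROM faq WHERE embedding IS NULL")
        for row_id, question in cur.fetchall():
            resp = client.embeddings.create(model=model, input=question)
            cur.execute(
                "UPDATE faq SET embedding = %s::vector WHERE id = %s",
                (to_vector_literal(resp.data[0].embedding), row_id),
            )
            updated += 1
    return updated
```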
- Run the application:

```shell
# CLI version
python main.py

# API version
python api.py
```

For the Docker-based setup you need:

- Docker Desktop
- Create a `.env` file (layout shown above)
- Start the containers:

```shell
docker-compose up -d
```

- Generate embeddings:

```shell
docker-compose exec app python resources/update_embeddings.py
```

- Access the API docs at http://localhost:8000/docs
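For reference, a minimal `docker-compose.yml` matching these commands might look like the following. The service names (`app`, `db`), the image tag, and the port mappings are assumptions, not the project's actual file:

```yaml
services:
  db:
    image: pgvector/pgvector:pg17   # PostgreSQL with pgvector preinstalled
    environment:
      POSTGRES_PASSWORD: your_password
      POSTGRES_DB: faq_database
    ports:
      - "5432:5432"
  app:
    build: .
    env_file: .env
    ports:
      - "8000:8000"
    depends_on:
      - db
```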
- Get an access token from the `/token` endpoint with body:
  - username: `api_user`
  - password: the value of `API_PASSWORD` in your `.env`
- Use the token in the `Authorization` header:

```
Authorization: Bearer <value>
```
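A minimal Python client for this token flow might look like the following sketch; `localhost:8000`, the `requests` library, and the form-encoded token body are assumptions:

```python
def bearer_header(token: str) -> dict:
    """Build the Authorization header expected by the API."""
    return {"Authorization": f"Bearer {token}"}

def ask(base_url: str, password: str, question: str) -> dict:
    """Fetch a token from /token, then POST the question to /ask-question."""
    import requests  # third-party HTTP client (assumption)

    token = requests.post(
        f"{base_url}/token",
        data={"username": "api_user", "password": password},
    ).json()["access_token"]

    response = requests.post(
        f"{base_url}/ask-question",
        headers=bearer_header(token),
        json={"question_text": question, "model": "gpt-4o-mini"},
    )
    return response.json()

# Example: ask("http://localhost:8000", "your_api_password_here", "my question")
```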
- POST to `/ask-question` with:

```json
{
    "question_text": "my question",
    "model": "gpt-4o-mini",
    "temperature": 0.7,
    "similarity_threshold": 0.7,
    "embedding_model": "text-embedding-3-small"
}
```

How it works:

- You submit a question via the CLI or API
- The system generates an embedding for the question using OpenAI
- A vector similarity search finds the most relevant FAQ in the PostgreSQL database
- If the similarity is above the threshold, GPT generates a response based on the FAQ context
- The response is returned to the user
- The CLI waits for the next input or terminates
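The retrieval step in the flow above can be sketched as a single pgvector query. The `faq` table and its columns are assumptions based only on this README's description; `<=>` is pgvector's cosine-distance operator:

```python
def find_best_faq(cur, question_embedding: list[float], threshold: float = 0.7):
    """Return (question, answer, similarity) for the closest FAQ entry,
    or None if nothing clears the similarity threshold."""
    # pgvector accepts a '[x,y,...]' text literal cast to the vector type
    literal = "[" + ",".join(repr(x) for x in question_embedding) + "]"
    cur.execute(
        """
        SELECT question, answer, 1 - (embedding <=> %s::vector) AS similarity
        FROM faq
        ORDER BY embedding <=> %s::vector
        LIMIT 1
        """,
        (literal, literal),
    )
    row = cur.fetchone()
    if row is None or row[2] < threshold:
        return None
    return row
```

GPT would then be prompted with the returned FAQ text as context; if `None` comes back, the system can fall back to a generic answer.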
- main.py - CLI interface
- api.py - REST API and authentication
- chat_logic.py - core question-processing logic
- api_security.py - JWT authentication
- database_utilities.py - database operations
- embedding_utilities.py - OpenAI embedding operations
- update_embeddings.py - populates embeddings
- PostgreSQL.sql - database schema and sample data
- The embeddings column is initially empty when you create the database
- Run update_embeddings.py to populate embeddings for the existing FAQ entries
- The database (PostgreSQL) and the OpenAI key must be configured in the .env file
- The Docker setup eliminates the manual PostgreSQL + pgvector installation process