PDF Question Answering App

Overview

This application lets users upload PDF documents, process their content with AI, and ask a chatbot questions about the uploaded documents. Document embeddings are stored in Qdrant, a vector database, and PDF processing runs asynchronously through Celery, with Redis as the message broker.

Features

Google OAuth Login: Secure login using Google OAuth 2.0.
PDF Upload: Upload and manage your PDF documents.
Content Processing: Extract text and generate embeddings from PDFs using Unstructured.
Vector Database Storage: Store and retrieve embeddings efficiently using Qdrant.
AI Chat Interaction: Engage with an AI-powered chatbot to query the content of the PDFs.
Task Management: Use Celery and Redis to handle PDF processing asynchronously.

Screenshots

Chat Interface

Interact with the AI chatbot to ask questions about your uploaded PDFs.

File Upload

Upload your PDFs easily through the user-friendly interface.

Technology Stack

Backend: Python with Django for handling views and requests.
AI Model: NLP models for embedding generation and question answering.
Vector Database: Qdrant for efficient similarity search.
Task Queue: Celery with Redis for asynchronous task management.
Database: SQLite for storing user and conversation data.

Installation

Prerequisites

Python 3.8+
Docker and Docker Compose
Google Cloud Platform account for OAuth credentials

Steps

Clone the Repository:

git clone https://github.com/your-repo/pdf-question-answering-app.git
cd pdf-question-answering-app

Set Up Python Environment:

python3 -m venv venv
source venv/bin/activate  # On Windows use `venv\Scripts\activate`
pip install -r requirements.txt

Run Docker Services:
```
make up
```
Apply Migrations:
```
make django-migrate
```
Configure Google OAuth:
- Go to the Google Cloud Console.
- Create a new project or select an existing one.
- Navigate to APIs & Services > Credentials.
- Create OAuth 2.0 credentials and configure the consent screen.
- Add authorized redirect URIs (e.g., http://localhost:8000/accounts/google/login/callback/).
- Download the credentials JSON file and place it in your project directory.
- Set the required environment variables or add them to your settings.py:
```
SOCIAL_AUTH_GOOGLE_OAUTH2_KEY = '<your-client-id>'
SOCIAL_AUTH_GOOGLE_OAUTH2_SECRET = '<your-client-secret>'
```
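If you prefer environment variables over hard-coded values, the two settings above could be populated along these lines. This is a minimal sketch, not the project's actual configuration code: the variable names `GOOGLE_OAUTH2_KEY` and `GOOGLE_OAUTH2_SECRET` and the helper `load_google_oauth_settings` are hypothetical and should be adapted to your setup.

```python
import os

# Hypothetical environment variable names; rename them to match your setup.
KEY_VAR = "GOOGLE_OAUTH2_KEY"
SECRET_VAR = "GOOGLE_OAUTH2_SECRET"


def load_google_oauth_settings(environ=os.environ):
    """Return the two OAuth settings as a dict, failing fast if either is missing."""
    missing = [name for name in (KEY_VAR, SECRET_VAR) if not environ.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {
        "SOCIAL_AUTH_GOOGLE_OAUTH2_KEY": environ[KEY_VAR],
        "SOCIAL_AUTH_GOOGLE_OAUTH2_SECRET": environ[SECRET_VAR],
    }
```

In settings.py, calling `globals().update(load_google_oauth_settings())` would define both settings at import time and raise a clear error if a credential is missing, which keeps secrets out of version control.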
Start Django Server:
```
make django-run
```

Start Celery Worker:

celery -A myproject worker --loglevel=info

Usage

Login with Google

Access the application at http://localhost:8000.
Click on "Login with Google" to authenticate.
Once authenticated, you can upload PDFs and interact with the chatbot.

Load PDFs

After logging in, upload your PDF files.
The system will process the PDFs asynchronously, and embeddings will be stored in Qdrant.
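
To illustrate what asynchronous processing means here, the sketch below uses Python's standard `queue` and `threading` modules as a stand-in for Celery and Redis. It is not the project's code: `process_pdf` is a hypothetical placeholder for the real extraction and embedding work, and a single background thread plays the role of the Celery worker.

```python
import queue
import threading


def process_pdf(path):
    """Hypothetical placeholder for text extraction, embedding, and storage."""
    return f"processed {path}"


def worker(tasks, results):
    """Pull file paths off the queue until a None sentinel arrives."""
    while True:
        path = tasks.get()
        if path is None:
            break
        results.append(process_pdf(path))


def run_in_background(paths):
    """Enqueue every path, then wait for the worker to drain the queue."""
    tasks = queue.Queue()
    results = []
    thread = threading.Thread(target=worker, args=(tasks, results))
    thread.start()
    for path in paths:
        tasks.put(path)  # returns immediately; the worker does the slow part
    tasks.put(None)  # sentinel: no more work
    thread.join()
    return results
```

The point of the pattern is that `tasks.put` returns right away, so an upload request can finish without waiting for processing. In the real application Redis holds the queue and the Celery worker runs in a separate process.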

Chat with the AI

Navigate to the chatbot interface.
Ask questions about the uploaded PDFs.
The AI will retrieve relevant information and provide answers based on the document content.
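
The retrieval step can be pictured as a nearest-neighbour search over embeddings. The sketch below is a pure-Python illustration using cosine similarity, not Qdrant's API and not the project's code; the function names and the tiny two-dimensional vectors are assumptions made for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vector, chunks, k=2):
    """Return the texts of the k chunks whose vectors are most similar to the query.

    `chunks` is a list of (text, vector) pairs.
    """
    scored = [(cosine_similarity(query_vector, vec), text) for text, vec in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]
```

For example, with chunks `[("a", [1, 0]), ("b", [0, 1]), ("c", [1, 1])]` and the query `[1, 0]`, `top_k` returns `["a", "c"]`. A vector database performs the same kind of ranking, but with indexes that avoid comparing the query against every stored vector.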

Architecture

Backend

Django: Handles web requests, renders views, and manages user sessions.
Google OAuth: Enables secure login and user authentication.
Celery: Manages asynchronous PDF processing tasks.
Redis: Used as a message broker for Celery.

Storage and Retrieval

Qdrant: Stores embeddings for fast similarity searches.
SQLite: Stores user data and conversation history.

PDF Processing

Unstructured: Extracts text and generates embeddings from uploaded PDFs.

Docker Compose Configuration

The following services are defined in the docker-compose.yaml file:

services:
  qdrant:
    image: qdrant/qdrant:latest
    container_name: qdrant
    ports:
      - "6333:6333"
    volumes:
      - qdrant_storage:/qdrant/storage
    environment:
      QDRANT__SERVICE__GRPC_PORT: 6334
      QDRANT__LOG_LEVEL: "info"

  redis:
    image: redis:latest
    container_name: redis
    ports:
      - "6379:6379"
    volumes:
      - redis_data:/data
    command: ["redis-server", "--appendonly", "yes"]
    restart: always

volumes:
  qdrant_storage:
    driver: local
  redis_data:
    driver: local
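
Once the services are up, a quick way to confirm that the published ports are reachable is a plain TCP check. This is a minimal sketch using only the standard library; `is_port_open` is a hypothetical helper, and `localhost` assumes the containers run on the same machine.

```python
import socket


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Ports taken from the compose file above: 6333 for Qdrant, 6379 for Redis.
    for name, port in (("qdrant", 6333), ("redis", 6379)):
        state = "reachable" if is_port_open("localhost", port) else "not reachable"
        print(f"{name} on port {port}: {state}")
```

A successful connection only shows that something is listening on the port; it does not verify that the service is healthy.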
