Skip to content

Angitha10/Document-Ask

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Document-Ask

This repository contains a Flask web application that allows users to upload PDF documents, extract text from them, generate preview images, and engage in a chat-like interface for asking questions about the content of the PDF. The application utilizes the PyMuPDF library for PDF processing and the Hugging Face Transformers library for question-answering.

Interface

Document-Ask Upload Interface Document-Ask Chat Interface

Features

  • Upload PDF documents.
  • Extract text from PDFs and store it for further processing.
  • Generate preview images for all pages in the PDF.
  • Implement a chat interface for users to ask questions about the PDF content.
  • Use a pre-trained question-answering model from Hugging Face to provide answers based on the extracted text.

Setup Instructions

Prerequisites

  • Python 3.x
  • Pip (Python package installer)

Installation Steps

  1. Clone the repository to your local machine:
    git clone https://github.com/Angitha10/Document-Ask.git
  2. Navigate to the project directory:
    cd Document-Ask
  3. Create a virtual environment (optional but recommended):
    python -m venv venv
  4. Activate the virtual environment:
    • On Windows:
      venv\Scripts\activate
    • On Unix or MacOS:
      source venv/bin/activate
  5. Install the required dependencies:
    pip install -r requirements.txt
  6. Run the Flask application:
    python app.py
  7. Open your web browser and go to http://127.0.0.1:5000/ to access the application.

Usage

  1. Upload a PDF document using the provided interface.
  2. The application will extract text from the PDF and generate preview images for each page.
  3. Access the chat interface by navigating to the /process/ route.
  4. Ask questions about the PDF content in the chat interface.
  5. The application will utilize a pre-trained question-answering model to provide answers based on the extracted text.

Acknowledgments

  • This project uses the Flask web framework, PyMuPDF for PDF processing, and Hugging Face Transformers for question-answering.

Feel free to contribute, report issues, or suggest improvements!

About

A web based application to perform question-answer tasks on Pdf document.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors