SunnySydney123/AIBasedExtraction

AI-Based Extraction

A simple Python application in two parts:

  • main.py watches the Input folder. Any document dropped in there is picked up and forwarded to an Azure OpenAI endpoint (configured in config.py) with the prompt and required extraction fields.
  • validation.py is used to validate and update any extracted fields if necessary. The finalized data is then appended to a single CSV file.
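The folder-watching step described above can be sketched with a simple polling loop. This is only an illustration using the standard library, not the code in main.py: the names find_new_files and watch, the poll interval, and the handle callback (which stands in for the call to the Azure OpenAI endpoint) are all assumptions.

```python
import time
from pathlib import Path
from typing import Callable, List, Optional, Set


def find_new_files(input_dir: Path, seen: Set[str]) -> List[Path]:
    """Return files in input_dir that were not seen before, and remember them."""
    new_files = []
    for path in sorted(input_dir.iterdir()):
        if path.is_file() and path.name not in seen:
            seen.add(path.name)
            new_files.append(path)
    return new_files


def watch(
    input_dir: Path,
    handle: Callable[[Path], None],
    poll_seconds: float = 2.0,
    max_polls: Optional[int] = None,
) -> None:
    """Poll input_dir and pass each new file to handle (hypothetical callback).

    max_polls=None keeps polling forever; a number stops after that many polls.
    """
    seen: Set[str] = set()
    polls = 0
    while max_polls is None or polls < max_polls:
        for path in find_new_files(input_dir, seen):
            handle(path)  # in the real app this is where the document would be forwarded
        polls += 1
        time.sleep(poll_seconds)
```

For instance, calling watch(Path("Input"), print) would print the path of each document as it appears in the Input folder. Polling keeps the sketch dependency-free; a file-system event library could be used instead.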

📦 Requirements

  • Python 3.x
  • Use the provided requirements.txt file to install dependencies.

🚀 How to Run

  1. Clone the repository:
    git clone https://github.com/SunnySydney123/AIBasedExtraction.git
    cd AIBasedExtraction
    Rename config_copy.py to config.py, then update config.py with your Azure OpenAI API keys.
  2. (Optional) Create a virtual environment: python -m venv venv
  3. Activate the virtual environment. On Windows: venv\Scripts\activate
  4. Install dependencies: pip install -r requirements.txt
  5. Run the application: python main.py
  6. Run validation: python validation.py

Drop your images into the Input folder. Once main.py has processed the files, run validation.py to validate the extracted results. The extracted data is then available in a CSV file in the data folder.
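Appending validated records to a single CSV file could look roughly like the sketch below. It is an assumption-laden illustration, not the code in validation.py: the function name append_record, the file path, and the field names in the usage note are hypothetical.

```python
import csv
from pathlib import Path
from typing import Dict, List


def append_record(csv_path: Path, record: Dict[str, str], fieldnames: List[str]) -> None:
    """Append one validated record to csv_path, writing the header only once."""
    csv_path.parent.mkdir(parents=True, exist_ok=True)
    # Write the header only when the file is new or empty.
    write_header = not csv_path.exists() or csv_path.stat().st_size == 0
    with csv_path.open("a", newline="", encoding="utf-8") as f:
        # extrasaction="ignore" drops keys that are not in fieldnames;
        # missing keys are written as empty strings.
        writer = csv.DictWriter(f, fieldnames=fieldnames, extrasaction="ignore")
        if write_header:
            writer.writeheader()
        writer.writerow(record)
```

A call such as append_record(Path("data/results.csv"), {"name": "Jane", "date": "2024-01-01"}, ["name", "date"]) would add one row, creating the file and its header on first use. The path and field names here are placeholders.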

🙌 Author

Sunil Sharma

Sn Sales Consultant (Tungsten Australia)

About

Python application to extract and validate handwritten forms using Azure OpenAI models
