A simple Python application in two parts:
- main.py watches the
Inputfolder. Any document dropped in there is picked up and forwarded to an Azure OpenAI endpoint (configured inconfig.py) with the prompt and required extraction fields. - validation.py is used to validate and update any extracted fields if necessary. The finalized data is then appended to a single CSV file.
- Python 3.x
- Use the provided
requirements.txtfile to install dependencies.
- Clone the repository:
git clone https://github.com/SunnySydney123/AIBasedExtraction.git cd AIBasedExtraction rename config_copy.py to config.py Update the config.py with your Open AI API keys. - (Optional) Create a virtual environment: python -m venv venv
- Activate the virtual environment: .ON WIndows venv\Scripts\activate
- Install dependencies pip install -r requirements.txt
- Run the application python main.py
- Run Validation python validation.py
Drop your images into the Input Folder. Once the main.exe has processed the files, you can run the validation.exe to validate the extracted results. The extracted data will be available in csv file in the data folder
🙌 Author
Sunil Sharma
Sn Sales Consultant (Tungsten Australia)