Koranizer is a Python-based utility designed to streamline the process of extracting and transforming Indonesian bank statements (Rekening Koran) into structured Excel data.
- Batch Processing: Convert multiple PDF statements simultaneously.
- Text Extraction: High-speed extraction using
PyPDF2. - Automated Parsing: Uses Regular Expressions (Regex) to identify dates, transaction descriptions, and amounts.
- Excel Export: Organized output with account headers and clean transaction tables.
Koranizer/
├── src/
│ ├── pdftotxtLOOPER.py
│ └── txttoexcelLOOPER.py
├── folder_pdf_anda/ # Place source PDFs here
├── .gitignore
├── requirements.txt
└── README.md
Ensure your environment is ready before running the automation.
- Python Installation: Download and install the latest version of Python from python.org.
- Windows users: Ensure "Add Python to PATH" is checked during installation.
- macOS users: (optional) install newest version via (brew install python)
- pip Installation: read guide here (https://pip.pypa.io/en/stable/installation/)
- Required Libraries: Open your Terminal (macOS) or Command Prompt (Windows) and run:
pip install -r requirements.txt- Directory Setup:
- Download as .zip
- Extract zip in a dedicated project folder
This step converts raw PDF data into searchable text files.
- Prepare Input: Create a folder named
folder_pdf_andaand place your PDF statements there. - Configuration: Open
pdftotxtLOOPER.pyand verify the folder names in the# --- KONFIGURASI ---section.
Execution:
-
Windows:
python src/pdftotxtLOOPER.py -
macOS:
python3 src/pdftotxtLOOPER.py -
Output: Extracted
.txtfiles will appear infolder_hasil_teks.
This step parses the text data into a structured financial spreadsheet.
- Configure Paths: Open
txttoexcelLOOPER.pyand update thefolder_sumberandfolder_hasilvariables with the absolute paths of your directories.
Execution:
-
Windows:
python src/txttoexcelLOOPER.py -
macOS:
python3 src/txttoexcelLOOPER.py -
Output: Finalized
.xlsxfiles will be generated in your designated output folder.