An automated, multi-threaded Python command-line utility designed to scrape, download, and compile webcomics and digital comic book chapters into organized PDF files.
- Multi-threaded Downloads: Utilizes `ThreadPoolExecutor` to download asset images concurrently, optimizing network bandwidth.
- Automatic PDF Compilation: Compiles downloaded chapter images directly into single-chapter PDF documents using `img2pdf`.
- Chapter Merging: Merges multiple sequential chapter PDFs into a single combined volume using `PyPDF2`.
- Flexible Batch Processing: Reads targets (comic slug name, chapter range) from a YAML configuration file (`manhwa_list.yml`).
- Caching Mechanism: Proactively checks for existing files to prevent redundant downloads.
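
The concurrent download and caching features above can be sketched as follows. This is a minimal illustration, not the code in `download_manhwa.py`: the function names (`fetch_bytes`, `download_image`, `download_chapter`), the zero-padded file naming, and the worker count are all assumptions made for the example. The `fetch` parameter is injectable so the logic can be exercised without network access.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, Iterable, List
from urllib.parse import urlparse
from urllib.request import urlopen


def fetch_bytes(url: str) -> bytes:
    """Default fetcher: a plain HTTP GET using only the standard library."""
    with urlopen(url, timeout=30) as response:
        return response.read()


def download_image(url: str, dest: Path,
                   fetch: Callable[[str], bytes] = fetch_bytes) -> bool:
    """Save url to dest. Returns False on a cache hit (non-empty file exists)."""
    if dest.exists() and dest.stat().st_size > 0:
        return False  # already downloaded, skip the network round trip
    dest.parent.mkdir(parents=True, exist_ok=True)
    dest.write_bytes(fetch(url))
    return True


def download_chapter(urls: Iterable[str], out_dir: str,
                     fetch: Callable[[str], bytes] = fetch_bytes,
                     max_workers: int = 8) -> List[Path]:
    """Download all page images of one chapter concurrently (hypothetical helper)."""
    targets = []
    for index, url in enumerate(urls, start=1):
        # Keep the extension from the URL path; fall back to .jpg (assumption).
        suffix = Path(urlparse(url).path).suffix or ".jpg"
        targets.append((url, Path(out_dir) / f"{index:03d}{suffix}"))

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # list() waits for every task and re-raises the first download error.
        list(pool.map(lambda t: download_image(t[0], t[1], fetch), targets))
    return [dest for _, dest in targets]
```

Because downloads are I/O-bound, threads are usually sufficient here; running `download_chapter` a second time on the same directory should only touch the filesystem, since every page is a cache hit.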
- `download_manhwa.py`: Core script containing web scraping, image downloading, and PDF compilation logic.
- `manhwa_list.yml`: Configuration file specifying download tasks.
- `requirements.txt`: Standard Python package dependencies list.
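
As a rough idea of what the PDF compilation and merging logic might look like, here is a hedged sketch. The helper names (`natural_key`, `compile_chapter`, `merge_chapters`) and the accepted image extensions are hypothetical, and the actual script may be organized differently. It assumes `img2pdf` and a PyPDF2 release that provides `PdfMerger` (older releases name it `PdfFileMerger`); both are imported lazily so the page-ordering helper works without them.

```python
import re
from pathlib import Path


def natural_key(path):
    """Sort key that orders '2.jpg' before '10.jpg' by comparing digit runs as ints."""
    return [int(part) if part.isdigit() else part.lower()
            for part in re.split(r"(\d+)", Path(path).name)]


def compile_chapter(image_dir, pdf_path):
    """Combine the images in image_dir into one PDF (requires img2pdf)."""
    import img2pdf  # third-party; imported lazily

    images = sorted(
        (p for p in Path(image_dir).iterdir()
         if p.suffix.lower() in {".jpg", ".jpeg", ".png"}),  # assumed extensions
        key=natural_key,
    )
    if not images:
        raise ValueError(f"no page images found in {image_dir}")
    # img2pdf.convert accepts a list of file paths and returns the PDF as bytes.
    Path(pdf_path).write_bytes(img2pdf.convert([str(p) for p in images]))
    return Path(pdf_path)


def merge_chapters(chapter_pdfs, volume_path):
    """Append chapter PDFs in the given order into one volume (requires PyPDF2)."""
    from PyPDF2 import PdfMerger  # assumption: PyPDF2 version exposing PdfMerger

    merger = PdfMerger()
    try:
        for pdf in chapter_pdfs:
            merger.append(str(pdf))
        merger.write(str(volume_path))
    finally:
        merger.close()
    return Path(volume_path)
```

Natural sorting matters because a plain lexicographic sort would place page `10.jpg` before `2.jpg` and scramble the page order in the resulting PDF.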
- Clone the repository:

      git clone https://github.com/DileepKumarD/DownloadManhwa.git
      cd DownloadManhwa

- Install dependencies:

      pip install -r requirements.txt

- Configure your download list: Edit `manhwa_list.yml` to specify targets. Each item has the comma-separated form `comic-slug,start-chapter,end-chapter,download-name`, and multiple items are separated by spaces.

- Run the compiler:

      python download_manhwa.py
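
To make the item format from the configuration step concrete, the sketch below parses a string of space-separated items, each of the form `comic-slug,start-chapter,end-chapter,download-name`. It is only an illustration: `DownloadTask` and `parse_tasks` are hypothetical names, the example assumes integer chapter numbers and an inclusive range, and it starts from the text after it has been read out of `manhwa_list.yml` (how the script loads the YAML file is not shown here).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class DownloadTask:
    """One download target (hypothetical structure for illustration)."""
    slug: str
    start_chapter: int
    end_chapter: int
    download_name: str

    def chapters(self) -> range:
        # Assumption: the end chapter is included in the range.
        return range(self.start_chapter, self.end_chapter + 1)


def parse_tasks(text: str) -> List[DownloadTask]:
    """Parse space-separated items of comma-separated fields into tasks."""
    tasks = []
    for item in text.split():  # items are separated by whitespace
        fields = item.split(",")
        if len(fields) != 4:
            raise ValueError(f"expected 4 comma-separated fields, got: {item!r}")
        slug, start, end, name = fields
        start_n, end_n = int(start), int(end)  # assumes integer chapter numbers
        if start_n > end_n:
            raise ValueError(f"start chapter after end chapter in: {item!r}")
        tasks.append(DownloadTask(slug, start_n, end_n, name))
    return tasks


if __name__ == "__main__":
    # Made-up slugs and names, used only to show the shape of the input.
    for task in parse_tasks("some-comic,1,3,SomeComic other-comic,10,12,Other"):
        print(task.download_name, list(task.chapters()))
```

Validating each item up front means a malformed entry is reported before any downloads begin, rather than partway through a batch.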