feat: support multiple OCR parameter sets and improve text selection accuracy by tvone · Pull Request #86 · SWHL/RapidVideOCR

tvone · 2025-07-10T13:24:27Z

Changes

Added support for multiple OCR parameter sets (ocr_params_list) instead of a single config.
Improved detection accuracy by testing various configurations (e.g., Det.limit_side_len min/max) to reduce missed detections of short or long text lines within images.
Multiprocessing support (multiprocessing.Pool) can optionally be used to speed up processing by running different parameter sets in parallel.

Notes

Default behavior remains the same if ocr_params_list is not provided.
Parallel execution is optional and only applied if explicitly implemented by the user.
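
As a rough illustration of the two notes above, here is a minimal sketch of the fallback and the optional parallel path. Everything in it is hypothetical: `fake_ocr`, `recognize`, `DEFAULT_PARAMS` and the canned results are stand-ins, not the PR's code or the RapidOCR API. It uses `multiprocessing.pool.ThreadPool`, which exposes the same `map` interface as `multiprocessing.Pool`, so the sketch runs anywhere without pickling concerns.

```python
from multiprocessing.pool import ThreadPool
from typing import Any, Dict, List, Optional

# Hypothetical: an empty dict stands for "use the library defaults".
DEFAULT_PARAMS: Dict[str, Any] = {}


def fake_ocr(task):
    """Stand-in for one OCR run: (image, params) -> recognized text.

    The "image" here is just a lookup table of canned results keyed by
    Det.limit_side_len, so the example stays deterministic.
    """
    img, params = task
    limit = params.get("Det.limit_side_len", "default")
    return img.get(limit, "")


def recognize(img, ocr_params_list: Optional[List[Dict[str, Any]]] = None, pool=None) -> str:
    # No list (or an empty one): fall back to a single default config,
    # which keeps the single-config behavior unchanged.
    params_list = ocr_params_list or [DEFAULT_PARAMS]
    tasks = [(img, params) for params in params_list]
    if pool is not None and len(tasks) > 1:
        # Parallel path: only taken when the caller supplies a pool.
        results = pool.map(fake_ocr, tasks)
    else:
        results = [fake_ocr(task) for task in tasks]
    # Keep the longest text; max() returns the first one on ties.
    return max(results, key=len)


canned = {"default": "Hello", 320: "Hello", 1280: "Hello, long subtitle"}

print(recognize(canned))
with ThreadPool(2) as pool:
    configs = [{"Det.limit_side_len": 320}, {"Det.limit_side_len": 1280}]
    print(recognize(canned, configs, pool))
```

Swapping in `multiprocessing.Pool` would additionally require the worker function to be picklable (defined at module level) and, on platforms that spawn processes, an `if __name__ == "__main__":` guard around the pool creation.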

Copilot

Pull Request Overview

Enhance OCRProcessor to accept multiple OCR configurations and select the most comprehensive text result per image, adjusting the entry point to use the new ocr_params_list.

Refactored OCRProcessor.__init__ and get_ocr_result to handle a list of parameter sets.
Updated single_rec to iterate over all engines and pick the longest text output.
Changed RapidVideOCRInput and RapidVideOCR in main.py to use ocr_params_list.
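
The selection step described above could look roughly like the following. This is a sketch under assumptions, not the code in `ocr_processor.py`: the `Engine` type, `pick_longest_text` and the two stand-in engines are hypothetical, and "longest" is measured here as the character count of the joined lines, which may differ from what the PR measures.

```python
from typing import Callable, Optional, Sequence, Tuple

# Hypothetical engine type: takes an image path, returns recognized lines.
Engine = Callable[[str], Sequence[str]]


def pick_longest_text(img_path: str, engines: Sequence[Engine]) -> Tuple[str, Optional[int]]:
    """Run every engine on one image and keep the longest joined text."""
    best_text: str = ""
    best_idx: Optional[int] = None
    max_txt_len = 0
    for idx, engine in enumerate(engines):
        text = "\n".join(engine(img_path))
        # Strictly greater, so the earlier engine wins on a tie.
        if max_txt_len < len(text):
            max_txt_len = len(text)
            best_text, best_idx = text, idx
    return best_text, best_idx


# Stand-in engines imitating one config that misses a line and one that does not.
def engine_small(_path: str) -> Sequence[str]:
    return ["Hello"]


def engine_large(_path: str) -> Sequence[str]:
    return ["Hello", "world, this is a long line"]


text, idx = pick_longest_text("frame_0001.jpg", [engine_small, engine_large])
print(idx, repr(text))
```

Note that every engine runs on every image, so cost grows linearly with the number of parameter sets.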

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
rapid_videocr/ocr_processor.py	Support multiple OCR parameter sets in single_rec, updated engine initialization and result logic.
rapid_videocr/main.py	Replaced `ocr_params` with `ocr_params_list` in input schema and processor instantiation.

Comments suppressed due to low confidence (1)

rapid_videocr/ocr_processor.py:61

Add unit tests to verify that single_rec selects the longest text among multiple OCR parameter sets, covering this conditional branch.

                if max_txt_len < len(txts):
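
A table-driven test for this branch might be sketched as follows. The helper `select_longest` is a hypothetical reduction of the comparison quoted above, not the actual `single_rec`, and the sketch uses the standard library's `unittest.subTest` in place of a pytest parametrized test so that it has no third-party dependency.

```python
import unittest


def select_longest(candidates):
    """Hypothetical helper: return the longest text; earliest wins on ties."""
    best = ""
    max_txt_len = 0
    for txts in candidates:
        # Same comparison as the reviewed line: replace only on strictly longer.
        if max_txt_len < len(txts):
            max_txt_len = len(txts)
            best = txts
    return best


class SelectLongestTest(unittest.TestCase):
    # (candidate texts from each parameter set, expected selection)
    CASES = [
        (["short", "a much longer line"], "a much longer line"),
        (["a much longer line", "short"], "a much longer line"),
        (["same", "size"], "same"),  # tie: first candidate is kept
        (["", ""], ""),              # nothing detected by any config
        ([], ""),                    # no configs at all
    ]

    def test_selects_longest(self):
        for candidates, expected in self.CASES:
            with self.subTest(candidates=candidates):
                self.assertEqual(select_longest(candidates), expected)
```

Saved to a file, this can be run with `python -m unittest <file>`.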


SWHL · 2025-07-22T10:30:00Z

Thanks, and I will merge it later.


tvone · 2025-07-22T11:25:51Z

> Thanks, and I will merge it later.

Thank you.

SWHL · 2025-09-10T01:30:42Z

I'm really sorry for the late response; I've only just found the time to look into RapidVideOCR recently. I've carefully reviewed your submission. The core idea is to instantiate multiple OCR instances with different configurations simultaneously, perform multiple recognitions on the same image, and select the longest result as the final output.

However, this approach would lead to increased resource consumption. Out of curiosity, could you please share what specific scenario prompted you to consider this solution?

tvone force-pushed the feature branch 2 times, most recently from a8f7a86 to a2fa3f4 on July 10, 2025 17:09

SWHL requested a review from Copilot July 11, 2025 00:04

Copilot AI reviewed Jul 11, 2025


tvone added 6 commits July 22, 2025 18:08

7dead4c	feat: support multiple OCR parameter sets and improve text selection accuracy
857e189	feat: Support multiple OCR parameter sets in batch_rec and select best text per image
0dfd9d0	rename get_ocr_result to get_ocr_result
42a0d01	update comments
41de404	Optimize batch_rec for single OCR config
4bf1530	test: add parametrize test for OCR with multiple configs

tvone force-pushed the feature branch from 07251e6 to 4bf1530 on July 22, 2025 11:12

SWHL added the feature_request (new feature request) label Sep 10, 2025

tvone wants to merge 6 commits into SWHL:main from tvone:feature


3 participants
