feat: integrate Modi OCR into Vakil Friend chatbot#455
Conversation
|
@nimkarprachi17 is attempting to deploy a commit to the CodeBlooded's projects Team on Vercel. A member of the Team first needs to authorize it. |
|
Hi @viru0909-dev , I’ve completed the implementation for the Modi OCR integration into Vakil Friend and would really appreciate a review whenever you get time. This PR adds a full end-to-end OCR-aware conversational workflow rather than just a UI upload feature. The implementation includes:
I also made sure the feature follows the existing styling/theme system and integrated it into the current Vakil Friend architecture instead of creating a separate scanner flow. Since this involved coordinated frontend state management, API integration, backend DTO/service updates, and contextual LLM reasoning support, I hope the contribution can be considered under the higher-effort/full-stack category during evaluation. Thank you! |
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
|
@nimkarprachi17 add screenshot of output. Thank you. |
|
Hi @nimkarprachi17, thank you for this contribution! The UI integration using However, during testing, we found an architectural issue with the OCR routing flow that needs to be fixed before this PR can be merged. Currently, in const response = await axios.post(`${API_BASE_URL}/ocr/modify`, formData, ...);Since To properly maintain the architecture and keep internal services isolated, please update this flow as follows:
This keeps the frontend communicating only with the Spring Boot gateway while the backend securely handles internal service communication. Once updated, please test the complete OCR upload flow locally and update the PR. Great work so far! |
|
Updated the OCR architecture flow as requested. |


Pull Request
Description
Integrated the existing Modi OCR pipeline into the Vakil Friend chatbot experience for seamless historical document analysis.
Features Added
lucide-react.jpg,.jpeg, and.pnguploads/ocr/modiFastAPI endpointdocumentContextstate management for scanned document contextocrContextBackend Changes
ChatMessageRequestDTO withocrContextNotes
Closes #381
Type of change
Checklist: