Skip to content

Cloud OCR using 3rd party API #639

@aldavies1

Description

@aldavies1

Describe the pain point and your solution
Could you add a function to select 3rd party API tools (user bringing own key) to perform the OCR? E.g. Claude, Groq, OpenAI etc? I'd like to keep the existing on-device options but having cloud OCR option could be quite useful for those of us who don't have a copilot+ PC so can't use the new windows AI framework.

Mode which would include change

  • Settings Window
  • General

Describe alternatives you've tried or considered
This approach is similar to applications such as Open Whispr, and the following screenshot shows an example of how this application has implemented this kind of functionality.

Screenshots or sketches

Image Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions