[feat]【开源实习】基于MindSpore NLP实现DeepSeek-OCR文本识别与结构化解析可交互DEMO开发 #2064#49
Open
lyyyym wants to merge 2 commits intomindspore-lab:devfrom
Open
[feat]【开源实习】基于MindSpore NLP实现DeepSeek-OCR文本识别与结构化解析可交互DEMO开发 #2064#49lyyyym wants to merge 2 commits intomindspore-lab:devfrom
lyyyym wants to merge 2 commits intomindspore-lab:devfrom
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
任务描述:DeepSeek-OCR 文本识别与结构化解析可交互 DEMO
一、任务概述
基于 MindSpore 2.7.0 + MindNLP 0.5.1,参考 https://huggingface.co/spaces/khang119966/DeepSeek-OCR-DEMO,在华为 Ascend NPU 910B 上实现 DeepSeek-OCR 多场景文本识别与结构化解析的可交互 DEMO,支持流式生成与性能优化。
二、实现内容
基于 Gradio 构建 Web 交互界面,支持以下功能:
将模型输出改为流式生成模式,核心实现:
实施优化方案,并提供优化前后实测数据对比。