Commit 293bdb9

Create Publications “2025-06-30-bridging-the-gap-with-retrieval-augmented-generation-making-prosthetic-device-user-manuals-available-in-marginalised-languages”
1 parent 5fd8815 commit 293bdb9

1 file changed

Lines changed: 56 additions & 0 deletions
@@ -0,0 +1,56 @@
+---
+layout: publication-single
+title: "Bridging the Gap with Retrieval-Augmented Generation: Making Prosthetic
+Device User Manuals Available in Marginalised Languages "
+abstract: Millions of people in African countries face barriers to accessing
+healthcare due to language and literacy gaps. This research tackles this
+challenge by transforming complex medical documents—in this case, prosthetic
+device user manuals—into accessible formats for underserved populations. This
+case study in cross-cultural translation is particularly pertinent/relevant
+for communities that receive donated prosthetic devices but may not receive
+the accompanying user documentation. Or, if available online, may only be
+available in formats (e.g., language and readability) that are inaccessible to
+local populations (e.g., English-language, high resource settings/cultural
+context). The approach is demonstrated using the widely spoken Pidgin dialect,
+but our open-source framework has been designed to enable rapid and easy
+extension to other languages/dialects. This work presents an AI powered
+framework designed to process and translate complex medical documents, e.g.,
+user manuals for prosthetic devices, into marginalised languages. The system
+enables users—such as healthcare workers or patients—to upload
+English-language medical equipment manuals, pose questions in their native
+language, and receive accurate, localised answers in real time. Technically,
+the system integrates a Retrieval-Augmented Generation (RAG) pipeline for
+processing and semantic understanding of the uploaded manuals. It then employs
+advanced Natural Language Processing (NLP) models for generative question
+answering and multilingual translation. Beyond simple translation, it ensures
+accessibility to device instructions, treatment protocols, and safety
+information, empowering patients and clinicians to make informed healthcare
+decisions. This framework supports integrating additional languages, making it
+adaptable to a wide range of global health challenges, including public health
+campaigns and disaster relief, where accurate communication in native
+languages can save lives. With far-reaching implications, this research serves
+as an interim solution for health organisations/providers in such communities
+and a call to action for policymakers and governments of the Global South to
+reduce inequities in accessing critical medical information. Overall, it
+offers a vision of healthcare that empowers marginalised communities, fosters
+trust, and ensures no one is left behind due to language or literacy barriers.
+published: 2025-06-30
+authors:
+internal_authors:
+- Soumya Banerjee
+external_authors:
+- family: Harthikote Nagaraja
+given: "Vikranth "
+- family: Kenney
+given: Laurence
+- family: Dasgupta
+given: " Abhishek "
+- family: Davidson
+given: Lesley
+- family: Ogbonna
+given: "Ikechukwu "
+details:
+html: https://arxiv.org/abs/2506.23958
+pdf: https://arxiv.org/pdf/2506.23958
+container-title: Data Science Africa 2025 Workshop
+---
