PathwayQA

Benchmark QA dataset generation from Reactome and Evaluation with 9 LLMs
Original Reactome data found on Zenodo-PathwayQA https://zenodo.org/records/16704967

Graphical overview of prompt generation:

Install dependencies

We recommend installing all dependencies in a conda environment:

conda env create --file environment.yml

Download data

Fully parsed reaction and disease data from Reactome can be found on Zenodo [link]
Prompt and answers for the reaction and disease tasks can be found in the /data folder

Run Models

The LLM models must first be downloaded from HuggingFace. The vllm python package is required to run the script found in /run_models.

Evaluation

The evaluation scripts are run on the output files of the LLM models in order to judge whether the generated answer matches the true answer.

compare_answers_gpt.py queries GPT 4.1 to test if the generated and true answers match. For every entity in the true answer, the model determines if it is in the generated output. postprocess_validate.py converts the output into a score per reaction.
disease_agreement.py performs the validation for the disease queries using the LLM. string_match.py performs a simple string match.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PathwayQA

Install dependencies

Download data

Run Models

Evaluation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
data		data
run_models		run_models
.gitignore		.gitignore
README.md		README.md
compare_answers_gpt.py		compare_answers_gpt.py
disease_agreement.py		disease_agreement.py
environment.yml		environment.yml
postprocess_validate.py		postprocess_validate.py
query_utils_disease.py		query_utils_disease.py
string_match.py		string_match.py
supp.pdf		supp.pdf

Folders and files

Latest commit

History

Repository files navigation

PathwayQA

Install dependencies

Download data

Run Models

Evaluation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages