Supporting material for GeneKnow. This repository contains workflows, scripts, test cases, and evaluation results for benchmarking GeneKnow against other methods on biomedical literature retrieval and claim verification tasks.
| Directory | Description |
|---|---|
workflow/ |
Step-by-step evaluation protocols |
scripts/ |
Python/R scripts for automated extraction, verification, and evaluation |
discover_cases/ |
Test cases and results for the discover benchmark |
inspect_cases/ |
Test cases and results for the inspect benchmark |
- All raw method outputs and evaluation results are included in this repository under
discover_cases/andinspect_cases/. - Full-text reference papers are not included for copyright and licensing reasons; reviewers should obtain them independently via the provided DOIs/PMIDs.
- Discover Benchmark Workflow — Open-domain retrieval and synopsis generation.
- Inspect Benchmark Workflow — Targeted paper inspection and alignment evaluation.
- Set the
OPENAI_API_KEYenvironment variable. - Set
SCRIPTSandABLATIONto the absolute paths ofscripts/eval/andscripts/ablation/in this repository. - Follow the instructions in the workflow documents under
workflow/.
- Python 3.x
- Install required packages:
pip install -r requirements.txt - The ablation baseline scripts (
naive_inspect_1.py,naive_inspect_2.py) require thegeneknowPython package.