GeneKnow Benchmark Evaluation

Supporting material for GeneKnow. This repository contains workflows, scripts, test cases, and evaluation results for benchmarking GeneKnow against other methods on biomedical literature retrieval and claim verification tasks.

Repository Structure

Directory	Description
`workflow/`	Step-by-step evaluation protocols
`scripts/`	Python/R scripts for automated extraction, verification, and evaluation
`discover_cases/`	Test cases and results for the discover benchmark
`inspect_cases/`	Test cases and results for the inspect benchmark

Data Availability

All raw method outputs and evaluation results are included in this repository under `discover_cases/` and `inspect_cases/`.
Full-text reference papers are not included for copyright and licensing reasons; reviewers should obtain them independently via the provided DOIs/PMIDs.
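
Since the papers have to be fetched by identifier, a small helper can turn DOIs and PMIDs into resolvable links. The sketch below is illustrative only: the function names are hypothetical and not part of this repository, and the sample identifiers are placeholders rather than papers from the benchmark. It relies on the public doi.org resolver and the PubMed URL scheme.

```python
from urllib.parse import quote


def doi_url(doi):
    """Build a doi.org resolver link for a DOI string (hypothetical helper)."""
    # Keep "/" unescaped because DOIs contain a prefix/suffix separator.
    return "https://doi.org/" + quote(doi.strip(), safe="/")


def pmid_url(pmid):
    """Build a PubMed link for a PMID given as an int or a string."""
    return f"https://pubmed.ncbi.nlm.nih.gov/{str(pmid).strip()}/"


if __name__ == "__main__":
    # Placeholder identifiers, not taken from the benchmark cases.
    print(doi_url("10.1000/example"))
    print(pmid_url(12345678))
```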

Workflows

Discover Benchmark Workflow — Open-domain retrieval and synopsis generation.
Inspect Benchmark Workflow — Targeted paper inspection and alignment evaluation.

Quick Start

Set the `OPENAI_API_KEY` environment variable.
Set `SCRIPTS` and `ABLATION` to the absolute paths of `scripts/eval/` and `scripts/ablation/` in this repository.
Follow the instructions in the workflow documents under `workflow/`.
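
Before starting a workflow, it can help to confirm that the three variables above are set as described. The following is a minimal sketch of such a check, not a script shipped with this repository; `check_environment` is a hypothetical name, and the only rules it enforces are the ones stated above (a non-empty API key, and absolute paths for `SCRIPTS` and `ABLATION`), plus the assumption that both paths should be existing directories.

```python
import os


def check_environment(env=None):
    """Return a list of problems found with the Quick Start variables."""
    env = os.environ if env is None else env
    problems = []

    # The API key only needs to be present and non-empty.
    if not env.get("OPENAI_API_KEY"):
        problems.append("OPENAI_API_KEY is not set")

    # SCRIPTS and ABLATION must be absolute paths; we additionally assume
    # they should point to directories that exist on disk.
    for name in ("SCRIPTS", "ABLATION"):
        value = env.get(name, "")
        if not value:
            problems.append(f"{name} is not set")
        elif not os.path.isabs(value):
            problems.append(f"{name} is not an absolute path: {value}")
        elif not os.path.isdir(value):
            problems.append(f"{name} does not point to a directory: {value}")
    return problems


if __name__ == "__main__":
    issues = check_environment()
    for issue in issues:
        print("WARNING:", issue)
    if not issues:
        print("Environment looks ready.")
```

Passing a plain dictionary instead of reading `os.environ` directly keeps the check easy to exercise without changing the real environment.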

Dependencies

Python 3.x
Install required packages: `pip install -r requirements.txt`
The ablation baseline scripts (`naive_inspect_1.py`, `naive_inspect_2.py`) require the `geneknow` Python package.
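
To see whether the ablation baselines can run on the current interpreter, one option is to test for the package without importing it. This is a rough sketch: `missing_modules` is a hypothetical helper, and it assumes the package's import name is `geneknow`, matching the name given above.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be located."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumption: the import name matches the package name "geneknow".
    missing = missing_modules(["geneknow"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("geneknow is available; the ablation baselines can be run.")
```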
