An Alibaba open-source multi-language benchmark for evaluating LLMs in repository-level automatic code review, featuring an AI-assisted and expert-verified dataset.
-
Updated
Mar 16, 2026 - Python
An Alibaba open-source multi-language benchmark for evaluating LLMs in repository-level automatic code review, featuring an AI-assisted and expert-verified dataset.
🛠️ Enhance code quality with structured reviews, focusing on architecture, security, performance, and code hygiene for robust development.
Add a description, image, and links to the repository-level-context topic page so that developers can more easily learn about it.
To associate your repository with the repository-level-context topic, visit your repo's landing page and select "manage topics."