Hi authors,
Thank you for releasing the code for your NAACL 2025 paper. It's an exciting and timely contribution!
After reading through the README.md and exploring the repository, I’m having trouble understanding how to reproduce the experiments reported in the paper. In particular, it would be really helpful to have:
- Step-by-step instructions or commands for reproducing the main results/figures/tables from the paper.
- Expected input/output formats for main.py: what arguments (if any) does it take, and what should we expect as output?
- An example run or configuration file, if applicable, to help illustrate the full workflow.
- Clarification on Docker usage: is Docker required for reproducing the paper results, or is it just an optional setup utility?
- Details on external dependencies, such as datasets, model APIs (e.g., OpenAI, Claude, etc.), or keys that need to be set up beforehand.
If you could provide a minimal example (e.g., one command and its expected output), that would go a long way in helping the community reproduce and build upon your work.
Thanks again for sharing your code — looking forward to your guidance!
Best regards,
Zhiyuan Ning